all AI news
[P] Ported nanoGPT to Apple's new MLX framework: Early Results on Macbook M3 Pro GPU
Dec. 31, 2023, 9:43 p.m. | /u/brownmamba94
Machine Learning www.reddit.com
I've been working on an exciting project and wanted to share my progress with you. I successfully ported Andrej Karpathy's nanoGPT framework into Apple's new machine learning framework, MLX. This has opened up some intriguing possibilities for running GPT models on Mac GPUs.
Code: [https://github.com/vithursant/nanoGPT\_mlx](https://github.com/vithursant/nanoGPT_mlx)
**Details:**
* **Hardware:** Macbook M3 Pro with 11-core CPU, 14-core GPU, 18GB Unified Memory
* **Performance:** Pre-training a 45M parameter character-level GPT-2 model on the Shakespeare dataset at 0.37 iterations/second.
* …
andrej karpathy apple framework gpt gpu hey m3 pro macbook machine machine learning machinelearning mlx nanogpt progress project running
More from www.reddit.com / Machine Learning
[D] Geometrical meaning of Layer Normalization
1 day, 1 hour ago |
www.reddit.com
How are large network attack datasets made? [p]
1 day, 2 hours ago |
www.reddit.com
Jobs in AI, ML, Big Data
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne