Dec. 31, 2023, 9:43 p.m. | /u/brownmamba94

Machine Learning www.reddit.com

Hey fellow ML enthusiasts,

I've been working on an exciting project and wanted to share my progress with you. I successfully ported Andrej Karpathy's nanoGPT framework to MLX, Apple's new machine learning framework. This has opened up some intriguing possibilities for running GPT models on Mac GPUs.

Code: [https://github.com/vithursant/nanoGPT\_mlx](https://github.com/vithursant/nanoGPT_mlx)

**Details:**

* **Hardware:** MacBook with M3 Pro chip (11-core CPU, 14-core GPU, 18 GB unified memory)
* **Performance:** Pre-training a 45M-parameter character-level GPT-2 model on the Shakespeare dataset runs at 0.37 iterations/second.
* …
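
To put the reported throughput in perspective, here is a rough back-of-the-envelope sketch that converts iterations/second into time per iteration and an estimated wall-clock time. Only the 0.37 iterations/second figure comes from the post; the function names and the 5,000-iteration run length are hypothetical and chosen purely for illustration.

```python
def seconds_per_iteration(iters_per_second: float) -> float:
    """Convert a throughput in iterations/second to seconds per iteration."""
    if iters_per_second <= 0:
        raise ValueError("iters_per_second must be positive")
    return 1.0 / iters_per_second


def estimated_hours(num_iters: int, iters_per_second: float) -> float:
    """Estimate wall-clock hours for num_iters at a constant throughput."""
    return num_iters * seconds_per_iteration(iters_per_second) / 3600.0


# Throughput reported in the post.
REPORTED_RATE = 0.37

# Hypothetical run length, NOT taken from the post.
HYPOTHETICAL_ITERS = 5000

print(f"{seconds_per_iteration(REPORTED_RATE):.2f} s per iteration")
print(f"{estimated_hours(HYPOTHETICAL_ITERS, REPORTED_RATE):.2f} h "
      f"for {HYPOTHETICAL_ITERS} iterations")
```

At 0.37 iterations/second, each iteration takes roughly 2.7 seconds. The estimate assumes the rate stays constant over the whole run, which may not hold in practice (evaluation passes, checkpointing, and thermal throttling can all change it).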
