[P] Llama2 inference in a single file of pure Mojo
Sept. 14, 2023, 12:32 p.m. | /u/Albatross9855
Machine Learning www.reddit.com
I was really excited when Mojo became publicly available and started thinking about which project I could implement to learn Mojo concepts. Since I had already ported llama2.c to [pure Python](https://www.reddit.com/r/MachineLearning/comments/15qcbel/p_llama2py/), I decided to port llama2.py to Mojo next. Here is what I got.
First round of the llama2.c vs llama2.🔥 battle: Mojo demonstrated 20% better performance than C in single-threaded execution of llama2 inference, and 250x better performance than Python.
https://i.redd.it/0gcwwfc2r7ob1.gif
For reference …