all AI news
[P] Llama2 inference in a single file of pure Mojo
Sept. 14, 2023, 12:32 p.m. | /u/Albatross9855
Machine Learning www.reddit.com
I was really excited that Mojo became publicly available and thinking which project can I implement to learn Mojo concepts. Since I have already ported llama2.c to [pure Python](https://www.reddit.com/r/MachineLearning/comments/15qcbel/p_llama2py/), I decided why not try to port llama2.py to Mojo now.. And here is what I got
First round of llama2.c vs llama2.🔥 battle. Mojo demonstrated 20% better performance than C in a single threaded execution of llama2 inference and 250x times better performance than Python
https://i.redd.it/0gcwwfc2r7ob1.gif
For reference …
inference llama2 machinelearning mojo operations performance python reference vectorization
More from www.reddit.com / Machine Learning
Jobs in AI, ML, Big Data
Senior AI/ML Developer
@ Lemon.io | Remote
Consultant(e) Confirmé(e) Power BI & Azure - H/F
@ Talan | Lyon, France
Research Manager-Data Science
@ INFICON | East Syracuse, NY, United States
Data Scientist
@ Ubisoft | Singapore, Singapore
Data Science Assistant – Stage Janvier 2024 (F/H/NB)
@ Ubisoft | Paris, France
Data Scientist
@ dentsu international | Milano, Italy