[P] Llama2 inference in a single file of pure Mojo | allainews.com

Sept. 14, 2023, 12:32 p.m. | /u/Albatross9855

Machine Learning www.reddit.com

Hi everyone!

I was really excited that Mojo became publicly available and thinking which project can I implement to learn Mojo concepts. Since I have already ported llama2.c to [pure Python](https://www.reddit.com/r/MachineLearning/comments/15qcbel/p_llama2py/), I decided why not try to port llama2.py to Mojo now.. And here is what I got

First round of llama2.c vs llama2.🔥 battle. Mojo demonstrated 20% better performance than C in a single threaded execution of llama2 inference and 250x times better performance than Python

https://i.redd.it/0gcwwfc2r7ob1.gif

For reference …

inference llama2 machinelearning mojo operations performance python reference vectorization

More from www.reddit.com / Machine Learning

[R] A Primer on the Inner Workings of Transformer-based Language Models 4 hours ago | www.reddit.com

abstract advanced authors insights +9

[Discussion] Should I go to ICML and present my paper? 16 hours ago | www.reddit.com

academia data data scientist future +10

[P] Panza: A personal email assistant, trained and running on-device 16 hours ago | www.reddit.com

assistant automated email emails +9

[Discussion] Seeking help to find the better GPU setup. Three H100 vs Five A100? 18 hours ago | www.reddit.com

70b a100 budget five +9

[D] Something I always think about, for top conferences like ICML, NeurIPS, CVPR,..etc. How many … 19 hours ago | www.reddit.com

conferences cvpr etc good +8

[D] Benchmark creators should release their benchmark datasets in stages 20 hours ago | www.reddit.com

benchmark benchmarks concerns data +11

[P] spRAG - Open-source RAG implementation for challenging real-world tasks 21 hours ago | www.reddit.com

core hey implementation machinelearning +7

[D] Paper accepted to ICML but not attending in person? 23 hours ago | www.reddit.com

authors conference icml machinelearning +6

[D] Why do juniors (undergraduates or first- to second-year PhD students) have so many papers … 1 day, 1 hour ago | www.reddit.com

academic conferences etc hello +12

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Senior Machine Learning Engineer

@ Samsara | Canada - Remote

View on ai-jobs.net