Aug. 8, 2023, 2:37 p.m. | Unconventional Coding

Unconventional Coding www.youtube.com

CPU-Llama: https://github.com/unconv/cpu-llama
Llama 2 Flask API: https://github.com/unconv/cpu-llama

In this video I show you how you can run the Llama 2 language model on CPU (without a GPU)

Support: https://buymeacoffee.com/unconv
Consultations: https://www.buymeacoffee.com/unconv/e/146735
Memberships: https://www.buymeacoffee.com/unconv/membership

00:00 Launching EC2 Instance
02:57 Installing Llama 2
03:49 Installing Llama 2 Flask API
04:21 Modify Llama 2 Repo for CPU
07:17 Running Llama 2 on CPU
08:00 Trying to run it locally

api cpu ec2 flask gpu instance language language model llama llama 2 running show video

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

AI Engineering Manager

@ M47 Labs | Barcelona, Catalunya [Cataluña], Spain