all AI news
[N] Language Processing Unit (LPU) makes inference of LLMs 10x faster
Feb. 21, 2024, 6:32 p.m. | /u/vvkuka
Machine Learning www.reddit.com
For the **comparison**:
* “According to Groq, in similar tests, ChatGPT loads at 40-50 tokens per second, and Bard at 70 tokens per second on typical GPU-based computing systems.
* Context for 100 tokens per second per …
billion comparison faster groq inference language language processing llama llms machinelearning mixtral parameters per processing running speed tokens
More from www.reddit.com / Machine Learning
[D] Does DSPy actually change the LM weights?
1 day, 1 hour ago |
www.reddit.com
[D] Culture of Recycling Old Conference Submissions in ML
1 day, 4 hours ago |
www.reddit.com
Jobs in AI, ML, Big Data
Software Engineer for AI Training Data (School Specific)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Python)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Tier 2)
@ G2i Inc | Remote
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US