all AI news
I made my own batching/caching API over the weekend. 200+ tk/s with Mistral 5.0bpw esl2 on an RTX 3090. It was for a personal project, and it's not complete, but happy holidays! It will probably just run in your LLM Conda env without installing anyth
More from www.reddit.com / Ai Prompt Programming
Voice chatting with llama 3 8B
5 days, 7 hours ago |
www.reddit.com
Llama 3 benchmark is out 🦙🦙
1 week, 1 day ago |
www.reddit.com
Open Interface - Control Any Computer Using GPT-4V
1 week, 2 days ago |
www.reddit.com
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Global Data Architect, AVP - State Street Global Advisors
@ State Street | Boston, Massachusetts
Data Engineer
@ NTT DATA | Pune, MH, IN