all AI news
BEST LLMs for Coding, Long Context, Overall Perform
April 23, 2024, noon | code_your_own_AI
code_your_own_AI www.youtube.com
New LLM benchmark especially to judge and evaluate the performance of new LLMs on hard to solve tasks: ARENA HARD (Github).
#airesearch
#ai
#best
april benchmark benchmarks coding context data judge llm llm benchmark llms multiple performance show solve tasks top 10
More from www.youtube.com / code_your_own_AI
New Discovery: Retrieval Heads for Long Context
1 day, 7 hours ago |
www.youtube.com
Multi-Token Prediction (forget next token LLM?)
2 days, 7 hours ago |
www.youtube.com
NEW LLM Test: Reasoning & gpt2-chatbot
3 days, 12 hours ago |
www.youtube.com
LLMs: Rewriting Our Tomorrow (plus code) #ai
4 days, 19 hours ago |
www.youtube.com
Autonomous AI Agents: 14 % MAX Performance
6 days, 7 hours ago |
www.youtube.com
480B LLM as 128x4B MoE? WHY?
1 week, 1 day ago |
www.youtube.com
No more Fine-Tuning: Unsupervised ICL+
1 week, 2 days ago |
www.youtube.com
NEW Phi-3 mini 3.8B LLM for Your PHONE: 1st TEST
1 week, 3 days ago |
www.youtube.com
BEST LLMs for Coding, Long Context, Overall Perform
1 week, 4 days ago |
www.youtube.com
Jobs in AI, ML, Big Data
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne