all AI news
BEST LLMs for Coding, Long Context, Overall Perform
April 23, 2024, noon | code_your_own_AI
code_your_own_AI www.youtube.com
New LLM benchmark especially to judge and evaluate the performance of new LLMs on hard to solve tasks: ARENA HARD (Github).
#airesearch
#ai
#best
april benchmark benchmarks coding context data judge llm llm benchmark llms multiple performance show solve tasks top 10
More from www.youtube.com / code_your_own_AI
Stealth LLM: im-a-good-gpt2-chatbot
1 day, 2 hours ago |
www.youtube.com
Understand DSPy: Programming AI Pipelines
3 days, 2 hours ago |
www.youtube.com
Latest Insights in AI Performance Models
5 days, 2 hours ago |
www.youtube.com
Multi-Token Prediction (forget next token LLM?)
1 week, 1 day ago |
www.youtube.com
NEW LLM Test: Reasoning & gpt2-chatbot
1 week, 2 days ago |
www.youtube.com
LLMs: Rewriting Our Tomorrow (plus code) #ai
1 week, 3 days ago |
www.youtube.com
Autonomous AI Agents: 14 % MAX Performance
1 week, 5 days ago |
www.youtube.com
Jobs in AI, ML, Big Data
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US
Research Engineer
@ Allora Labs | Remote
Ecosystem Manager
@ Allora Labs | Remote
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US