all AI news
Topic: leaderboard
Is it a NEW OpenAI MODEL? (Testing gpt2-chatbot)
1 day, 21 hours ago |
www.youtube.com
Benchmarking LLMs via Uncertainty Quantification
5 days, 14 hours ago |
arxiv.org
Options for accessing Llama 3 from the terminal using LLM
1 week, 2 days ago |
simonwillison.net
GPT-4 Just Got Supercharged!
1 week, 6 days ago |
www.youtube.com
Command R+ now ranked 6th on the LMSYS Chatbot Arena
3 weeks, 1 day ago |
simonwillison.net
Berkeley Function-Calling Leaderboard
1 month, 2 weeks ago |
simonwillison.net
New LLM Benchmark Leaderboard: WildBench
1 month, 2 weeks ago |
www.youtube.com
[P] Accuracy Ranking of Classifiers on Tabular Data
1 month, 3 weeks ago |
www.reddit.com
OpenAI robots and MWC tech lead ZDNET's Innovation Index
1 month, 3 weeks ago |
www.zdnet.com
Introducing the Red-Teaming Resistance Leaderboard
2 months, 1 week ago |
huggingface.co
Introducing the Red-Teaming Resistance Leaderboard
2 months, 1 week ago |
huggingface.co
LEGOBench: Scientific Leaderboard Generation Benchmark
2 months, 1 week ago |
arxiv.org
Don't Overlook China's Open Source LLMs
2 months, 2 weeks ago |
thesequence.substack.com
Meet ‘Smaug-72B’: The new king of open-source AI
2 months, 3 weeks ago |
venturebeat.com
Multi: Multimodal Understanding Leaderboard with Text and Images
2 months, 3 weeks ago |
arxiv.org
OpenAI Launches New Store For Users to Share Custom Chatbots
3 months, 3 weeks ago |
bloomberg.com
Nvidia’s Stock Breakout Puts Amazon Within Sight
3 months, 3 weeks ago |
bloomberg.com
Is it a NEW OpenAI MODEL? (Testing gpt2-chatbot)
1 day, 21 hours ago |
www.youtube.com
Benchmarking LLMs via Uncertainty Quantification
5 days, 14 hours ago |
arxiv.org
Items published with this topic over the last 90 days.
Latest
Is it a NEW OpenAI MODEL? (Testing gpt2-chatbot)
1 day, 21 hours ago |
www.youtube.com
Benchmarking LLMs via Uncertainty Quantification
5 days, 14 hours ago |
arxiv.org
Options for accessing Llama 3 from the terminal using LLM
1 week, 2 days ago |
simonwillison.net
GPT-4 Just Got Supercharged!
1 week, 6 days ago |
www.youtube.com
Command R+ now ranked 6th on the LMSYS Chatbot Arena
3 weeks, 1 day ago |
simonwillison.net
Berkeley Function-Calling Leaderboard
1 month, 2 weeks ago |
simonwillison.net
New LLM Benchmark Leaderboard: WildBench
1 month, 2 weeks ago |
www.youtube.com
[P] Accuracy Ranking of Classifiers on Tabular Data
1 month, 3 weeks ago |
www.reddit.com
OpenAI robots and MWC tech lead ZDNET's Innovation Index
1 month, 3 weeks ago |
www.zdnet.com
Introducing the Red-Teaming Resistance Leaderboard
2 months, 1 week ago |
huggingface.co
Introducing the Red-Teaming Resistance Leaderboard
2 months, 1 week ago |
huggingface.co
LEGOBench: Scientific Leaderboard Generation Benchmark
2 months, 1 week ago |
arxiv.org
Don't Overlook China's Open Source LLMs
2 months, 2 weeks ago |
thesequence.substack.com
Meet ‘Smaug-72B’: The new king of open-source AI
2 months, 3 weeks ago |
venturebeat.com
Multi: Multimodal Understanding Leaderboard with Text and Images
2 months, 3 weeks ago |
arxiv.org
OpenAI Launches New Store For Users to Share Custom Chatbots
3 months, 3 weeks ago |
bloomberg.com
Nvidia’s Stock Breakout Puts Amazon Within Sight
3 months, 3 weeks ago |
bloomberg.com
Topic trend (last 90 days)
Top (last 7 days)
Is it a NEW OpenAI MODEL? (Testing gpt2-chatbot)
1 day, 21 hours ago |
www.youtube.com
Benchmarking LLMs via Uncertainty Quantification
5 days, 14 hours ago |
arxiv.org
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Associate Data Engineer
@ Nominet | Oxford/ Hybrid, GB
Data Science Senior Associate
@ JPMorgan Chase & Co. | Bengaluru, Karnataka, India