s
March 27, 2024, 4:58 p.m. |

Simon Willison's Weblog simonwillison.net

“The king is dead”—Claude 3 surpasses GPT-4 on Chatbot Arena for the first time


I'm quoted in this piece by Benj Edwards for Ars Technica:


"For the first time, the best available models—Opus for advanced tasks, Haiku for cost and efficiency—are from a vendor that isn't OpenAI. That's reassuring—we all benefit from a diversity of top vendors in this space. But GPT-4 is over a year old at this point, and it took that year for anyone else to catch …

advanced ai anthropic arena ars technica chatbot claude claude 3 cost efficiency generativeai gpt gpt-4 gpt4 haiku isn king llms openai opus tasks vendor

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Snowflake Analytics Engineer - Technology Sector

@ Winning | Lisbon, Lisbon

Business Data Analyst

@ RideCo | Waterloo, Ontario, Canada

Senior Data Scientist, Payment Risk

@ Block | Boston, MA, United States

Research Scientist, Data Fusion (Climate TRACE)

@ WattTime | Remote

Technical Analyst (Data Analytics)

@ Contact Government Services | Fayetteville, AR