March 27, 2024, 4:58 p.m.

Simon Willison's Weblog simonwillison.net

“The king is dead”—Claude 3 surpasses GPT-4 on Chatbot Arena for the first time


I'm quoted in this piece by Benj Edwards for Ars Technica:


"For the first time, the best available models—Opus for advanced tasks, Haiku for cost and efficiency—are from a vendor that isn't OpenAI. That's reassuring—we all benefit from a diversity of top vendors in this space. But GPT-4 is over a year old at this point, and it took that year for anyone else to catch …

