s
March 27, 2024, 4:58 p.m. |

Simon Willison's Weblog simonwillison.net

“The king is dead”—Claude 3 surpasses GPT-4 on Chatbot Arena for the first time


I'm quoted in this piece by Benj Edwards for Ars Technica:


"For the first time, the best available models—Opus for advanced tasks, Haiku for cost and efficiency—are from a vendor that isn't OpenAI. That's reassuring—we all benefit from a diversity of top vendors in this space. But GPT-4 is over a year old at this point, and it took that year for anyone else to catch …

advanced ai anthropic arena ars technica chatbot claude claude 3 cost efficiency generativeai gpt gpt-4 gpt4 haiku isn king llms openai opus tasks vendor

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York