April 15, 2024, 11:06 p.m. | /u/ok373737

Artificial Intelligence www.reddit.com

If we exclude the refusals (e.g., "I cannot answer") ,and only tally votes for actual responses, Claude 3 Opus continues to be marginally superior to the new GPT-4 Turbo.

Yes, you might think it’s pure bias on my part, but if you’re looking to compare the chatbots based on the quality of their responses when they do provide an answer, then excluding refusals might be a reasonable approach. This could give you a clearer picture of how well each chatbot …

arena artificial bias chatbot chatbot arena claude claude 3 claude 3 opus edge felt gpt gpt-4 opus responses think turbo

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York