April 15, 2024, 11:06 p.m. | /u/ok373737

Artificial Intelligence www.reddit.com

If we exclude the refusals (e.g., "I cannot answer") and only tally votes for actual responses, Claude 3 Opus continues to be marginally superior to the new GPT-4 Turbo.

Yes, you might think it’s pure bias on my part, but if you’re looking to compare the chatbots based on the quality of their responses when they do provide an answer, then excluding refusals might be a reasonable approach. This could give you a clearer picture of how well each chatbot …
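For anyone who wants to try this kind of tally themselves, here is a minimal sketch of the idea, not the actual Chatbot Arena pipeline. Everything in it is an assumption made for illustration: the record layout (`model_a`, `model_b`, `response_a`, `response_b`, `winner`), the refusal phrases, the placeholder model names, and the toy battles are all hypothetical. Real refusal detection would likely need something better than substring matching.

```python
from collections import Counter

# Hypothetical refusal phrases; a real list would need careful curation.
REFUSAL_MARKERS = ("i cannot answer", "i can't help with")


def is_refusal(text):
    """Return True if the response contains one of the refusal phrases."""
    lowered = text.lower()
    return any(marker in lowered for marker in REFUSAL_MARKERS)


def tally_without_refusals(battles):
    """Count wins per model, skipping any battle where either side refused."""
    wins = Counter()
    for battle in battles:
        if is_refusal(battle["response_a"]) or is_refusal(battle["response_b"]):
            continue  # drop the whole battle, not just the refusing side
        if battle["winner"] in ("a", "b"):  # ignore ties or other labels
            wins[battle["model_" + battle["winner"]]] += 1
    return wins


# Made-up toy battles with placeholder model names, for illustration only.
battles = [
    {"model_a": "model_x", "model_b": "model_y",
     "response_a": "Paris.", "response_b": "I cannot answer that.",
     "winner": "a"},
    {"model_a": "model_x", "model_b": "model_y",
     "response_a": "4", "response_b": "Four.",
     "winner": "b"},
    {"model_a": "model_y", "model_b": "model_x",
     "response_a": "Sure, here it is.", "response_b": "Here you go.",
     "winner": "b"},
]

wins = tally_without_refusals(battles)
print(dict(wins))
```

Note the design choice: a battle is dropped if either response is a refusal, so a model gets no credit for "winning" against a refusal. That is one reading of "only tally votes for actual responses"; other filtering rules are possible and would give different counts.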

