Chatbot Arena - if you’ve felt that Claude 3 Opus still holds a slight edge over the new GPT-4 Turbo, we now understand why | allainews.com

April 15, 2024, 11:06 p.m. | /u/ok373737

Artificial Intelligence www.reddit.com

If we exclude the refusals (e.g., "I cannot answer") and only tally votes for actual responses, Claude 3 Opus continues to be marginally superior to the new GPT-4 Turbo.

Yes, you might think it’s pure bias on my part, but if you’re looking to compare the chatbots based on the quality of their responses when they do provide an answer, then excluding refusals might be a reasonable approach. This could give you a clearer picture of how well each chatbot …
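For anyone who wants to try this kind of tally themselves, here is a minimal sketch of the idea. Everything in it is an assumption for illustration: the record layout (`model_a`, `model_b`, `response_a`, `response_b`, `winner`), the refusal markers, and the placeholder model names are hypothetical, and this is not how Chatbot Arena itself detects refusals or computes its rankings.

```python
from collections import Counter

# Hypothetical refusal phrases; a real analysis would likely need a better detector.
REFUSAL_MARKERS = ("i cannot answer", "i can't help with", "i'm unable to")


def is_refusal(response: str) -> bool:
    """Return True if the response contains one of the assumed refusal phrases."""
    text = response.lower()
    return any(marker in text for marker in REFUSAL_MARKERS)


def tally_votes(battles, exclude_refusals=True):
    """Count wins per model, optionally skipping battles where either side refused."""
    wins = Counter()
    for battle in battles:
        refused = is_refusal(battle["response_a"]) or is_refusal(battle["response_b"])
        if exclude_refusals and refused:
            continue
        # Ties or other outcomes are ignored in this sketch.
        if battle["winner"] in ("model_a", "model_b"):
            wins[battle[battle["winner"]]] += 1
    return wins


# Toy records with placeholder model names, not real Arena data.
battles = [
    {"model_a": "model-x", "model_b": "model-y",
     "response_a": "Here is the answer.", "response_b": "I cannot answer that.",
     "winner": "model_a"},
    {"model_a": "model-x", "model_b": "model-y",
     "response_a": "Sure, here it is.", "response_b": "Here you go.",
     "winner": "model_b"},
    {"model_a": "model-y", "model_b": "model-x",
     "response_a": "An answer.", "response_b": "Another answer.",
     "winner": "model_b"},
]

all_votes = tally_votes(battles, exclude_refusals=False)
answered_only = tally_votes(battles, exclude_refusals=True)
```

Comparing `all_votes` with `answered_only` shows how much of a model's lead comes from the other side refusing rather than from the quality of its actual answers.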
