Nov. 28, 2023, 8:54 p.m. | 1littlecoder


🏆 This leaderboard is based on the following three benchmarks.

Chatbot Arena - a crowdsourced, randomized battle platform. We use 100K+ user votes to compute Elo ratings.
MT-Bench - a set of challenging multi-turn questions. We use GPT-4 to grade the model responses.
MMLU (5-shot) - a test to measure a model's multitask accuracy on 57 tasks.

🔗 Links 🔗

ChatBOT Arena Leaderboard from Lmsys -

Arena Leaderboard Elo Ranking Method -

Play at the Arena - …

arena benchmarks chatbot compute elo gpt gpt-4 kind leaderboard llm mmlu platform questions rankings responses set test

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

Data Analyst


EY GDS Internship Program - Junior Data Visualization Engineer (June - July 2024)

@ EY | Wrocław, DS, PL, 50-086

Staff Data Scientist

@ ServiceTitan | INT Armenia Yerevan

Master thesis on deterministic AI inference on-board Telecom Satellites

@ Airbus | Taufkirchen / Ottobrunn

Lead Data Scientist

@ Picket | Seattle, WA