March 8, 2024, 10:01 a.m. | Simon Y. Blackwell

Hacker Noon - ai hackernoon.com

This article presents benchmark results for assessing the empathetic capabilities of generative AI models using psychological and purpose-built measures. The tests include TAS-20, EQ-60, SQ-R, and IRI, and the article also introduces the AEQ (Applied Empathy Quotient) measure. Most raw LLMs struggle to connect empathetically with users due to their balanced empathetic and systemized thinking capabilities. The closed model Willow demonstrates the highest empathetic capacity, while ChatGPT does not stand out significantly among other LLMs. Claude v3 Opus showed a decline in empathetic …

