5h ago

Tests Show Emotional Intelligence Varies Widely Across Frontier LLMs

6656294.5K

——0——

Original post

We tested 11 frontier LLMs on 200 real human–AI conversations to measure emotional intelligence The result that surprised us: EQ doesn't scale with size or recency. Claude Haiku 4.5 beats Sonnet 4.6. Opus 4.6 performs better than 4.7 It's an orthogonal capability and labs aren't optimizing for it

11:54 AM · May 27, 2026

Tests Show Emotional Intelligence Varies Widely Across Frontier LLMs

Sentiment

Cluster engagement