5h ago

Tests Show Emotional Intelligence Varies Widely Across Frontier LLMs

0
Original post

We tested 11 frontier LLMs on 200 real human–AI conversations to measure emotional intelligence The result that surprised us: EQ doesn't scale with size or recency. Claude Haiku 4.5 beats Sonnet 4.6. Opus 4.6 performs better than 4.7 It's an orthogonal capability and labs aren't optimizing for it

11:54 AM · May 27, 2026 View on X