1h ago

Pareto AI Releases AttuneBench, First Open EQ Benchmark From Real Conversations

75411273.3K

——0——

Original post

1/ Today we're releasing AttuneBench, the first open EQ benchmark grounded in real multi-turn human-model conversations, scored against what the person actually felt and wanted at each turn. Built by the research team at @pareto_ai in collaboration with @thoughtfullab. Most existing EQ benchmarks rely on: - synthetic prompts - single-turn interactions - third-party annotation None directly measure how a model reads and responds to a real person across a full conversation. We evaluated 11 leading models from major providers, across 200 conversations and 50,000+ first-person annotations.

11:51 AM · May 27, 2026

Reposted by

#207@KARINANGUYEN