New Benchmark Shows AI Agents Struggle to Learn from Experience · Digg