New Paper Introduces Human-on-the-Bridge for Scalable AI Agent Evaluation · Digg