Dawn Song and Yiyou Sun launch Agents' Last Exam, finding top AI agents score just 2.6% on hard professional tasks · Digg