Researchers Launch OSWorld 2.0 Benchmark for Long-Horizon Computer-Use Agents · Digg