7h ago
Gemini-3.5-Flash regains the top spot on the Toolathlon leaderboard after five months, with a 56.5 percent Pass@1 score on 108 agent tasks. Gemini variants also hit 67.42 percent on Terminal-Bench 2.0 physics tasks.

Original post
Junlong Li | @LOCKONLVANGE
Gemini returns and ranks No.1 on Toolathlon again after 5 months. Great achievements and congratulations! @GoogleDeepMind
10:54 AM · May 19, 2026