Gemini 3.5 Flash records the highest score of 47.1% Pass@1 on the APEX-Agents-AA benchmark, ahead of GPT-5.5 at 37.7% and Claude Opus 4.6 at 33.0%, according to Artificial Analysis data released May 19, 2026
Separate evaluations show leadership in coding, vision and finance tasks.
——0——



