Qwen 3.7-Max Nears Opus With 60.6 on SWE-Bench Pro and Strong NL2Repo Score
——0——
as an aside, > PNG 17277 × 10523 > 6,2 MB on disk why does Qwen insist on doing this?
Composite table for the four benchmarks where Qwen has shown both 3.6-Max (Preview) and 3.7-Max. The progress is not exactly dramatic, but it is significant for 1 month. …Except NL2Repo. Is this real? They claim to have matched Opus in the one thing Opus is hyped for.
6:29 AM · May 20, 2026 · 3.1K Views
6:49 AM · May 20, 2026 · 694 Views

