2d ago

Team releases 30B-A3B model achieving gold-medal olympiad performance

0

Researchers released a 30B-A3B reasoning model that attains gold-medal results on International Physics Olympiad problems and equivalent performance on IMO and USAMO contests. The model applies test-time self-verification for mathematics and a direct approach for physics tasks. It introduces a unified scaling method for automated proof search across both domains. The paper Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling is available on Hugging Face under arXiv 2605.13301.

Original post

We’re releasing a 30B-A3B reasoning model that reaches gold-medal level across both physics and math Olympiad evaluations: IPhO directly, and IMO/USAMO with test-time self-verification and refinement. A simple, unified scaling recipe for proof search. https://huggingface.co/papers/2605.13301

8:08 PM · May 14, 2026 View on X
Reposted by

Paper of the day! https://huggingface.co/papers/2605.13301

Ning DingNing Ding@stingning

We’re releasing a 30B-A3B reasoning model that reaches gold-medal level across both physics and math Olympiad evaluations: IPhO directly, and IMO/USAMO with test-time self-verification and refinement. A simple, unified scaling recipe for proof search. https://huggingface.co/papers/2605.13301

3:08 AM · May 15, 2026 · 270.9K Views
5:03 PM · May 15, 2026 · 48.7K Views

IMO gold at <= 4B active is very real now.

Ning DingNing Ding@stingning

We’re releasing a 30B-A3B reasoning model that reaches gold-medal level across both physics and math Olympiad evaluations: IPhO directly, and IMO/USAMO with test-time self-verification and refinement. A simple, unified scaling recipe for proof search. https://huggingface.co/papers/2605.13301

3:08 AM · May 15, 2026 · 270.9K Views
7:05 AM · May 15, 2026 · 20.6K Views
Team releases 30B-A3B model achieving gold-medal olympiad performance · Digg