11h ago

Swarat Chaudhuri releases AlphaProof Nexus, an LLM framework that solved nine open Erdős problems using Lean compiler feedback

The agent also solved 44 OEIS mathematical problems.

0
Original post

AI agents are advancing research-level math. 🚀 I’m thrilled to share @GoogleDeepMind’s AlphaProof Nexus - an agentic framework for formal proof search powered by Gemini. When applied to a set of open formal math problems, our agent autonomously solved: ✅ 9 open Erdős problems (including two open for 56 years!) ✅ 44 Online Encyclopedia of Integer Sequences (OEIS) problems ✅ A 15-year-old open problem in algebraic geometry ✅ A 7-year-old open question in min-max optimization We are collaborating with mathematicians across disciplines - from combinatorics and graph theory to quantum optics. Ultimately, these results show the massive potential of even simple agentic loops powered by Gemini. Read the paper here: https://arxiv.org/abs/2605.22763v1

8:39 AM · May 25, 2026 View on X
Reposted by

To my knowledge, this is the first large-scale empirical study of formal theorem proving by LLM agents. This work provides a key benchmark moving forward. Congrats to Swarat and team!

Swarat ChaudhuriSwarat Chaudhuri@swarat

Delighted to finally unveil these results! 🎉 Many congratulations to the team, who worked tirelessly for almost a year to build and evaluate AlphaProof Nexus. We revised many priors during this project — most notably, we discovered that with current frontier models, simple agent loops with compiler feedback can rival more sophisticated systems. We were struck both by the capabilities of our systems and the magnitude of the challenges ahead. I have never been as excited about the potential of formal math to enhance human creativity and bring rigor to AI. Onward! 🚀

4:56 PM · May 25, 2026 · 15.9K Views
7:06 PM · May 25, 2026 · 8.7K Views

See the formal proofs (in lean) discovered by the AlphaProof Nexus agent: https://github.com/google-deepmind/alphaproof-nexus-results

Pushmeet KohliPushmeet Kohli@pushmeet

AI agents are advancing research-level math. 🚀 I’m thrilled to share @GoogleDeepMind’s AlphaProof Nexus - an agentic framework for formal proof search powered by Gemini. When applied to a set of open formal math problems, our agent autonomously solved: ✅ 9 open Erdős problems (including two open for 56 years!) ✅ 44 Online Encyclopedia of Integer Sequences (OEIS) problems ✅ A 15-year-old open problem in algebraic geometry ✅ A 7-year-old open question in min-max optimization We are collaborating with mathematicians across disciplines - from combinatorics and graph theory to quantum optics. Ultimately, these results show the massive potential of even simple agentic loops powered by Gemini. Read the paper here: https://arxiv.org/abs/2605.22763v1

3:39 PM · May 25, 2026 · 91.2K Views
3:41 PM · May 25, 2026 · 3.1K Views
Swarat Chaudhuri releases AlphaProof Nexus, an LLM framework that solved nine open Erdős problems using Lean compiler feedback · Digg