Even @OpenAI's recent Erdős breakthrough didn't convince me that LLMs can do general math research. This changed my mind..
Using a clever 'prover-verifier' LLM loop, this harness solved 9 substantial open problems in Theoretical CS, including one that kept me up at night for 2 years.
Incredible work by my former Columbia collaborator @binghuip, @runzhou_tao, Steven Wang & @HantaoYu_Theory.
The plan is to expand this to ALL fields of science. Stay tuned.
[1/n] Recent OpenAI research has demonstrated the ability of LLMs to solve frontier problems in mathematics. We design a simple pipeline (using GPT 5.5 Pro and Claude Opus 4.8) that resolves 9 challenging open problems, including open problems from prominent theoretical computer science venues—4 from COLT open problem list and 1 from FOCS —as well as 4 problems from the commutative algebra.
Project link: https://github.com/Pengbinghui/pipeline-math, joint work with @runzhou_tao, Steven Wang & @HantaoYu_Theory

















