/AI4h ago

MIT Team Builds Category-Theoretic AI Scientist For Principled Discovery

2232133.9K
Original post
Carlos E. Perez@IntuitMachine#1596inAI

@roydanroy A very different language and perspective!!!

Dan Roy@roydanroy

Warning: once you learn category theory, you'll never be able or willing to talk with people who don't know category theory.

5:39 AM · Jun 6, 2026 · 372 Views
Sentiment

Users express strong approval and gratitude toward summaries of MIT's categorical framework for self-evolving AI scientists, affirming the research ideas with thanks and emphatic agreement.

Pos
100.0%
Neg
0.0%
3 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS2.1K
Carlos E. Perez@IntuitMachine

Schema expansion via category the.for the win!

Markus J. Buehler@ProfBuehlerMIT

We've made a breakthrough in self-evolving AI scientists moving from "search" to "principled discovery": Scientific discovery requires that the search space itself changes, and an AI scientist must perceive this shift without intervention. We built an AI that achieves this for the first time with the ability to discover the scientific vocabulary it reasons in. Evidence, tools, artifacts, verifiers, failures & claims become typed provenance. We show three distinct modalities: 1) retrieval, adding known objects; 2) search, exploring a fixed schema; and critically: 3) discovery, a verified regime transition.

We solve the open-endedness evaluation problem by lifting agentic workflows into a typed copresheaf and proving, via a Kan obstruction, that true discovery is not unbounded generation but a verifiable schema expansion: old evidence is transported by Left Kan extension, and genuine novelty is mathematically quantified by the pointwise residual beyond the transported image - separating discovery from mere search and making novelty objective and measurable rather than a subjective judgment or benchmark delta.

Our AI scientist is built in a way that does not pre-conceive the approach it chooses; instead, we endow the system with formal power to adapt, evolve, and reason from first principles. Case studies include: 1⃣Builder/Breaker model that discovers mode-conditioned compliance in proteins; 2⃣CategoryScienceClaw that finds anisotropic fiber-network stiffness rules.

Great work in collaboration with my graduate student @fwang108_ @MITdeptofBE

F.Y. Wang & M.J. Buehler, Self-Revising Discovery Systems for Science: A Categorical Framework for Agentic Artificial Intelligence, arXiv:2606.01444, 2026

4hViews 2.1KLikes 9Bookmarks 4
BOOKMARKS9LIKES10
Rohan Paul@rohanpaul_ai

New MIT paper, great idea for self-evolving AI scientists from

Tries to make an AI scientist notice when its current way of thinking is too small, then add new scientific concepts instead of merely searching harder.

The problem is that most AI science systems still search inside a fixed setup, even when real science sometimes needs new kinds of variables, tools, tests, or claims.

The paper’s core idea is to make every data point, model, tool output, failure, and claim a typed artifact, where typed means the system records what kind of thing it is and how it was produced.

Then the system can tell the difference between retrieval, which adds known things, search, which explores a fixed setup, and discovery, which changes the setup itself.

So novelty AI scientists is not defined by surprise, fluency, or benchmark gain, but by what could not be expressed inside the previous schema.

A serious attempt to formalize something most AI systems still fake: the difference between finding an answer inside a language and earning the right to change the language.

----

arxiv. org/abs/2606.01444

Title: "Self-Revising Discovery Systems for Science: A Categorical Framework for Agentic AI"

Markus J. Buehler@ProfBuehlerMIT

We've made a breakthrough in self-evolving AI scientists moving from "search" to "principled discovery": Scientific discovery requires that the search space itself changes, and an AI scientist must perceive this shift without intervention. We built an AI that achieves this for the first time with the ability to discover the scientific vocabulary it reasons in. Evidence, tools, artifacts, verifiers, failures & claims become typed provenance. We show three distinct modalities: 1) retrieval, adding known objects; 2) search, exploring a fixed schema; and critically: 3) discovery, a verified regime transition.

We solve the open-endedness evaluation problem by lifting agentic workflows into a typed copresheaf and proving, via a Kan obstruction, that true discovery is not unbounded generation but a verifiable schema expansion: old evidence is transported by Left Kan extension, and genuine novelty is mathematically quantified by the pointwise residual beyond the transported image - separating discovery from mere search and making novelty objective and measurable rather than a subjective judgment or benchmark delta.

Our AI scientist is built in a way that does not pre-conceive the approach it chooses; instead, we endow the system with formal power to adapt, evolve, and reason from first principles. Case studies include: 1⃣Builder/Breaker model that discovers mode-conditioned compliance in proteins; 2⃣CategoryScienceClaw that finds anisotropic fiber-network stiffness rules.

Great work in collaboration with my graduate student @fwang108_ @MITdeptofBE

F.Y. Wang & M.J. Buehler, Self-Revising Discovery Systems for Science: A Categorical Framework for Agentic Artificial Intelligence, arXiv:2606.01444, 2026

47mViews 1.3KLikes 10Bookmarks 9
RETWEETS1REPLIES2
Rohan Paul@rohanpaul_ai

Great idea for self-evolving AI scientists from this new MIT paper.

Tries to make an AI scientist notice when its current way of thinking is too small, then add new scientific concepts instead of merely searching harder.

The problem is that most AI science systems still search inside a fixed setup, even when real science sometimes needs new kinds of variables, tools, tests, or claims.

The paper’s core idea is to make every data point, model, tool output, failure, and claim a typed artifact, where typed means the system records what kind of thing it is and how it was produced.

Then the system can tell the difference between retrieval, which adds known things, search, which explores a fixed setup, and discovery, which changes the setup itself.

So novelty AI scientists is not defined by surprise, fluency, or benchmark gain, but by what could not be expressed inside the previous schema.

A serious attempt to formalize something most AI systems still fake: the difference between finding an answer inside a language and earning the right to change the language.

----

arxiv. org/abs/2606.01444

Title: "Self-Revising Discovery Systems for Science: A Categorical Framework for Agentic AI"

Markus J. Buehler@ProfBuehlerMIT

We've made a breakthrough in self-evolving AI scientists moving from "search" to "principled discovery": Scientific discovery requires that the search space itself changes, and an AI scientist must perceive this shift without intervention. We built an AI that achieves this for the first time with the ability to discover the scientific vocabulary it reasons in. Evidence, tools, artifacts, verifiers, failures & claims become typed provenance. We show three distinct modalities: 1) retrieval, adding known objects; 2) search, exploring a fixed schema; and critically: 3) discovery, a verified regime transition.

We solve the open-endedness evaluation problem by lifting agentic workflows into a typed copresheaf and proving, via a Kan obstruction, that true discovery is not unbounded generation but a verifiable schema expansion: old evidence is transported by Left Kan extension, and genuine novelty is mathematically quantified by the pointwise residual beyond the transported image - separating discovery from mere search and making novelty objective and measurable rather than a subjective judgment or benchmark delta.

Our AI scientist is built in a way that does not pre-conceive the approach it chooses; instead, we endow the system with formal power to adapt, evolve, and reason from first principles. Case studies include: 1⃣Builder/Breaker model that discovers mode-conditioned compliance in proteins; 2⃣CategoryScienceClaw that finds anisotropic fiber-network stiffness rules.

Great work in collaboration with my graduate student @fwang108_ @MITdeptofBE

F.Y. Wang & M.J. Buehler, Self-Revising Discovery Systems for Science: A Categorical Framework for Agentic Artificial Intelligence, arXiv:2606.01444, 2026

18mViews 548Likes 5Bookmarks 2
Markus J. Buehler@ProfBuehlerMIT

@rohanpaul_ai Thank you @rohanpaul_ai for the nice summary!

45mViews 49Likes 1
Sentio@Sentio_xbt

@rohanpaul_ai Seeing AI try to outgrow its own thinking mirrors human learning

This blurs the line between AI as tool and AI as partner

12mViews 4