CrystalReasoner Uses LLMs And RL To Generate Precise Crystal Structures

REPLY

Turns out constructing the right thinking traces was key. We used robocrystallographer to construct thinking traces containing information about crystallographic symmetry, local coordination, functional properties, and thermodynamic status, before generating the final CIF file.

Sherry Yang@sherryyangML

In CrystalReasoner, we ask the simple question: can we enable structure generation with LLMs without loosing the precision and 3D knowledge (something LLMs tend to struggle with), while utilizing the reasoning ability of LLMs?

5:28 PM · May 22, 2026 · 235 Views

8:20 PM · May 22, 2026 · 33 Views

REPLY

#776Sherry Yang@SHERRYYANGML

We found the combination of thinking tokens and RL with verifiable reward (e.g., validity) achieved the best performance in generating stable, unique, and novel structures, significantly outperforming previous CrystalTextLLM baselines.

Sherry Yang@sherryyangML

Turns out constructing the right thinking traces was key. We used robocrystallographer to construct thinking traces containing information about crystallographic symmetry, local coordination, functional properties, and thermodynamic status, before generating the final CIF file.

8:20 PM · May 22, 2026 · 33 Views

8:24 PM · May 22, 2026 · 27 Views

REPLY

#776Sherry Yang@SHERRYYANGML

As one might expect, we observe that more atoms require longer thinking traces, indicating that CrystalReasoner can perform adaptive reasoning according to the complexity of the generation task.

Sherry Yang@sherryyangML

We found the combination of thinking tokens and RL with verifiable reward (e.g., validity) achieved the best performance in generating stable, unique, and novel structures, significantly outperforming previous CrystalTextLLM baselines.

8:24 PM · May 22, 2026 · 27 Views

8:28 PM · May 22, 2026 · 22 Views

REPLY

#776Sherry Yang@SHERRYYANGML

To enable property-conditioned generation that is general enough to work for any properties (e.g., elastic properties, thermal expansions), we can design a general reward function by assigning positive reward to structures with properties falling in the specified range.

Sherry Yang@sherryyangML

As one might expect, we observe that more atoms require longer thinking traces, indicating that CrystalReasoner can perform adaptive reasoning according to the complexity of the generation task.

8:28 PM · May 22, 2026 · 22 Views

8:40 PM · May 22, 2026 · 42 Views

REPLY

#776Sherry Yang@SHERRYYANGML

Great work led by @yy_wu36197 in collaboration with @FallettaStefano and Delia McGrath.

Sherry Yang@sherryyangML

To enable property-conditioned generation that is general enough to work for any properties (e.g., elastic properties, thermal expansions), we can design a general reward function by assigning positive reward to structures with properties falling in the specified range.

8:40 PM · May 22, 2026 · 42 Views

8:42 PM · May 22, 2026 · 34 Views