1d ago

Reasoning As Search Integrates Value Model For CE Loss Gains

——0——
Original post
Dimitris PapailiopoulosDP#197@DIMITRISPAPAILOPHiranmay DarshaneHDHiranmay Darshane|@HDARSHANE

if reasoning is a search process, the value model is within the model (as god intended) and thus completely amenable to improvements from CE loss on the right data

8:26 AM · May 18, 2026 View on X
112222.0K

Cluster engagement

87 snapshots