Reasoning As Search Integrates Value Model For CE Loss Gains

Original post
DP #197 · @DIMITRISPAPAILOPHD
Hiranmay Darshane | @HDARSHANE

if reasoning is a search process, the value model is within the model (as god intended) and thus completely amenable to improvements from CE loss on the right data

8:26 AM · May 18, 2026