Pass@k and self-consistency work great for math and code; sample more and verify. So we asked: can the same trick scale truthfulness in domains with no verifier? The answer was no.
Excited to share our #ICML2026 conference paper: Truthfulness Does Not Scale Like Reasoning. https://arxiv.org/pdf/2603.06612. I’ll be at ICML in Seoul to present it!
Co-led by @JoshuaK92829 and @yegordb
