Anti-Self-Distillation Boosts Math Reasoning Speed And Accuracy In RL

Original post

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

8:51 AM · May 20, 2026

paper: https://huggingface.co/papers/2605.11609

AK @_akhaliq

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

3:51 PM · May 20, 2026