8h ago

Anti-Self-Distillation Boosts Math Reasoning Speed And Accuracy In RL

Original post

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

paper: https://huggingface.co/papers/2605.11609

AK (@_akhaliq)

3:51 PM · May 20, 2026 · 4.3K Views
