Self-distillation does not work for thinking models YET
https://arxiv.org/abs/2603.24472 https://openreview.net/forum?id=VhCJItwQHn https://arxiv.org/abs/2606.11709
ML Twitter: What's your favorite on-policy (self)-distillation paper / blogs from this year? Sharing your own work is totally fine!
If who want to learn more about LLM distillation, you can watch: https://youtu.be/O1AR4iL30mg?si=Zznk_BYCnjCAmhAz





