On-Policy Distillation Joins PapersWithCode With 183 Citing Papers
Sentiment: 100% positive, 0% negative
Users are excited about on-policy distillation as an AI post-training technique because recent papers, repos, and method descriptions look promising and well-organized.