1d agoCohere's Kris Cao questions when the classic imitation learning algorithm DAgger became known as on-policy distillationBoth interactive training methods share significant functional overlap.SentimentSentimentPos100%Neg0%Users agree the link between DAgger and on-policy distillation is accurate, confirming the researcher's observation in a light-hearted reply.1 comment with sentiment. View comments.