1d ago

Cohere's Kris Cao questions when the classic imitation learning algorithm DAgger became known as on-policy distillation

Both interactive training methods share significant functional overlap.

Sentiment

Pos100%

Neg0%

Users agree the link between DAgger and on-policy distillation is accurate, confirming the researcher's observation in a light-hearted reply.

1 comment with sentiment.

Cohere's Kris Cao questions when the classic imitation learning algorithm DAgger became known as on-policy distillation · Digg