AI2 post-training lead Nathan Lambert releases a free lecture on the evolution of synthetic data and on-policy distillation · Digg