Teortaxes and Super Dario say GLM 5.2 used distillation from Claude and GPT 5.5 to seed agentic coding RL trajectories · Digg