Based on the bouba shape, my guess would be hard synth/RL env scaling with recursive generative design + eval.
Has anyone speculated about the training recipe of GLM-5.2? Beyond extensive RL, we know it's (at least?) a new midtrain ("GLM-5.2 is trained with IndexShare from mid-training with 128K sequence length") with arch changes.





