AI engineer Yupo Niu argues RLVR-trained models derive their generalization from diverse training environments rather than inherent capabilities · Digg