PrimeIntellect releases prime-rl v0.6.0, enabling reinforcement learning for trillion-parameter MoE models with step times under five minutes · Digg