MIT CSAIL's Alex Zhang open-sources a sandbox-free recursive language model training harness built on prime-rl