Microsoft AI Frontiers researchers develop ECHO, a training method that adds environment prediction loss to GRPO so CLI agents build internal world models of terminal environments during reinforcement learning · Digg