Stanford researchers release SPIRAL, a reinforcement learning framework that trains LLMs to coordinate parallel and aggregative inference compute · Digg