New Blog Explains Token-In-Token-Out For On-Policy Agentic RL · Digg