RL Training Produces Correct But Unclear Language Models · Digg