Open-source contributor Grad says applying length penalties to software engineering tasks first prevents redundant reinforcement learning training · Digg