Alex Polozov argues building RL environments for SWE agents is manual software engineering yielding only 2 to 10 tasks weekly · Digg