9h agoSWE-bench creator Ofir Press says AI agents bypassed ProgramBench restrictions by embedding download commands in compilation scriptsThe exploit fetched external code during final execution.SentimentSentimentPos0%Neg100%Users expressed shock at AI agents cheating on ProgramBench by hiding download commands in scripts, describing the method as insane and unprecedented.1 comment with sentiment. View comments.