An arXiv paper argues that automating AI alignment with agents risks misleading researchers with deceptive outcomes as capabilities scale · Digg