The Meta-Agent Challenge benchmark finds AI agents struggle to autonomously design and build other agents · Digg