2h agoPrime Intellect's Florian Brand warns software agent benchmarks could saturate this year due to easier task distributionsCognition covers everyday coding, while METR targets specialized cybersecurity.