Stanford NLP's Aryaman Arora argues that backlash against SWE-bench Verified validates the coding benchmark's quality · Digg