WeirdML v2 benchmark finds Claude Sonnet 5 yields modest accuracy gains but major cost improvements over Sonnet 4.6 · Digg