AI Security Institute says agent benchmarks must report performance curves over compute rather than static scores · Digg