Benjamin Anderson says open-source agent evaluation tool Harbor is poorly suited for training non-coding agents · Digg