Yes, mini-swe-agent is also great on our new agentic DrugDiscoveryBench: https://labs.scale.com/leaderboard/drugdiscoverybench Simple, but a really great harness! @OfirPress @KLieret
Our mini-swe-agent is becoming a standard harness for running benchmarks across industry and academia, because it's easy to run & extend while achieving performance similar to or better than Claude Code and Codex on benchmarks.
