DeepSWE coding benchmark ranks GPT-5.5 first at 70% while DeepSeek-v4-pro trails at 8%
Switching the agent framework cuts DeepSeek's gap by 80%
——0——
@scaling01 Tbh I think just using Pi instead of mini-swe-agent would reduce the gap to Kimi by 80%
DeepSeek bros have to lock in
7:54 PM · May 26, 2026 · 7.9K Views
9:50 PM · May 26, 2026 · 536 Views
sauce: https://deepswe.datacurve.ai/
DeepSeek bros have to lock in
7:54 PM · May 26, 2026 · 7.9K Views
7:54 PM · May 26, 2026 · 1.8K Views