4h ago

DeepSWE coding benchmark ranks GPT-5.5 first at 70% while DeepSeek-v4-pro trails at 8%

Switching the agent framework cuts DeepSeek's gap by 80%

141243149.5K

——0——

Original post

DeepSeek bros have to lock in

@scaling01 Tbh I think just using Pi instead of mini-swe-agent would reduce the gap to Kimi by 80%

Lisan al Gaib@scaling01

DeepSeek bros have to lock in

7:54 PM · May 26, 2026 · 7.9K Views

9:50 PM · May 26, 2026 · 536 Views

sauce: https://deepswe.datacurve.ai/

Lisan al Gaib@scaling01

DeepSeek bros have to lock in

7:54 PM · May 26, 2026 · 7.9K Views

7:54 PM · May 26, 2026 · 1.8K Views

Sentiment