> During the process of benchmarking dim-agent, we discovered that DSv4's scores kept improving. Ah. This is the February-April playbook, when DeepSeek-Web (now known to be V4-Flash) kept getting better at long context. I guess they're deploying checkpoints after OPD rounds.
Time to reveal the answer. While benchmarking dim-agent, we found that DSv4's scores kept improving. The whales are cooking!





