DeepSeek V4 Scores Rise On Dim-Agent Benchmark Despite Russian Test Failures · Digg