Speaking of, I think Vals underrates Qwen 3.7 Max. It's one of the strongest Chinese models overall, but it's pulled down by a ridiculously low Vibe Code Bench v1.1 score. Like, it's below its lesser open-source siblings: 3.7 *Plus* gets 46.4 there. What's up?
For those looking into open-weight models in light of recent news … we’ve just evaluated Kimi K2.7 Code on the Vals coding benchmarks.




