Lun Wang leaves Google DeepMind and argues in a new blog post that static benchmarks will lose relevance for self-evolving models entering new capability regimes · Digg