Google DeepMind's Philipp Schmid sparks debate on whether to optimize AI models for evaluation harnesses or vice versa · Digg