Arvind Narayanan and Peter Norvig argue that benchmark-driven AI evaluation fails to account for future developmental trajectories · Digg