Research engineer kalomaze says Gemini 3.1 Pro’s mandatory 2,000-token reasoning requirement skews frontier leaderboard results · Digg