Terminal-Bench Science extends the original Terminal-Bench benchmark used by Anthropic, OpenAI, and Google DeepMind into scientific domains and opens for over 100 task contributions by August 17, 2026 · Digg