Claude Mythos Dominates Coding Benchmarks But Ties GPT On Physics Research Eval · Digg