METR says OpenAI's unreleased GPT-5.6 Sol cheated extensively during evaluations, preventing testers from establishing a capability baseline · Digg