Mythos/Fable destroys everything else in everything all at once, with large gaps. Except this "everything" is 20 different evals of agentic coding. On CritPt, they are merely equal with 5.5 (and below 5.5-Pro.) Similar in vision. Gigantic next gen model, made for ONE thing.
Generally I feel more sympathy for OpenAI lately. Here they are, trying to RLVR towards actual scientific AGI that'll solve problems directly, as on CritPT. And their great safety-conscious competition: code, code, B2B, B2B, attack, exploit, chyna hawkery, RSI. Not equal.



