Meta, OpenAI, xAI and Baidu are all known to have trained a >2T model (Behemoth, GPT 4.5, Grok 3/4, ERNIE 5). All were flawed and eventually got replaced by smaller AND stronger ones. It's not clear to me that anyone in China (or outside GDM/Ant) currently knows how to do this.
Heh, I did well baiting Elon into giving this prediction. Anyway, Hamish is well-calibrated on the estimate, but 1) I doubt any Chinese player will commit to its first Mythos-scale job outside the Mainland; the risk of meddling is high. 2) We don't know if they *can* train a multi-T LLM.

