Cursor and xAI begin training a significantly larger AI model from scratch using 10 times more compute than prior runs on Colossus 2's one-million GPU cluster
xAI is also training 6-trillion and 10-trillion parameter Grok models.
Elon also has a 6T and 10T version of Grok in training now. The step-change that emerged when Anthropic had enough compute to train Mythos in March will start appearing from dozens of other places this year.

Together with SpaceXAI, we’re training a significantly larger model from scratch, using 10x more total compute. With Colossus 2’s million H100-equivalents and our combined data and training techniques, we expect this to be a major leap in model capability.
Big model coming soon ... !
…and more to come
Together with SpaceXAI, we’re training a significantly larger model from scratch, using 10x more total compute. With Colossus 2’s million H100-equivalents and our combined data and training techniques, we expect this to be a major leap in model capability.
Big model coming soon ... !
Together with SpaceXAI, we’re training a significantly larger model from scratch, using 10x more total compute. With Colossus 2’s million H100-equivalents and our combined data and training techniques, we expect this to be a major leap in model capability.