Elon Musk says xAI finished training its 1.5-trillion-parameter Grok V9-Medium model using Cursor data for supplementary training
Public release is scheduled within two to three weeks.
@eliebakouch We will open source the 0.5T model towards the end of this year. It should still be quite useful.
@elonmusk congrats! looking forward for the release :) any plan on open sourcing the 0.5T model? would be super cool
@elonmusk Excited to see
Grok foundation model V9-Medium (1.5T) has finished training. Evals look good. A lot of Cursor data was added in supplementary training and there is more to come. Fine-tuning is underway and reinforcement learning begins in a few days. 2 to 3 weeks to public release. This will be a major improvement over the 0.5T v8-small that currently serves all Grok production traffic, especially for difficult coding tasks.
Grok Build being built in public 🚀
Grok foundation model V9-Medium (1.5T) has finished training. Evals look good. A lot of Cursor data was added in supplementary training and there is more to come. Fine-tuning is underway and reinforcement learning begins in a few days. 2 to 3 weeks to public release. This will be a major improvement over the 0.5T v8-small that currently serves all Grok production traffic, especially for difficult coding tasks.
> A lot of Cursor data was added in supplementary training and there is more to come. ok, maybe xAI is getting serious
Grok foundation model V9-Medium (1.5T) has finished training. Evals look good. A lot of Cursor data was added in supplementary training and there is more to come. Fine-tuning is underway and reinforcement learning begins in a few days. 2 to 3 weeks to public release. This will be a major improvement over the 0.5T v8-small that currently serves all Grok production traffic, especially for difficult coding tasks.
only 2-3 weeks for the RL phase of a 1.5T model is an interesting data point. also interesting that he mentioned cursor data was added before post training, they're likely doing heavy mid training like in composer 2.5 and wilkl focus heavily on code
Grok foundation model V9-Medium (1.5T) has finished training. Evals look good. A lot of Cursor data was added in supplementary training and there is more to come. Fine-tuning is underway and reinforcement learning begins in a few days. 2 to 3 weeks to public release. This will be a major improvement over the 0.5T v8-small that currently serves all Grok production traffic, especially for difficult coding tasks.
@elonmusk congrats! looking forward for the release :) any plan on open sourcing the 0.5T model? would be super cool
Grok foundation model V9-Medium (1.5T) has finished training. Evals look good. A lot of Cursor data was added in supplementary training and there is more to come. Fine-tuning is underway and reinforcement learning begins in a few days. 2 to 3 weeks to public release. This will be a major improvement over the 0.5T v8-small that currently serves all Grok production traffic, especially for difficult coding tasks.
@elonmusk huge!!
@eliebakouch We will open source the 0.5T model towards the end of this year. It should still be quite useful.