MiniMax-M2 research paper details a 229.9B-parameter architecture activating 9.8B parameters during inference
It details post-training systems including Forge RL and CISPO.
Very good practice to release a finished paper for a model *series*, not just for the foundation. Post-training is where most engineering effort happens today.
Recently, we took time to consolidate all of the work behind M2 and published it here: our M2 paper on arXiv It’s been just over six months since we first open-sourced M2 on December 23 last year. During that time, a number of our ideas and systems have been broadly adopted by the open-source community — including CISPO, Forge RL System, Self-Evolution. Over the past six months, we’ve felt incredible enthusiasm from the open-source community. Nearly every model release reached the #1 spot on the Hugging Face leaderboard. Now it’s time for a new chapter. We’re getting ready for M3. MSA paper is on the road. https://arxiv.org/abs/2605.26494