Poolside releases technical report on training Laguna M.1 and XS.2 coding models and mitigating expert collapse
The document details scaling laws and synthetic data composition.
Users praised Poolside's Laguna M.1 and XS.2 technical report as "really really good."
it's really really good
https://poolside.ai/assets/laguna/laguna-m1-xs2-technical-report.pdf
@yacineMTB very good, some of my fav part
wow, amazing tech report. lots of details on every part of the pipeline, especially on data. love that they share the system design of how they train models and do research with their "model factory", and also the negative results from M1 and how they fixed them in XS.2
one of the best tech reports to get up to speed on model training
Today we’re publishing the technical report behind Laguna M.1 and Laguna XS.2.
This report opens up more of what went into them: Model Factory, pre-training data, distributed training, post-training, agent RL, quantization, and evaluation.
https://poolside.ai/assets/laguna/laguna-m1-xs2-technical-report.pdf