Toto 2.0 is here: Datadog AI's 5 open-weights forecasting models (4m-2.5B params) finally make scaling work for time series forecasting! #1 on BOOM, GIFT-Eval, and TIME. Weights/code Apache 2.0. 🔗 Read the blog post for more details: https://bit.ly/4tCFvKL
Datadog releases Toto 2.0 time series foundation models
AI Judge changed title after evaluation, original title: "Toto 2.0 releases open-weights time series foundation models"
Datadog released Toto 2.0, a family of five open-weights time series foundation models spanning 4 million to 2.5 billion parameters. The models were trained from a single hyperparameter configuration using u-μP scaling. They are available under the Apache 2.0 license on Hugging Face, with inference code on GitHub and integration examples for GluonTS. Toto 2.0 achieves state-of-the-art results on the BOOM, GIFT-Eval, and TIME benchmarks, with forecast quality improving reliably as parameter count increases.
Users are excited about Toto 2.0 because its reliable scaling improvements resemble GPT-2 and the team is openly releasing the weights under Apache 2.0 for others to build with.
No Digg Deeper questions have been answered for this story yet.
