Toto 2.0: Time Series Forecasting Enters the Scaling Era

2026 · cs.LG · arXiv 2605.20119

1 Pith paper cites this work. Polarity classification is still indexing.

abstract

We show that time series foundation models scale: a single training recipe produces reliable forecast-quality improvements from 4M to 2.5B parameters. We release Toto 2.0, a family of five open-weights forecasting models trained under this recipe. The Toto 2.0 family sets a new state of the art on three forecasting benchmarks: BOOM, our observability benchmark; GIFT-Eval, the standard general-purpose benchmark; and the recent contamination-resistant TIME benchmark. This report describes our experimental results and details the design decisions behind Toto 2.0: its architecture and training recipe, training data, and the u-muP hyperparameter transfer pipeline. All five base checkpoints are released under Apache 2.0.

representative citing papers

Falcon-X: A Time Series Foundation Model for Heterogeneous Multivariate Modeling

cs.LG · 2026-05-26 · unverdicted · novelty 5.0

Falcon-X introduces a latent prototype space with Unified Prototype Diff-Attention and Latent Entity Attention for heterogeneous multivariate time series forecasting.
