A decoder-only foundation model for time-series forecasting

Abhimanyu Das, Weihao Kong, Rajat Sen, Yichen Zhou · 2023 · cs.CL · arXiv 2310.10688

Mixed citation behavior. Most common role is background (60%).

24 Pith papers citing it

Background 60% of classified citations

abstract

Motivated by recent advances in large language models for Natural Language Processing (NLP), we design a time-series foundation model for forecasting whose out-of-the-box zero-shot performance on a variety of public datasets comes close to the accuracy of state-of-the-art supervised forecasting models for each individual dataset. Our model is based on pretraining a patched-decoder style attention model on a large time-series corpus, and can work well across different forecasting history lengths, prediction lengths and temporal granularities.

citation-role summary

background 6 · baseline 3 · method 1
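
The 60% background share reported for this paper appears to be taken over the 10 classified citations, not over all 24 citing papers. A minimal sketch of that arithmetic, assuming the three counts in the citation-role summary are the complete set of classified citations (the helper name `role_shares` is hypothetical and not part of any Pith tooling):

```python
# Role counts copied from the citation-role summary.
# Assumption: these three roles cover every classified citation.
role_counts = {"background": 6, "baseline": 3, "method": 1}


def role_shares(counts):
    """Return each role's share of classified citations as a whole-number percentage."""
    total = sum(counts.values())
    return {role: round(100 * n / total) for role, n in counts.items()}


classified = sum(role_counts.values())  # number of classified citations
shares = role_shares(role_counts)       # percentage per role

print(f"{classified} classified citations")
for role, pct in shares.items():
    print(f"{role}: {pct}%")
```

Under this reading, the remaining citing papers simply have no classified role yet, which is why the counts sum to fewer than 24.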

citation-polarity summary

background 6 · baseline 3 · use method 1

representative citing papers

PolyBench: Benchmarking LLM Forecasting and Trading Capabilities on Live Prediction Market Data

q-fin.CP · 2026-04-03 · conditional · novelty 8.0

Only two of seven LLMs produce positive returns on live Polymarket data, with MiMo-V2-Flash at 17.6% CWR and Gemini-3-Flash at 6.2% CWR while the other five lose money.

Forecasting megaelectron-volt electron flux in the Earth's outer radiation belt using supervised machine learning algorithms and a timeseries foundation model

astro-ph.IM · 2026-05-15 · unverdicted · novelty 7.0

Hybrid TimesFM plus ridge regression on covariates forecasts 1-MeV electron flux with average R² of 0.9 on out-of-sample 2024 data, outperforming linear regression, CNN, LSTM and Transformer models.

SurF: A Generative Model for Multivariate Irregular Time Series Forecasting

cs.LG · 2026-05-13 · unverdicted · novelty 7.0

SurF applies the Time Rescaling Theorem as a learnable bijection to create a single generative model for forecasting irregular multivariate event streams that outperforms or matches baselines on six benchmarks.

TimeClaw: A Time-Series AI Agent with Exploratory Execution Learning

cs.AI · 2026-05-11 · unverdicted · novelty 7.0

TimeClaw is an exploratory execution learning system that turns multiple valid tool-use paths into hierarchical distilled experience for improved time-series reasoning without test-time adaptation.

FactoryBench: Evaluating Industrial Machine Understanding

cs.AI · 2026-05-08 · unverdicted · novelty 7.0

FactoryBench reveals that frontier LLMs achieve under 50% on structured causal questions and under 18% on decision-making in industrial robotic telemetry.

Explainable Load Forecasting with Covariate-Informed Time Series Foundation Models

cs.LG · 2026-04-30 · unverdicted · novelty 7.0

Time series foundation models match the performance of specialized models for day-ahead load forecasting while providing explanations that match domain knowledge on weather and calendar effects.

Chronos: Learning the Language of Time Series

cs.LG · 2024-03-12 · conditional · novelty 7.0

Chronos pretrains transformer models on tokenized time series to deliver strong zero-shot forecasting across diverse domains.

MILM: Large Language Models for Multimodal Irregular Time Series with Informative Sampling

cs.LG · 2026-05-13 · unverdicted · novelty 6.0

MILM fine-tunes LLMs on XML-encoded multimodal irregular time series via a two-stage process that exploits informative sampling patterns to achieve top performance on EHR classification datasets.

RareCP: Regime-Aware Retrieval for Efficient Conformal Prediction

cs.LG · 2026-05-09 · unverdicted · novelty 6.0

RareCP improves interval efficiency for time series conformal prediction by retrieving and weighting regime-specific calibration examples while adapting to drift and maintaining coverage.

Continuity Laws for Sequential Models

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

S4 models exhibit stable time-continuity unlike sensitive S6 models, with task continuity predicting performance and enabling temporal subsampling for better efficiency.

FETS Benchmark: Foundation Models Outperform Dataset-specific Machine Learning in Energy Time Series Forecasting

cs.LG · 2026-04-24 · unverdicted · novelty 6.0

Foundation models outperform dataset-specific machine learning in energy time series forecasting across 54 datasets in 9 categories.

Agentic Forecasting using Sequential Bayesian Updating of Linguistic Beliefs

cs.AI · 2026-04-20 · unverdicted · novelty 6.0

BLF achieves state-of-the-art binary forecasting on ForecastBench by using linguistic belief states updated in tool-use loops, hierarchical multi-trial logit averaging, and hierarchical Platt scaling calibration.

Predicting Power-System Dynamic Trajectories with Foundation Models

cs.AI · 2026-04-16 · unverdicted · novelty 6.0

LASS-ODE-Power is a pretrained model that predicts power-system dynamic trajectories across regimes in a zero-shot manner after large-scale ODE pretraining and targeted fine-tuning.

MICA: Multivariate Infini Compressive Attention for Time Series Forecasting

cs.LG · 2026-04-07 · unverdicted · novelty 6.0 · 2 refs

MICA adapts infini compressive attention to the channel dimension, enabling scalable cross-channel dependencies in Transformers and cutting forecast error by 5.4% on average versus channel-independent baselines.

Dynamic Linear Coregionalization for Realistic Synthetic Multivariate Time Series

cs.LG · 2026-04-06 · unverdicted · novelty 6.0 · 2 refs

DynLMC creates synthetic time series data with dynamic inter-channel correlations that improve zero-shot forecasting in foundation models across multiple benchmarks.

Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling

cs.AI · 2026-03-05 · unverdicted · novelty 6.0

Timer-S1 is a released 8.3B-parameter MoE time series model that achieves state-of-the-art MASE and CRPS scores on GIFT-Eval using serial scaling and Serial-Token Prediction.

General Geospatial Inference with a Population Dynamics Foundation Model

cs.LG · 2024-11-11 · unverdicted · novelty 6.0

A GNN-based foundation model on aggregated US geospatial data produces embeddings achieving SOTA on all 27 interpolation tasks and 25/27 extrapolation/super-resolution tasks across health, socioeconomic and environmental domains, plus improved forecasting when combined with TimesFM.

Quantifying the Pre-training Dividend: Generative versus Latent Self-Supervised Learning for Time Series Foundation Models

cs.LG · 2026-05-19 · unverdicted · novelty 5.0

Self-supervised pre-training delivers large gains up to 375% on time series anomaly detection and classification but only marginal benefits for forecasting, driven by a precision-invariance trade-off in the learned representations.

A Quantum Inspired Variational Kernel and Explainable AI Framework for Cross Region Solar and Wind Energy Forecasting

cs.CL · 2026-05-09 · unverdicted · novelty 5.0

A hybrid classical-plus-quantum-inspired framework for cross-region renewable energy forecasting matches top baselines within 1% accuracy and separates calm versus stormy conditions with a 15-fold higher Fisher discriminant ratio than a tuned radial basis kernel.

Degradation-aware Predictive Energy Management for Fuel Cell-Battery Ship Power System with Data-driven Load Forecasting

eess.SY · 2026-04-16 · unverdicted · novelty 5.0

A degradation-aware predictive controller for hybrid ship power systems reduces hydrogen consumption by up to 5.8% and fuel cell degradation by up to 36.4% versus a filter-based benchmark on real harbor tug data.

ChronoVAE-HOPE: Beyond Attention -- A Next-Generation VAE Foundation Model for Specialized Time Series Classification

cs.LG · 2026-05-21

Toto 2.0: Time Series Forecasting Enters the Scaling Era

cs.LG · 2026-05-19

KairosHope: A Next-Generation Time-Series Foundation Model for Specialized Classification via Dual-Memory Architecture

cs.LG · 2026-05-18

TelecomTS: A Multi-Modal Observability Dataset for Time Series and Language Analysis

cs.AI · 2025-10-07

citing papers explorer

Showing 24 of 24 citing papers.

PolyBench: Benchmarking LLM Forecasting and Trading Capabilities on Live Prediction Market Data · q-fin.CP · 2026-04-03 · conditional · none · ref 13 · internal anchor
Forecasting megaelectron-volt electron flux in the Earth's outer radiation belt using supervised machine learning algorithms and a timeseries foundation model · astro-ph.IM · 2026-05-15 · unverdicted · none · ref 23 · internal anchor
SurF: A Generative Model for Multivariate Irregular Time Series Forecasting · cs.LG · 2026-05-13 · unverdicted · none · ref 1 · internal anchor
TimeClaw: A Time-Series AI Agent with Exploratory Execution Learning · cs.AI · 2026-05-11 · unverdicted · none · ref 47 · internal anchor
FactoryBench: Evaluating Industrial Machine Understanding · cs.AI · 2026-05-08 · unverdicted · none · ref 18 · internal anchor
Explainable Load Forecasting with Covariate-Informed Time Series Foundation Models · cs.LG · 2026-04-30 · unverdicted · none · ref 10 · internal anchor
Chronos: Learning the Language of Time Series · cs.LG · 2024-03-12 · conditional · none · ref 16 · internal anchor
MILM: Large Language Models for Multimodal Irregular Time Series with Informative Sampling · cs.LG · 2026-05-13 · unverdicted · none · ref 47 · internal anchor
RareCP: Regime-Aware Retrieval for Efficient Conformal Prediction · cs.LG · 2026-05-09 · unverdicted · none · ref 54 · internal anchor
Continuity Laws for Sequential Models · cs.LG · 2026-05-08 · unverdicted · none · ref 55 · internal anchor
FETS Benchmark: Foundation Models Outperform Dataset-specific Machine Learning in Energy Time Series Forecasting · cs.LG · 2026-04-24 · unverdicted · none · ref 28 · internal anchor
Agentic Forecasting using Sequential Bayesian Updating of Linguistic Beliefs · cs.AI · 2026-04-20 · unverdicted · none · ref 10 · internal anchor
Predicting Power-System Dynamic Trajectories with Foundation Models · cs.AI · 2026-04-16 · unverdicted · none · ref 50 · internal anchor
MICA: Multivariate Infini Compressive Attention for Time Series Forecasting · cs.LG · 2026-04-07 · unverdicted · none · ref 12 · 2 links · internal anchor
Dynamic Linear Coregionalization for Realistic Synthetic Multivariate Time Series · cs.LG · 2026-04-06 · unverdicted · none · ref 3 · 2 links · internal anchor
Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling · cs.AI · 2026-03-05 · unverdicted · none · ref 12 · internal anchor
General Geospatial Inference with a Population Dynamics Foundation Model · cs.LG · 2024-11-11 · unverdicted · none · ref 9 · internal anchor
Quantifying the Pre-training Dividend: Generative versus Latent Self-Supervised Learning for Time Series Foundation Models · cs.LG · 2026-05-19 · unverdicted · none · ref 5 · internal anchor
A Quantum Inspired Variational Kernel and Explainable AI Framework for Cross Region Solar and Wind Energy Forecasting · cs.CL · 2026-05-09 · unverdicted · none · ref 39 · internal anchor
Degradation-aware Predictive Energy Management for Fuel Cell-Battery Ship Power System with Data-driven Load Forecasting · eess.SY · 2026-04-16 · unverdicted · none · ref 9 · internal anchor
ChronoVAE-HOPE: Beyond Attention -- A Next-Generation VAE Foundation Model for Specialized Time Series Classification · cs.LG · 2026-05-21 · unreviewed · ref 9 · internal anchor
Toto 2.0: Time Series Forecasting Enters the Scaling Era · cs.LG · 2026-05-19 · unreviewed · ref 14 · internal anchor
KairosHope: A Next-Generation Time-Series Foundation Model for Specialized Classification via Dual-Memory Architecture · cs.LG · 2026-05-18 · unreviewed · ref 10 · internal anchor
TelecomTS: A Multi-Modal Observability Dataset for Time Series and Language Analysis · cs.AI · 2025-10-07 · unreviewed · ref 6 · internal anchor
