hub Mixed citations

A decoder-only foundation model for time-series forecasting

Abhimanyu Das, Weihao Kong, Rajat Sen, Yichen Zhou · 2023 · cs.CL · arXiv 2310.10688

Mixed citation behavior: 43 Pith papers cite this work, and the most common role is background (60% of classified citations).


abstract

Motivated by recent advances in large language models for Natural Language Processing (NLP), we design a time-series foundation model for forecasting whose out-of-the-box zero-shot performance on a variety of public datasets comes close to the accuracy of state-of-the-art supervised forecasting models for each individual dataset. Our model is based on pretraining a patched-decoder style attention model on a large time-series corpus, and can work well across different forecasting history lengths, prediction lengths and temporal granularities.

citation-role summary

background: 6 · baseline: 3 · method: 1

citation-polarity summary

background: 6 · baseline: 3 · use method: 1

representative citing papers

PolyBench: Benchmarking LLM Forecasting and Trading Capabilities on Live Prediction Market Data

q-fin.CP · 2026-04-03 · conditional · novelty 8.0

Only two of seven LLMs produce positive returns on live Polymarket data, with MiMo-V2-Flash at 17.6% CWR and Gemini-3-Flash at 6.2% CWR while the other five lose money.

A Three-Phase Foundation Model for Tax-Aware Personalized Portfolio Management

cs.AI · 2026-06-30 · unverdicted · novelty 7.0

A three-phase DRL framework for personalized portfolio management using a ticker-free encoder pretrained with a time series foundation model, an objective-conditioned MoE actor-critic, and inference-time LoRA adaptation from brokerage data.

B[FM]$^2$: Brain Foundation Model via Flow Matching with SplitUNet

cs.LG · 2026-06-18 · unverdicted · novelty 7.0

B[FM]$^2$ pretrains an EEG foundation model on raw signals with flow matching and SplitUNet, reaching SOTA on 7 of 9 tasks using ~30x less data and generating neurologist-indistinguishable synthetic EEG.

CloudCons: A Comprehensive End-to-End Benchmark for Cloud Resource Consolidation

cs.AI · 2026-06-11 · unverdicted · novelty 7.0

CloudCons benchmark shows foundation models' superior zero-shot forecasting does not automatically yield better resource consolidation decisions, with predictive quantile choice acting as a key lever for efficiency-reliability trade-offs.

GNSS-FM: A Self-Supervised Foundation Model for Daily GNSS Displacement Time Series

physics.geo-ph · 2026-06-05 · unverdicted · novelty 7.0

GNSS-FM is a self-supervised foundation model for GNSS displacement time series that outperforms task-specific baselines on 90-day forecasting and seismic step localization after pretraining on global station data.

Forecasting megaelectron-volt electron flux in the Earth's outer radiation belt using supervised machine learning algorithms and a timeseries foundation model

astro-ph.IM · 2026-05-15 · unverdicted · novelty 7.0

Hybrid TimesFM plus ridge regression on covariates forecasts 1-MeV electron flux with average R² of 0.9 on out-of-sample 2024 data, outperforming linear regression, CNN, LSTM and Transformer models.

SurF: A Generative Model for Multivariate Irregular Time Series Forecasting

cs.LG · 2026-05-13 · unverdicted · novelty 7.0

SurF applies the Time Rescaling Theorem as a learnable bijection to create a single generative model for forecasting irregular multivariate event streams that outperforms or matches baselines on six benchmarks.

TimeClaw: A Time-Series AI Agent with Exploratory Execution Learning

cs.AI · 2026-05-11 · unverdicted · novelty 7.0

TimeClaw is an exploratory execution learning system that turns multiple valid tool-use paths into hierarchical distilled experience for improved time-series reasoning without test-time adaptation.

FactoryBench: Evaluating Industrial Machine Understanding

cs.AI · 2026-05-08 · unverdicted · novelty 7.0

FactoryBench reveals that frontier LLMs achieve under 50% on structured causal questions and under 18% on decision-making in industrial robotic telemetry.

Explainable Load Forecasting with Covariate-Informed Time Series Foundation Models

cs.LG · 2026-04-30 · unverdicted · novelty 7.0

Time series foundation models match the performance of specialized models for day-ahead load forecasting while providing explanations that match domain knowledge on weather and calendar effects.

Chronos: Learning the Language of Time Series

cs.LG · 2024-03-12 · conditional · novelty 7.0

Chronos pretrains transformer models on tokenized time series to deliver strong zero-shot forecasting across diverse domains.

Probabilistic Low-Voltage Peak Load Forecasting with Time Series Foundation Models Evaluated on Application-Oriented Metrics

cs.LG · 2026-07-02 · unverdicted · novelty 6.0

Compares foundation models for probabilistic low-voltage load forecasting on 200 real feeders and introduces a grid-planning metric that scores peak prediction by its effect on asset cost-risk decisions.

Aionoscope: Debugging Latent-State Accessibility in Time-Series Representations

cs.LG · 2026-07-01 · unverdicted · novelty 6.0

Aionoscope shows that time-series representations recover coarse signal types reliably but expose dense latent states like phase and amplitude much less reliably, with best dense-probe R² at 0.689 versus oracle 0.999.

Domain-Informed Multi-View Self-Distillation for Astronomical Light-Curve Representation Learning with JEPA

astro-ph.IM · 2026-06-26 · unverdicted · novelty 6.0

A JEPA-based model with domain-informed multi-view self-distillation learns light-curve representations that outperform hand-crafted features on 15 of 16 StarEmbed metrics and adapts competitively to other irregular time-series datasets.

From Forecasting Leaderboards to Deployment Decisions: A Fail-Closed Certification Protocol

cs.LG · 2026-06-23 · unverdicted · novelty 6.0

Presents a fail-closed certification protocol for determining when forecasting leaderboard winners are deployment-actionable, using a traffic dataset to show friction-induced reversals and an audit to prevent overclaiming.

Tyan-WP: A Wind Power Foundation Model for Ultra-Short-Term Probabilistic Forecasting

cs.LG · 2026-06-07 · unverdicted · novelty 6.0

Tyan-WP is a pretrained wind power foundation model that outperforms site-specific TSMs and generic LTSMs in zero-shot ultra-short-term probabilistic forecasting on U.S. and U.K. sites via static embeddings and PAMF module.

GeoGNN: Time Series Geo-Localization using Two-Tower Graph Neural Networks

cs.LG · 2026-06-06 · unverdicted · novelty 6.0

GeoGNN is a two-tower GNN that learns geographic cell embeddings from adjacency graphs and matches them to temporal representations via dot-product similarity plus classification, improving geolocalization accuracy by ~27% on electricity datasets.

GITCO: Gated Inference-Time Context Optimization in TSFMs

cs.AI · 2026-06-03 · unverdicted · novelty 6.0

GITCO delivers +1.95% average MASE reduction on TimesFM 2.5 across 53 datasets by gated inference-time suppression of anomalous patches, capturing 89.9% of the improvement upper bound.

Probabilistic Data-Driven Modelling of Astrophysical Transients: The Neural Process Family for Ultrafast and Class-Agnostic Light Curve Reconstruction with NightLANP

astro-ph.IM · 2026-05-26 · unverdicted · novelty 6.0

Attentive Neural Processes outperform Gaussian Processes and neural networks on light curve interpolation quality, feature recovery, calibration, and speed for 15 transient classes under realistic Rubin cadences.

ChronoVAE-HOPE: Beyond Attention -- A Next-Generation VAE Foundation Model for Specialized Time Series Classification

cs.LG · 2026-05-21 · unverdicted · novelty 6.0 · 2 refs

ChronoVAE-HOPE proposes a VAE foundation model for time series classification that replaces attention with a HOPE Block dual-memory system and uses disentangled trend-seasonal latent representations, pre-trained on Monash and evaluated on UCR datasets.

MILM: Large Language Models for Multimodal Irregular Time Series with Informative Sampling

cs.LG · 2026-05-13 · unverdicted · novelty 6.0

MILM fine-tunes LLMs on XML-encoded multimodal irregular time series via a two-stage process that exploits informative sampling patterns to achieve top performance on EHR classification datasets.

RareCP: Regime-Aware Retrieval for Efficient Conformal Prediction

cs.LG · 2026-05-09 · unverdicted · novelty 6.0

RareCP improves interval efficiency for time series conformal prediction by retrieving and weighting regime-specific calibration examples while adapting to drift and maintaining coverage.

Continuity Laws for Sequential Models

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

S4 models exhibit stable time-continuity unlike sensitive S6 models, with task continuity predicting performance and enabling temporal subsampling for better efficiency.

FETS Benchmark: Foundation Models Outperform Dataset-specific Machine Learning in Energy Time Series Forecasting

cs.LG · 2026-04-24 · unverdicted · novelty 6.0

Foundation models outperform dataset-specific machine learning in energy time series forecasting across 54 datasets in 9 categories.
