pith. sign in

arxiv: 2505.14411 · v4 · pith:ACTK24V7new · submitted 2025-05-20 · 💻 cs.LG

Byte Pair Encoding for Efficient Time Series Forecasting

classification 💻 cs.LG
keywords seriestimetokenizationpatternstokensanalysisbytecomputational
0
0 comments X
read the original abstract

Existing time series tokenization methods predominantly encode a constant number of samples into individual tokens. This inflexible approach can generate excessive tokens for even simple patterns like extended constant values, resulting in substantial computational overhead. Inspired by the success of byte pair encoding, we propose the first pattern-centric tokenization scheme for time series analysis. Based on a discrete vocabulary of frequent motifs, our method merges samples with underlying patterns into tokens, compressing time series adaptively. Exploiting our finite set of motifs and the continuous properties of time series, we further introduce conditional decoding as a lightweight yet powerful post-hoc optimization method, which requires no gradient computation and adds no computational overhead. On recent time series foundation models, our motif-based tokenization improves forecasting performance by 40% and boosts efficiency by 2314% on average. Conditional decoding further reduces MSE by up to 48%. In an extensive analysis, we demonstrate the adaptiveness of our tokenization to diverse temporal patterns, its generalization to unseen data, and its meaningful token representations capturing distinct time series properties, including statistical moments and trends.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Dywave: Event-Aligned Dynamic Tokenization for Heterogeneous IoT Sensing Signals

    cs.LG 2026-05 unverdicted novelty 6.0

    Dywave applies wavelet-based hierarchical decomposition to build dynamic, event-aligned tokens for heterogeneous IoT signals, cutting token length by up to 75% while raising accuracy up to 12% on sequence models.

  2. Dywave: Event-Aligned Dynamic Tokenization for Heterogeneous IoT Sensing Signals

    cs.LG 2026-05 unverdicted novelty 6.0

    Dywave uses wavelet hierarchical decomposition to create event-aligned compact token sequences for heterogeneous IoT signals, yielding up to 12% accuracy gains and 75% shorter inputs on mainstream sequence models acro...