TimeLAVA: Learning-Agnostic Data Valuation for Time Series
Pith reviewed 2026-06-26 19:22 UTC · model grok-4.3
The pith
TimeLAVA values time series segments by their marginal contribution to a selective wavelet-based Wasserstein discrepancy, without training any model.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
TimeLAVA values temporal segments according to their marginal contribution to minimizing a Selective Wavelet-based Wasserstein discrepancy between the evaluated data and a reference distribution. The discrepancy uses multi-scale wavelet transforms to localize temporal features and unbalanced optimal transport to handle shifts. Values are obtained without training any predictive model, and the framework proves that these values bound generalization error in a model-agnostic sense while having limited sensitivity to contaminated samples.
What carries the argument
The Selective Wavelet-based Wasserstein discrepancy, which combines wavelet decomposition for multi-scale temporal localization with unbalanced optimal transport for robustness to distributional shifts; segment values are then obtained from its sensitivity analysis.
If this is right
- The value scores improve results on anomaly detection, data pruning, and label noise detection across diverse real-world time series.
- Valuation is linked by proof to model-agnostic generalization bounds.
- Sensitivity to outlier contamination is provably bounded.
- The method captures non-stationary dynamics and multi-scale patterns that i.i.d. valuation methods miss.
Where Pith is reading between the lines
- The same wavelet-transport construction could be tested on other ordered data such as audio waveforms or video frames.
- Because computation relies only on sensitivity analysis, the approach might be adapted to streaming time series for continuous re-valuation.
- Controlled synthetic experiments with known segment quality could provide a direct check on whether the discrepancy truly tracks intrinsic value.
- The efficiency gain from avoiding model retraining suggests the method could scale to long multivariate series where retraining costs are high.
Load-bearing premise
That the marginal contribution of a temporal segment to reducing the selective wavelet-based Wasserstein discrepancy serves as a valid proxy for the segment's intrinsic quality.
What would settle it
An experiment in which high-value segments identified by TimeLAVA are pruned and downstream model performance fails to degrade more than when low-value or random segments are pruned instead.
Figures
read the original abstract
Data valuation quantifies the intrinsic quality of individual samples to enable principled data curation, quality control, and robust learning. For time series in critical domains such as healthcare, finance, and industrial monitoring, effective valuation methods are essential yet fundamentally lacking. Existing approaches are either model-dependent, limiting their generalizability, or designed for i.i.d. data and thus fail to capture temporal dependencies, multi-scale patterns, and non-stationary dynamics inherent to sequential data. We introduce TimeLAVA, a learning-agnostic framework that values temporal segments by their marginal contribution to minimizing distributional discrepancy between evaluated and reference data. At its core is a novel Selective Wavelet-based Wasserstein discrepancy combining multi-scale wavelet transforms for temporal localization with unbalanced optimal transport for robustness to distributional shifts. Segment values are efficiently computed via sensitivity analysis without requiring model training and aggregated into point-wise scores. We provide theoretical guarantees linking valuation to model-agnostic generalization and prove bounded sensitivity to outlier contamination. Extensive experiments across anomaly detection, data pruning, and label noise detection demonstrate that TimeLAVA produces significantly more informative value scores than existing methods on diverse real-world datasets.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces TimeLAVA, a learning-agnostic framework for data valuation in time series. Temporal segments are valued by their marginal contribution to minimizing a Selective Wavelet-based Wasserstein discrepancy that combines multi-scale wavelet transforms with unbalanced optimal transport. Values are obtained via sensitivity analysis without model training, aggregated to pointwise scores, and supported by claimed theoretical guarantees on model-agnostic generalization and bounded outlier sensitivity. Experiments on anomaly detection, data pruning, and label noise detection across real-world datasets are asserted to show superior informativeness relative to existing methods.
Significance. If the central mapping from marginal discrepancy reduction to intrinsic segment quality holds, the work would fill a clear gap by supplying a model-free valuation method that respects temporal structure and non-stationarity. The wavelet-plus-unbalanced-OT construction and the sensitivity-analysis route to computation are potentially practical strengths; the claimed generalization bounds and outlier robustness, if rigorously established, would further differentiate the approach from i.i.d.-centric or model-dependent baselines.
major comments (2)
- [Abstract] Abstract (framework core paragraph): the premise that marginal contribution to the selective wavelet-based Wasserstein discrepancy constitutes a valid model-agnostic proxy for segment quality is load-bearing for both the theoretical guarantees and the empirical claims, yet the abstract supplies no argument that this discrepancy is monotonic with downstream utility or that the sensitivity analysis isolates segment contributions independently of reference-set statistics.
- [Abstract] Abstract (theoretical guarantees sentence): the claimed bounds on generalization and outlier sensitivity are asserted without any indication of the proof strategy, key assumptions, or the precise statement of the discrepancy measure, preventing assessment of whether the non-stationary regimes typical of the target domains are covered.
minor comments (1)
- [Abstract] The abstract repeatedly uses 'selective' without a concise definition; a one-sentence gloss would improve readability.
Simulated Author's Rebuttal
We thank the referee for the detailed comments on the abstract. We agree that the abstract can more explicitly articulate the justification for the core premise and the scope of the theoretical claims. We will revise the abstract in the next version to address these points while respecting length constraints.
read point-by-point responses
-
Referee: [Abstract] Abstract (framework core paragraph): the premise that marginal contribution to the selective wavelet-based Wasserstein discrepancy constitutes a valid model-agnostic proxy for segment quality is load-bearing for both the theoretical guarantees and the empirical claims, yet the abstract supplies no argument that this discrepancy is monotonic with downstream utility or that the sensitivity analysis isolates segment contributions independently of reference-set statistics.
Authors: The full paper (Section 3) establishes monotonicity of the selective wavelet-based Wasserstein discrepancy with respect to inclusion of informative segments via the unbalanced OT formulation and shows that sensitivity analysis isolates marginal contributions through first-order derivatives independent of reference-set statistics under the chosen weighting. We acknowledge the abstract does not preview this argument. We will revise the abstract to add a brief clause: 'whose monotonicity with downstream utility follows from the unbalanced transport cost and multi-scale localization.' revision: yes
-
Referee: [Abstract] Abstract (theoretical guarantees sentence): the claimed bounds on generalization and outlier sensitivity are asserted without any indication of the proof strategy, key assumptions, or the precise statement of the discrepancy measure, preventing assessment of whether the non-stationary regimes typical of the target domains are covered.
Authors: Abstract length precludes full proof details, but we accept that a minimal indication of assumptions would help. The bounds rely on Lipschitz continuity of wavelet features and moment bounds on the distributions; the discrepancy is the selective wavelet-based Wasserstein distance (Eq. 3). Non-stationarity is addressed by the multi-scale wavelet decomposition. We will revise the abstract to append 'under Lipschitz and moment assumptions that accommodate non-stationarity' after the guarantees sentence. revision: yes
Circularity Check
No significant circularity in derivation chain
full rationale
The paper defines segment values explicitly as marginal contributions to its proposed Selective Wavelet-based Wasserstein discrepancy (a novel combination of multi-scale wavelets and unbalanced OT) and supplies separate theoretical guarantees plus empirical tests on anomaly detection, pruning, and noise detection. No step reduces by construction to a fitted parameter renamed as prediction, a self-citation chain, or a definitional tautology; the discrepancy is an independent proposed construct whose link to generalization is asserted via analysis rather than identity with the input definition. The derivation remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Marginal contribution to the selective wavelet-based Wasserstein discrepancy measures intrinsic temporal segment quality for generalization
Reference graph
Works this paper leans on
-
[1]
The Annals of Applied Statistics , volume=
Compression of climate simulations with a nonstationary global SpatioTemporal SPDE model , author=. The Annals of Applied Statistics , volume=. 2020 , publisher=
2020
-
[2]
Spatial Statistics , volume=
Non-stationary spatio-temporal modeling using the stochastic advection--diffusion equation , author=. Spatial Statistics , volume=. 2024 , publisher=
2024
-
[3]
Spatial Statistics , volume=
Estimation of a non-stationary model for annual precipitation in southern Norway using replicates of the spatial field , author=. Spatial Statistics , volume=. 2015 , publisher=
2015
-
[4]
Journal of the American Statistical Association , volume=
Approximate likelihood for large irregularly spaced spatial data , author=. Journal of the American Statistical Association , volume=. 2007 , publisher=
2007
-
[5]
Statistica Sinica , pages=
Exploring a new class of non-stationary spatial Gaussian random fields with varying local anisotropy , author=. Statistica Sinica , pages=. 2015 , publisher=
2015
-
[6]
Spatial Statistics , volume=
Does non-stationary spatial data always require non-stationary random fields? , author=. Spatial Statistics , volume=. 2015 , publisher=
2015
-
[7]
Journal of the American Statistical Association , volume=
Statistical modeling for spatio-temporal data from stochastic convection-diffusion processes , author=. Journal of the American Statistical Association , volume=. 2022 , publisher=
2022
-
[8]
2023 , booktitle =
Kwon, Yongchan and Zou, James , title =. 2023 , booktitle =
2023
-
[9]
Hoang Anh Just and Feiyang Kang and Tianhao Wang and Yi Zeng and Myeongseob Ko and Ming Jin and Ruoxi Jia , booktitle=
-
[10]
The Thirteenth International Conference on Learning Representations , year=
Optimal Transport for Time Series Imputation , author=. The Thirteenth International Conference on Learning Representations , year=
-
[11]
Ingredients
Data Valuation in Machine Learning: "Ingredients", Strategies, and Open Challenges , author =. Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence,. 2022 , month =
2022
-
[12]
The Thirteenth International Conference on Learning Representations , year=
TimeInf: Time Series Data Contribution via Influence Functions , author=. The Thirteenth International Conference on Learning Representations , year=
-
[13]
2017 , booktitle =
Koh, Pang Wei and Liang, Percy , title =. 2017 , booktitle =
2017
-
[14]
Ingredients
Data Valuation in Machine Learning:" Ingredients", Strategies, and Open Challenges. , author=. IJCAI , pages=
-
[15]
The Twelfth International Conference on Learning Representations , year=
iTransformer: Inverted Transformers Are Effective for Time Series Forecasting , author=. The Twelfth International Conference on Learning Representations , year=
-
[16]
Mathematics of computation , volume=
Scaling algorithms for unbalanced optimal transport problems , author=. Mathematics of computation , volume=
-
[17]
Advances in neural information processing systems , volume=
Sinkhorn distances: Lightspeed computation of optimal transport , author=. Advances in neural information processing systems , volume=
-
[18]
Foundations and Trends
Computational optimal transport: With applications to data science , author=. Foundations and Trends. 2019 , publisher=
2019
-
[19]
2008 , publisher=
Optimal transport: old and new , author=. 2008 , publisher=
2008
-
[20]
Advances in Neural Information Processing Systems , volume=
Geometric dataset distances via optimal transport , author=. Advances in Neural Information Processing Systems , volume=
-
[21]
The 41st international ACM SIGIR conference on research & development in information retrieval , pages=
Modeling long-and short-term temporal patterns with deep neural networks , author=. The 41st international ACM SIGIR conference on research & development in information retrieval , pages=
-
[22]
The 22nd international conference on artificial intelligence and statistics , pages=
Interpolating between optimal transport and mmd using sinkhorn divergences , author=. The 22nd international conference on artificial intelligence and statistics , pages=. 2019 , organization=
2019
-
[23]
Journal of Machine Learning Research , volume=
Pot: Python optimal transport , author=. Journal of Machine Learning Research , volume=
-
[24]
Journal of the American Statistical Association , volume=
The influence curve and its role in robust estimation , author=. Journal of the American Statistical Association , volume=. 1974 , publisher=
1974
-
[25]
The Annals of Statistics , volume=
Infinitesimal robustness for autoregressive processes , author=. The Annals of Statistics , volume=. 1984 , publisher=
1984
-
[26]
The Annals of Statistics , volume=
Influence functionals for time series , author=. The Annals of Statistics , volume=. 1986 , publisher=
1986
-
[27]
International Conference on Machine Learning (ICML) , pages=
Data shapley: Equitable valuation of data for machine learning , author=. International Conference on Machine Learning (ICML) , pages=. 2019 , publisher=
2019
-
[28]
The 22nd International Conference on Artificial Intelligence and Statistics (AISTATS) , pages=
Towards efficient data valuation based on the shapley value , author=. The 22nd International Conference on Artificial Intelligence and Statistics (AISTATS) , pages=. 2019 , series=
2019
-
[29]
Advances in Neural Information Processing Systems (NeurIPS) , volume=
Validation free and replication robust volume-based data valuation , author=. Advances in Neural Information Processing Systems (NeurIPS) , volume=. 2021 , editor=
2021
-
[30]
Probability Theory and Related Fields , volume=
Entropic optimal transport: Convergence of potentials , author=. Probability Theory and Related Fields , volume=. 2022 , publisher=
2022
-
[31]
International Conference on Machine Learning , pages=
On unbalanced optimal transport: An analysis of sinkhorn algorithm , author=. International Conference on Machine Learning , pages=. 2020 , organization=
2020
-
[32]
2015 , howpublished =
Trindade, Artur , title =. 2015 , howpublished =
2015
-
[33]
Advances in Neural Information Processing Systems , volume=
Softpatch: Unsupervised anomaly detection with noisy data , author=. Advances in Neural Information Processing Systems , volume=
-
[34]
Proceedings of the VLDB Endowment , volume=
Anomaly detection in time series: a comprehensive evaluation , author=. Proceedings of the VLDB Endowment , volume=. 2022 , publisher=
2022
-
[35]
IEEE transactions on knowledge and data engineering , volume=
Current time series anomaly detection benchmarks are flawed and are creating the illusion of progress , author=. IEEE transactions on knowledge and data engineering , volume=. 2021 , publisher=
2021
-
[36]
Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining , pages=
Detecting spacecraft anomalies using lstms and nonparametric dynamic thresholding , author=. Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining , pages=
-
[37]
Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining , pages=
Robust anomaly detection for multivariate time series through stochastic recurrent neural network , author=. Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining , pages=
-
[38]
Proceedings of the 27th ACM SIGKDD conference on knowledge discovery & data mining , pages=
Practical approach to asynchronous multivariate time series anomaly detection and localization , author=. Proceedings of the 27th ACM SIGKDD conference on knowledge discovery & data mining , pages=
-
[39]
2016 international workshop on cyber-physical systems for smart water networks (CySWater) , pages=
SWaT: A water treatment testbed for research and training on ICS security , author=. 2016 international workshop on cyber-physical systems for smart water networks (CySWater) , pages=. 2016 , organization=
2016
-
[40]
International Conference on Learning Representations , year=
Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy , author=. International Conference on Learning Representations , year=
-
[41]
Proceedings of the 29th ACM SIGKDD conference on knowledge discovery and data mining , pages=
Dcdetector: Dual attention contrastive representation learning for time series anomaly detection , author=. Proceedings of the 29th ACM SIGKDD conference on knowledge discovery and data mining , pages=
-
[42]
, title =
Tuli, Shreshth and Casale, Giuliano and Jennings, Nicholas R. , title =. Proc. VLDB Endow. , month = feb, pages =. 2022 , issue_date =
2022
-
[43]
Advances in neural information processing systems , volume=
Approximate cross-validation for structured models , author=. Advances in neural information processing systems , volume=
-
[44]
2008 eighth ieee international conference on data mining , pages=
Isolation forest , author=. 2008 eighth ieee international conference on data mining , pages=. 2008 , organization=
2008
-
[45]
Proceedings of the 3rd international conference on knowledge discovery and data mining , pages=
Using dynamic time warping to find patterns in time series , author=. Proceedings of the 3rd international conference on knowledge discovery and data mining , pages=
-
[46]
International conference on machine learning , pages=
Unbalanced minibatch optimal transport; applications to domain adaptation , author=. International conference on machine learning , pages=. 2021 , organization=
2021
-
[47]
1994 , publisher=
Time series analysis , author=. 1994 , publisher=
1994
-
[48]
Scientific data , volume=
MIMIC-III, a freely accessible critical care database , author=. Scientific data , volume=. 2016 , publisher=
2016
-
[49]
Big data
“Big data” in the intensive care unit. Closing the data loop , author=. American journal of respiratory and critical care medicine , volume=
-
[50]
2000 , publisher=
Non-linear time series models in empirical finance , author=. 2000 , publisher=
2000
-
[51]
Computers & Industrial Engineering , volume=
A systematic literature review of machine learning methods applied to predictive maintenance , author=. Computers & Industrial Engineering , volume=. 2019 , publisher=
2019
-
[52]
ACM computing surveys (CSUR) , volume=
Deep learning for anomaly detection: A review , author=. ACM computing surveys (CSUR) , volume=. 2021 , publisher=
2021
-
[53]
Applied soft computing , volume=
Financial time series forecasting with deep learning: A systematic literature review: 2005--2019 , author=. Applied soft computing , volume=. 2020 , publisher=
2005
-
[54]
Advances in Neural Information Processing Systems , volume=
Opendataval: a unified benchmark for data valuation , author=. Advances in Neural Information Processing Systems , volume=
-
[55]
SIAM review , volume=
Continuous and discrete wavelet transforms , author=. SIAM review , volume=. 1989 , publisher=
1989
-
[56]
Unsupervised real-time anomaly detection for streaming data , journal =
Subutai Ahmad and Alexander Lavin and Scott Purdy and Zuha Agha , keywords =. Unsupervised real-time anomaly detection for streaming data , journal =. 2017 , note =
2017
-
[57]
, title =
Ahmed, Chuadhry Mujeeb and Palleti, Venkata Reddy and Mathur, Aditya P. , title =. Proceedings of the 3rd International Workshop on Cyber-Physical Systems for Smart Water Networks , pages =. 2017 , publisher =
2017
-
[58]
and Wei Fan and Wenke Lee and Prodromidis, A
Stolfo, S.J. and Wei Fan and Wenke Lee and Prodromidis, A. and Chan, P.K. , booktitle=. Cost-based modeling for fraud and intrusion detection: results from the JAM project , year=
-
[59]
Applied sciences , volume=
Wavelet transform application for/in non-stationary time-series analysis: A review , author=. Applied sciences , volume=. 2019 , publisher=
2019
-
[60]
2004 , publisher=
Fourier analysis of time series: an introduction , author=. 2004 , publisher=
2004
-
[61]
The Thirteenth International Conference on Learning Representations , year=
Learning under Temporal Label Noise , author=. The Thirteenth International Conference on Learning Representations , year=
-
[62]
2013 , howpublished =
Reyes-Ortiz, Jorge and Anguita, Davide and Ghio, Alessandro and Oneto, Luca and Parra, Xavier , title =. 2013 , howpublished =
2013
-
[63]
Logacjov, Aleksej and Ustad, Astrid , title =
-
[64]
2013 , howpublished =
Roesler, Oliver , title =. 2013 , howpublished =
2013
-
[65]
Proceedings of the 19th ACM conference on computer-supported cooperative work & social computing , pages=
You get who you pay for: The impact of incentives on participation bias , author=. Proceedings of the 19th ACM conference on computer-supported cooperative work & social computing , pages=
-
[66]
, author=
Seasonal variation in self-reports of recent alcohol consumption: racial and ethnic differences. , author=. Journal of Studies on Alcohol , volume=. 2003 , publisher=
2003
-
[67]
Journal of Functional Analysis , volume=
Unbalanced optimal transport: Dynamic and Kantorovich formulations , author=. Journal of Functional Analysis , volume=. 2018 , publisher=
2018
-
[68]
arXiv preprint arXiv:1910.12958 , year=
Sinkhorn divergences for unbalanced optimal transport , author=. arXiv preprint arXiv:1910.12958 , year=
arXiv 1910
-
[69]
International Conference on Artificial Intelligence and Statistics , year=
Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning , author=. International Conference on Artificial Intelligence and Statistics , year=
-
[70]
Proceedings of the AAAI conference on artificial intelligence , volume=
Informer: Beyond efficient transformer for long sequence time-series forecasting , author=. Proceedings of the AAAI conference on artificial intelligence , volume=
-
[71]
, journal=
Mallat, S.G. , journal=. A theory for multiresolution signal decomposition: the wavelet representation , year=
-
[72]
Transactions on Machine Learning Research , year=
Data Valuation in the Absence of a Reliable Validation Set , author=. Transactions on Machine Learning Research , year=
-
[73]
Samuel Kessler and Tam Le and Vu Nguyen , booktitle=
-
[74]
Machine Learning and Knowledge Discovery in Databases
Kraus, Maurice and Steinmann, David and Wüst, Antonia and Kokozinski, Andre and Kersting, Kristian , title =. Machine Learning and Knowledge Discovery in Databases. , year =
-
[75]
Luo, Donghao and Wang, Xue , booktitle=. Modern
-
[76]
Xingjian Wu and Xiangfei Qiu and Zhengyu Li and Yihang Wang and Jilin Hu and Chenjuan Guo and Hui Xiong and Bin Yang , booktitle=
-
[77]
arXiv preprint arXiv:1908.08619 , year=
Efficient task-specific data valuation for nearest neighbor algorithms , author=. arXiv preprint arXiv:1908.08619 , year=
arXiv 1908
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.