Nonasymptotic clt and error bounds for two-time-scale stochastic approximation.arXiv preprint arXiv:2502.09884

Seo Taek Kong, Sihan Zeng, Thinh T Doan, R Srikant · 2025 · arXiv 2502.09884

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Gaussian Approximation for Asynchronous Q-learning

stat.ML · 2026-04-08 · unverdicted · novelty 7.0

Derived rates of order up to n^{-1/6} log^4(n S A) for the high-dimensional CLT of averaged asynchronous Q-learning iterates, plus a general martingale-difference CLT.

Central Limit Theorems for Asynchronous Averaged Q-Learning

cs.LG · 2025-09-23 · unverdicted · novelty 6.0

Establishes non-asymptotic and functional central limit theorems for asynchronous averaged Q-learning with explicit rates depending on iterations, state-action space, discount factor, and exploration quality.

citing papers explorer

Showing 2 of 2 citing papers.

Gaussian Approximation for Asynchronous Q-learning stat.ML · 2026-04-08 · unverdicted · none · ref 27
Derived rates of order up to n^{-1/6} log^4(n S A) for the high-dimensional CLT of averaged asynchronous Q-learning iterates, plus a general martingale-difference CLT.
Central Limit Theorems for Asynchronous Averaged Q-Learning cs.LG · 2025-09-23 · unverdicted · none · ref 6
Establishes non-asymptotic and functional central limit theorems for asynchronous averaged Q-learning with explicit rates depending on iterations, state-action space, discount factor, and exploration quality.

Nonasymptotic clt and error bounds for two-time-scale stochastic approximation.arXiv preprint arXiv:2502.09884

fields

years

verdicts

representative citing papers

citing papers explorer