A T-estimation-based procedure for adaptive density estimation and optimal control in offline contextual MDPs without stationarity, providing oracle risk bounds under two loss functions and finite-sample cost guarantees.
On the Markov chain central limit theorem
3 Pith papers cite this work, alongside 241 external citations. Polarity classification is still indexing.
years
2026 3verdicts
UNVERDICTED 3representative citing papers
New discrete-time approximations to SG(L)D enable accurate non-asymptotic predictions of covariance and integrated autocorrelation time for practical tuning in large-batch or misspecified regimes.
Weak convergence rates of Markov transition kernels imply variance convergence bounds for Lipschitz functions and chi-squared divergence bounds under reversibility with Lipschitz initial densities.
citing papers explorer
-
Adaptive Estimation and Optimal Control in Offline Contextual MDPs without Stationarity
A T-estimation-based procedure for adaptive density estimation and optimal control in offline contextual MDPs without stationarity, providing oracle risk bounds under two loss functions and finite-sample cost guarantees.
-
Accurate Large-sample Uncertainty Quantification using Stochastic Gradient Markov Chain Monte Carlo
New discrete-time approximations to SG(L)D enable accurate non-asymptotic predictions of covariance and integrated autocorrelation time for practical tuning in large-batch or misspecified regimes.
-
Implications of weak convergence rates of Markov transition kernels
Weak convergence rates of Markov transition kernels imply variance convergence bounds for Lipschitz functions and chi-squared divergence bounds under reversibility with Lipschitz initial densities.