Pseudo-quantized actor-critic algorithm for robustness to noisy temporal difference error,

· 2026 · arXiv 2604.01613

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Redesigning Regularization for Effective Policy Smoothing

cs.RO · 2026-06-11 · unverdicted · novelty 5.0

Redesigned regularization addresses implementation gaps in policy smoothing for RL, yielding smoother motions with improved performance and robustness on a quadruped robot in sim-to-real settings.

citing papers explorer

Showing 1 of 1 citing paper.

Redesigning Regularization for Effective Policy Smoothing cs.RO · 2026-06-11 · unverdicted · none · ref 34
Redesigned regularization addresses implementation gaps in policy smoothing for RL, yielding smoother motions with improved performance and robustness on a quadruped robot in sim-to-real settings.

Pseudo-quantized actor-critic algorithm for robustness to noisy temporal difference error,

fields

years

verdicts

representative citing papers

citing papers explorer