A liveness-based Bellman operator enables conservative offline policy evaluation for manipulation tasks by encoding task progression and reducing truncation bias from finite horizons.
Dex- terous manipulation with deep reinforcement learning: Efficient, gen- eral, and low-cost, in: 2019 International Conference on Robotics and Automation (ICRA), pp
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 3roles
background 1polarities
background 1representative citing papers
A conformal prediction certification for belief-space safety filters focuses verification on reliable inference regions to produce less conservative yet high-probability safe filters than standard baselines in human-vehicle simulations.
A VAE-based latent task representation enables automatic curriculum generation in CRL for non-Euclidean navigation tasks, outperforming interpolation and GAN-based methods in experiments.
citing papers explorer
-
Offline Policy Evaluation for Manipulation Policies via Discounted Liveness Formulation
A liveness-based Bellman operator enables conservative offline policy evaluation for manipulation tasks by encoding task progression and reducing truncation bias from finite horizons.
-
Permissive Safety Through Trusted Inference: Verifiable Belief-Space Neural Safety Filters for Assured Interactive Robotics
A conformal prediction certification for belief-space safety filters focuses verification on reliable inference regions to produce less conservative yet high-probability safe filters than standard baselines in human-vehicle simulations.
-
Curriculum reinforcement learning with measurable task representation learning
A VAE-based latent task representation enables automatic curriculum generation in CRL for non-Euclidean navigation tasks, outperforming interpolation and GAN-based methods in experiments.