IWR improves CRL sample efficiency and performance in interaction-rich manipulation by interaction-aware resampling that preserves mode boundaries, yielding 19.8% average gains and a real-world air-hockey agent.
Title resolution pending
3 Pith papers cite this work. Polarity classification is still indexing.
3
Pith papers citing it
years
2026 3verdicts
UNVERDICTED 3representative citing papers
WorldDP combines a high-level object-centric world model for subgoal planning with a low-level diffusion policy for execution, claiming better performance than baselines on multi-stage robotic manipulation benchmarks.
LAGO decomposes language instructions into predicted latent subgoals and uses soft-minimum trajectory costs to enable robust long-horizon planning in world models.
citing papers explorer
-
Latent Goal Prediction from Language for Model-Based Planning
LAGO decomposes language instructions into predicted latent subgoals and uses soft-minimum trajectory costs to enable robust long-horizon planning in world models.