Sparse video generation propels real-world beyond-the-view vision-language navigation.arXiv preprint arXiv:2602.05827

Hai Zhang, Siqi Liang, Li Chen, Yuxian Li, Yukuan Xu, Yichao Zhong, Fu Zhang, Hongyang Li · 2026 · arXiv 2602.05827

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

PathPainter: Transferring the Generalization Ability of Image Generation Models to Embodied Navigation

cs.RO · 2026-05-08 · unverdicted · novelty 6.0

PathPainter transfers image generation models to embodied navigation by generating traversability masks from BEV images and language instructions while using cross-view localization to reduce odometry drift.

World Model for Robot Learning: A Comprehensive Survey

cs.RO · 2026-04-30 · unverdicted · novelty 3.0

A comprehensive survey that organizes the literature on world models in robot learning, their roles in policy learning, planning, simulation, and video-based generation, with connections to navigation, driving, datasets, and benchmarks.

citing papers explorer

Showing 2 of 2 citing papers.

PathPainter: Transferring the Generalization Ability of Image Generation Models to Embodied Navigation cs.RO · 2026-05-08 · unverdicted · none · ref 23
PathPainter transfers image generation models to embodied navigation by generating traversability masks from BEV images and language instructions while using cross-view localization to reduce odometry drift.
World Model for Robot Learning: A Comprehensive Survey cs.RO · 2026-04-30 · unverdicted · none · ref 68
A comprehensive survey that organizes the literature on world models in robot learning, their roles in policy learning, planning, simulation, and video-based generation, with connections to navigation, driving, datasets, and benchmarks.

Sparse video generation propels real-world beyond-the-view vision-language navigation.arXiv preprint arXiv:2602.05827

fields

years

verdicts

representative citing papers

citing papers explorer