pith. sign in

arxiv: 2203.12667 · v3 · pith:6A5NSML2new · submitted 2022-03-22 · 💻 cs.CV · cs.AI· cs.CL· cs.LG

Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions

classification 💻 cs.CV cs.AIcs.CLcs.LG
keywords researchtaskscurrentfuturegoallanguagemethodsnatural
0
0 comments X
read the original abstract

A long-term goal of AI research is to build intelligent agents that can communicate with humans in natural language, perceive the environment, and perform real-world tasks. Vision-and-Language Navigation (VLN) is a fundamental and interdisciplinary research topic towards this goal, and receives increasing attention from natural language processing, computer vision, robotics, and machine learning communities. In this paper, we review contemporary studies in the emerging field of VLN, covering tasks, evaluation metrics, methods, etc. Through structured analysis of current progress and challenges, we highlight the limitations of current VLN and opportunities for future work. This paper serves as a thorough reference for the VLN research community.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Progress-Think: Semantic Progress Reasoning for Vision-Language Navigation

    cs.RO 2025-11 unverdicted novelty 6.0

    Semantic progress reasoning predicts instruction-style advancement from visual history to guide policies, yielding state-of-the-art success and efficiency on R2R-CE and RxR-CE.