pith. machine review for the scientific record. sign in

arxiv: 1810.06543 · v1 · submitted 2018-10-15 · 💻 cs.CV · cs.AI· cs.RO

Recognition: unknown

Visual Semantic Navigation using Scene Priors

Authors on Pith no claims yet
classification 💻 cs.CV cs.AIcs.RO
keywords semanticknowledgepriorssearchframeworkgraphincorporatingnavigate
0
0 comments X
read the original abstract

How do humans navigate to target objects in novel scenes? Do we use the semantic/functional priors we have built over years to efficiently search and navigate? For example, to search for mugs, we search cabinets near the coffee machine and for fruits we try the fridge. In this work, we focus on incorporating semantic priors in the task of semantic navigation. We propose to use Graph Convolutional Networks for incorporating the prior knowledge into a deep reinforcement learning framework. The agent uses the features from the knowledge graph to predict the actions. For evaluation, we use the AI2-THOR framework. Our experiments show how semantic knowledge improves performance significantly. More importantly, we show improvement in generalization to unseen scenes and/or objects. The supplementary video can be accessed at the following link: https://youtu.be/otKjuO805dE .

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. IGV-RRT: Prior-Real-Time Observation Fusion for Active Object Search in Changing Environments

    cs.RO 2026-03 unverdicted novelty 5.0

    IGV-RRT improves object goal navigation in dynamic indoor environments by combining uncertainty-aware priors from 3D scene graphs with online VLM observations in a real-time tree planner.