pith. sign in

arxiv: 2503.01783 · v3 · pith:IPJ5NH63new · submitted 2025-03-03 · 💻 cs.RO · cs.CV

vS-Graphs: Tightly Coupling Visual SLAM and 3D Scene Graphs Exploiting Hierarchical Scene Understanding

classification 💻 cs.RO cs.CV
keywords scenevisualaccuracyframeworkgraphssemanticvs-graphsvslam
0
0 comments X
read the original abstract

Current Visual Simultaneous Localization and Mapping (VSLAM) systems often struggle to create maps that are both semantically rich and easily interpretable. While incorporating semantic scene knowledge aids in building richer maps with contextual associations among mapped objects, representing them in structured formats, such as scene graphs, has not been widely addressed, resulting in complex map comprehension and limited scalability. This paper introduces vS-Graphs, a novel real-time VSLAM framework that integrates vision-based scene understanding with map reconstruction and comprehensible graph-based representation. The framework infers structural elements (i.e., rooms and floors) from detected building components (i.e., walls and ground surfaces) and incorporates them into optimizable 3D scene graphs. This solution enhances the reconstructed map's semantic richness, comprehensibility, and localization accuracy. Extensive experiments on standard benchmarks and real-world datasets demonstrate that vS-Graphs achieves an average of 15.22% accuracy gain across all tested datasets compared to state-of-the-art VSLAM methods. Furthermore, the proposed framework achieves environment-driven semantic entity detection accuracy comparable to that of precise LiDAR-based frameworks, using only visual features. The code is publicly available at https://github.com/snt-arg/visual_sgraphs and is actively being improved. Moreover, a web page containing more media and evaluation outcomes is available on https://snt-arg.github.io/vsgraphs-results/.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Passage-Aware Structural Mapping for RGB-D Visual SLAM

    cs.RO 2026-04 unverdicted novelty 6.0

    A passage-aware structural mapping approach for RGB-D VSLAM detects doors and openings via joint geometric-semantic-topological fusion and adds passage abstractions to vS-Graphs scene graphs.

  2. Situationally-aware Path Planning Exploiting 3D Scene Graphs

    cs.RO 2025-08 unverdicted novelty 5.0

    S-Path uses the metric-semantic structure of indoor 3D scene graphs for two-stage planning with parallel subproblem decomposition and heuristic reuse on replanning, reporting 6x average planning time reduction versus ...