pith. sign in

Avlen: Audio-visual- language embodied navigation in 3d environments

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

citation-role summary

background 2

citation-polarity summary

fields

cs.SD 2

years

2026 2

verdicts

UNVERDICTED 2

roles

background 2

polarities

background 2

representative citing papers

Audio Spatially-Guided Fusion for Audio-Visual Navigation

cs.SD · 2026-04-02 · unverdicted · novelty 5.0

Audio Spatially-Guided Fusion improves generalization in audio-visual navigation on unheard sound sources by extracting spatial audio features and adaptively fusing them with visual data.

citing papers explorer

Showing 2 of 2 citing papers.

  • Spatial-Aware Conditioned Fusion for Audio-Visual Navigation cs.SD · 2026-04-02 · unverdicted · none · ref 21

    SACF discretizes target direction and distance from audio-visual cues then applies conditioned fusion to improve navigation efficiency and generalization to unheard sounds.

  • Audio Spatially-Guided Fusion for Audio-Visual Navigation cs.SD · 2026-04-02 · unverdicted · none · ref 21

    Audio Spatially-Guided Fusion improves generalization in audio-visual navigation on unheard sound sources by extracting spatial audio features and adaptively fusing them with visual data.