ShelfGaussian achieves state-of-the-art zero-shot semantic occupancy prediction on Occ3D-nuScenes by jointly supervising Gaussian representations with vision foundation model features at 2D image and 3D scene levels.
Talking to dino: Bridging self- supervised vision backbones with language for open- vocabulary segmentation
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CV 2verdicts
UNVERDICTED 2representative citing papers
DEC combines a DINO backbone, a Chunking and Adapting Module, and CLIP-driven virtual feature synthesis to improve open-set 3D object retrieval on standard benchmarks.
citing papers explorer
-
ShelfGaussian: Shelf-Supervised Open-Vocabulary Gaussian-based 3D Scene Understanding
ShelfGaussian achieves state-of-the-art zero-shot semantic occupancy prediction on Occ3D-nuScenes by jointly supervising Gaussian representations with vision foundation model features at 2D image and 3D scene levels.
-
DINO Eats CLIP: Adapting Beyond Knowns for Open-set 3D Object Retrieval
DEC combines a DINO backbone, a Chunking and Adapting Module, and CLIP-driven virtual feature synthesis to improve open-set 3D object retrieval on standard benchmarks.