SAM 3 outperforms SAM 2 under click prompting for zero-shot 3D medical segmentation across 16 datasets and 54 structures, with fewer failure modes in prompt-frame over-segmentation and prediction retention.
Endonet: a deep architecture for recognition tasks on laparoscopic videos
2 Pith papers cite this work. Polarity classification is still indexing.
representative citing papers
SurgMotion outperforms prior methods on 17 surgical video benchmarks by shifting pretraining to latent motion prediction with motion-guided masking, affinity distillation, and diversity regularization on a 15M-sample dataset.
citing papers explorer
-
Comparing SAM 2 and SAM 3 for Zero-Shot Segmentation of 3D Medical Data
SAM 3 outperforms SAM 2 under click prompting for zero-shot 3D medical segmentation across 16 datasets and 54 structures, with fewer failure modes in prompt-frame over-segmentation and prediction retention.
-
SurgMotion: A Video-Native Foundation Model for Universal Understanding of Surgical Videos
SurgMotion outperforms prior methods on 17 surgical video benchmarks by shifting pretraining to latent motion prediction with motion-guided masking, affinity distillation, and diversity regularization on a 15M-sample dataset.