EndoVLA: Dual-phase vision-language-action model for autonomous tracking in endoscopy

2025 · arXiv 2505.15206

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

A Vision-Language-Action Model for Adaptive Ultrasound-Guided Needle Insertion and Needle Tracking

cs.RO · 2026-04-22 · unverdicted · novelty 6.0

A VLA model with Cross-Depth Fusion tracking head and TraCon register unifies needle tracking and adaptive insertion control, outperforming prior trackers and manual operation in experiments.

citing papers explorer


A Vision-Language-Action Model for Adaptive Ultrasound-Guided Needle Insertion and Needle Tracking

cs.RO · 2026-04-22 · unverdicted · none · ref 10
