MetaDent supplies a new annotated dental image dataset and shows that current VLMs achieve only moderate accuracy with inconsistent descriptions on fine-grained intraoral scenes.
The reference captions are generated by GPT-OSS to serve as high-quality ground truth descriptions
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CV 1years
2026 1verdicts
CONDITIONAL 1representative citing papers
citing papers explorer
-
MetaDent: Labeling Clinical Images for Vision-Language Models in Dentistry
MetaDent supplies a new annotated dental image dataset and shows that current VLMs achieve only moderate accuracy with inconsistent descriptions on fine-grained intraoral scenes.