Dataset of breast ultrasound images
4 Pith papers cite this work.
verdicts
UNVERDICTED 4
citing papers explorer
- TinyUSFM: Towards Compact and Efficient Ultrasound Foundation Models
  TinyUSFM distills a large ultrasound foundation model into a lightweight version using feature-gradient coreset selection and domain-separated masked image modeling, matching performance on a new 18-dataset benchmark with 6.36% of the parameters.
- Rad-VLSM: A Cross-Modal Framework with Semantics-Assisted Prompting for Medical Segmentation and Diagnosis
  Rad-VLSM is a cross-modal two-stage framework that converts semantic guidance from BLIP-2 into box prompts for SAM-based lesion segmentation and then uses the resulting masks as spatial priors in a visual-radiomics fusion head for diagnosis.
- Deep Reprogramming Distillation for Medical Foundation Models
  DRD introduces a reprogramming module and CKA-based distillation to enable efficient, robust adaptation of medical foundation models to downstream 2D/3D classification and segmentation tasks, outperforming prior PEFT and KD methods on 18 tasks.
- Focal Modulation and Bidirectional Feature Fusion Network for Medical Image Segmentation
  FM-BFF-Net combines focal modulation attention with bidirectional encoder-decoder fusion in a CNN-transformer architecture and reports higher Dice and Jaccard scores than recent methods across eight medical image datasets.