A vision-tabular multimodal transformer uses modality tokens, masked self-attention, and stochastic modality dropout to maintain performance under pervasive missing data on MIMIC-CXR and MIMIC-IV for 14-label diagnostic classification.
Prediction of in- tensive care unit length of stay in the MIMIC-IV dataset
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.LG 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Resilient Vision-Tabular Multimodal Learning under Modality Missingness
A vision-tabular multimodal transformer uses modality tokens, masked self-attention, and stochastic modality dropout to maintain performance under pervasive missing data on MIMIC-CXR and MIMIC-IV for 14-label diagnostic classification.