A multimodal model fuses Whisper acoustic embeddings with LLM-extracted linguistic features via gated fusion to achieve F1 scores of 89.47% and 90.14% on ADReSS and ADReSSo dementia detection benchmarks.
A multimodal approach for dementia detection from spontaneous speech with tensor fusion layer,
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
eess.AS 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Listening Between the Lines: Joint Learning of ASR Embeddings and LLM-Augmented Linguistics for Dementia Detection
A multimodal model fuses Whisper acoustic embeddings with LLM-extracted linguistic features via gated fusion to achieve F1 scores of 89.47% and 90.14% on ADReSS and ADReSSo dementia detection benchmarks.