Audit of depression detection benchmarks finds that official splits yield unstable model rankings, zero-shot transfer across datasets is weak, and text models but not audio models improve on symptom-dense interview segments.
Detecting depression using vocal, facial and semantic communication cues,
2 Pith papers cite this work. Polarity classification is still indexing.
years
2026 2verdicts
UNVERDICTED 2representative citing papers
A GRU encoder using static acoustic, dynamic, and interaction features from 53 dyads predicts cognitive load related to time pressure, mental work, effort, and performance, with turn-taking linked to temporal demand and imbalanced participation to mental demand.
citing papers explorer
-
A Multi-Probe Audit of Clinical-Interview Depression Detection Benchmarks
Audit of depression detection benchmarks finds that official splits yield unstable model rankings, zero-shot transfer across datasets is weak, and text models but not audio models improve on symptom-dense interview segments.
-
Predicting Cognitive Load from Speech and Interaction Dynamics in Dyadic Conversations
A GRU encoder using static acoustic, dynamic, and interaction features from 53 dyads predicts cognitive load related to time pressure, mental work, effort, and performance, with turn-taking linked to temporal demand and imbalanced participation to mental demand.