Digital diagnostics: the potential of large language models in recognizing symptoms of common illnesses
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CL (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
When Symptoms Are Not Enough: Evidence-Weighting Patterns in Large Language Model Psychiatric Screening
LLMs reach moderate accuracy on a new psychiatric interview benchmark but systematically discount explicit symptoms when preserved functioning or protective factors are present.