Mentalchat16k: A benchmark dataset for conversational mental health assistance, 2025

Jia Xu, Tianyi Wei, Bojian Hou, Patryk Orzechowski, Shu Yang, Ruochen Jin, Rachael Paulbeck, Joost Wagenaar, George Demiris, Li Shen · 2025

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

CounselBench: A Large-Scale Expert Evaluation and Adversarial Benchmarking of Large Language Models in Mental Health Question Answering

cs.CL · 2025-06-10 · conditional · novelty 7.0

CounselBench introduces expert-rated evaluations and an adversarial test set showing LLMs frequently produce unconstructive, overgeneralized, or unsafe responses in mental health QA compared to human therapists.

citing papers explorer

CounselBench: A Large-Scale Expert Evaluation and Adversarial Benchmarking of Large Language Models in Mental Health Question Answering

cs.CL · 2025-06-10 · conditional · none · ref 67
