Can LLM “self-report

2024 · arXiv 2412.00207

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

When Robots Rate Their Own Interactions: Engagement Validity and the Strangeness Failure

cs.RO · 2026-06-22 · conditional · novelty 6.0

LLM robots match humans on engagement ratings in HRI questionnaires but systematically invert strangeness/comfort dimensions across models and live interactions.

Rethinking Psychometric Evaluation of LLMs: When and Why Self-Reports Predict Behavior

cs.AI · 2026-06-10 · unverdicted · novelty 6.0

LLM self-reports predict behavior selectively: TPB reaches human-level coherence within shared conversations but collapses across sessions for primed behaviors, unlike Big 5, with persona prompting stabilizing reports but not actions.

The Unsampled Truth: Psychometrics in SLMs Measure Prompt Artifacts, Not Psychological Constructs

cs.CL · 2026-06-02 · unverdicted · novelty 5.0

SLM responses to psychometric prompts are dominated by prompt artifacts such as personas and option symbols rather than semantic understanding of psychological constructs.

citing papers explorer

Showing 3 of 3 citing papers.

When Robots Rate Their Own Interactions: Engagement Validity and the Strangeness Failure
cs.RO · 2026-06-22 · conditional · none · ref 25

Rethinking Psychometric Evaluation of LLMs: When and Why Self-Reports Predict Behavior
cs.AI · 2026-06-10 · unverdicted · none · ref 10

The Unsampled Truth: Psychometrics in SLMs Measure Prompt Artifacts, Not Psychological Constructs
cs.CL · 2026-06-02 · unverdicted · none · ref 12
