In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 14397–14413

Fantom: A benchmark for stress-testing machine theory of mind in interactions · 2023

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

CoSToM: Causal-oriented Steering for Intrinsic Theory-of-Mind Alignment in Large Language Models

cs.CL · 2026-04-11 · unverdicted · novelty 6.0

CoSToM maps ToM features inside LLMs with causal tracing and steers activations in critical layers to boost intrinsic social reasoning and dialogue quality.
