Empirical study across multiple benchmarks finds the link between uncertainty estimators and LLM hallucinations is highly variable and often weak.
arXiv preprint arXiv:2311.15451 , year=
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CL 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
Mainstream UQ for LLMs reduces to unsupervised clustering of internal generation consistency and therefore cannot detect confident hallucinations or provide reliable safety signals.
citing papers explorer
-
Evaluating the Relevance of Uncertainty Estimators for LLM Hallucination
Empirical study across multiple benchmarks finds the link between uncertainty estimators and LLM hallucinations is highly variable and often weak.
-
Position: Uncertainty Quantification in LLMs is Just Unsupervised Clustering
Mainstream UQ for LLMs reduces to unsupervised clustering of internal generation consistency and therefore cannot detect confident hallucinations or provide reliable safety signals.