Don’t miss the forest for the trees: In-depth confidence estimation for llms via reasoning over the answer space.arXiv preprint arXiv:2511.14275, 2025

Ante Wang, Weizhi Ma, Yang Liu · 2025 · arXiv 2511.14275

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

A Systematic Evaluation of Black-Box Uncertainty Estimation Methods for Large Language Models

cs.AI · 2026-06-18 · unverdicted · novelty 7.0

A unified benchmark of 24 black-box UE methods for LLMs finds no universal winner but favors methods that reason over answer candidates and hybrid combinations of signals.

citing papers explorer

Showing 1 of 1 citing paper.

A Systematic Evaluation of Black-Box Uncertainty Estimation Methods for Large Language Models cs.AI · 2026-06-18 · unverdicted · none · ref 40
A unified benchmark of 24 black-box UE methods for LLMs finds no universal winner but favors methods that reason over answer candidates and hybrid combinations of signals.

Don’t miss the forest for the trees: In-depth confidence estimation for llms via reasoning over the answer space.arXiv preprint arXiv:2511.14275, 2025

fields

years

verdicts

representative citing papers

citing papers explorer