pith. sign in

Trace length is a simple un- certainty signal in reasoning models.arXiv preprint arXiv:2510.10409

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

years

2026 5

verdicts

UNVERDICTED 5

roles

background 1

polarities

background 1

clear filters

representative citing papers

How do LLMs Compute Verbal Confidence

cs.CL · 2026-03-18 · unverdicted · novelty 6.0

Mechanistic experiments on Gemma 3 27B, Qwen 2.5 7B and Magistral Small 24B show verbal confidence is cached at post-answer positions from answer tokens and captures richer answer-quality information beyond token log-probabilities.

citing papers explorer

Showing 2 of 2 citing papers after filters.

  • How Language Models Fail: Token-Level Signatures of Committed and Persistent Reasoning Failures cs.CL · 2026-06-04 · unverdicted · none · ref 2

    LLM reasoning failures split into committed (early lock-in) and persistent-uncertainty modes with distinct token-level signatures that hold across 23 model-dataset pairs in 20 of 23 falsifiable tests.

  • How do LLMs Compute Verbal Confidence cs.CL · 2026-03-18 · unverdicted · none · ref 4

    Mechanistic experiments on Gemma 3 27B, Qwen 2.5 7B and Magistral Small 24B show verbal confidence is cached at post-answer positions from answer tokens and captures richer answer-quality information beyond token log-probabilities.