arXiv preprint arXiv:2312.12832 , year=

Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning , author= · arXiv 2312.12832

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Scaling with Confidence: Calibrating Confidence of LLMs for Adaptive Test Time Scaling

cs.AI · 2026-07-02 · unverdicted · novelty 5.0

C3RL is a new RL algorithm combining correctness, calibration, and reference accuracy rewards to improve LLM confidence calibration, enabling CAS to outperform majority voting with up to 12.33x lower inference cost.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Scaling with Confidence: Calibrating Confidence of LLMs for Adaptive Test Time Scaling cs.AI · 2026-07-02 · unverdicted · none · ref 32
C3RL is a new RL algorithm combining correctness, calibration, and reference accuracy rewards to improve LLM confidence calibration, enabling CAS to outperform majority voting with up to 12.33x lower inference cost.

arXiv preprint arXiv:2312.12832 , year=

fields

years

verdicts

representative citing papers

citing papers explorer