A diagnostic framework discovers prevalent input-dependent calibration heterogeneity in LLMs via a calibration-aware representation and kernel-smoothed signed miscalibration field, enabling local corrections that outperform global methods like temperature scaling in miscalibrated regions.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 2verdicts
UNVERDICTED 2roles
background 1polarities
background 1representative citing papers
MedExAgent models clinical diagnosis as a POMDP with patient and exam noise, then uses supervised fine-tuning followed by DAPO optimization to train an agent that matches larger models on diagnostic accuracy while controlling exam costs.
citing papers explorer
-
Discovery of Hidden Miscalibration Regimes
A diagnostic framework discovers prevalent input-dependent calibration heterogeneity in LLMs via a calibration-aware representation and kernel-smoothed signed miscalibration field, enabling local corrections that outperform global methods like temperature scaling in miscalibrated regions.
-
MedExAgent: Training LLM Agents to Ask, Examine, and Diagnose in Noisy Clinical Environments
MedExAgent models clinical diagnosis as a POMDP with patient and exam noise, then uses supervised fine-tuning followed by DAPO optimization to train an agent that matches larger models on diagnostic accuracy while controlling exam costs.