Large-scale experiments reveal that uncertainty quantification in defect prediction is highly context-dependent, with UQ-performance and UQ-calibration correlations varying by setting and often weakening or reversing in cross-project scenarios.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
Empirical study across multiple benchmarks finds the link between uncertainty estimators and LLM hallucinations is highly variable and often weak.
citing papers explorer
-
Understanding Software Defect Prediction: A Large-scale Empirical Study Across Uncertainty Quantification and Performance Evaluation
Large-scale experiments reveal that uncertainty quantification in defect prediction is highly context-dependent, with UQ-performance and UQ-calibration correlations varying by setting and often weakening or reversing in cross-project scenarios.
-
Evaluating the Relevance of Uncertainty Estimators for LLM Hallucination
Empirical study across multiple benchmarks finds the link between uncertainty estimators and LLM hallucinations is highly variable and often weak.