SLA detects noisy labels task-agnostically by standardizing and aggregating validation losses across repeated cross-validation folds, generalizing hard-counting into a continuous estimator that outperforms baselines on fundus data.
Task-Agnostic Noisy Label Detection via Standardized Loss Aggregation
1 Pith paper cite this work. Polarity classification is still indexing.
abstract
Noisy labels are common in large-scale medical imaging datasets due to inter-observer variability and ambiguous cases. We propose a statistically grounded and task-agnostic framework, Standardized Loss Aggregation (SLA), for detecting noisy labels at the sample level. SLA quantifies label reliability by aggregating standardized fold-level validation losses across repeated cross-validation runs. This formulation generalizes discrete hard-counting schemes into a continuous estimator that captures both the frequency and magnitude of performance deviations, yielding interpretable and statistically stable noisiness scores. Experiments on a public fundus dataset demonstrate that SLA consistently outperforms the hard-counting baseline across all noise levels and converges substantially faster, especially under low noise ratios where subtle loss variations are informative. Samples with high SLA scores indicate potentially ambiguous or mislabeled cases, guiding efficient re-annotation and improving dataset reliability for any classification task.
citation-role summary
citation-polarity summary
fields
cs.CV 1years
2026 1verdicts
UNVERDICTED 1roles
background 1polarities
background 1representative citing papers
citing papers explorer
-
Task-Agnostic Noisy Label Detection via Standardized Loss Aggregation
SLA detects noisy labels task-agnostically by standardizing and aggregating validation losses across repeated cross-validation folds, generalizing hard-counting into a continuous estimator that outperforms baselines on fundus data.