pith. sign in

Task-Agnostic Noisy Label Detection via Standardized Loss Aggregation

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it
abstract

Noisy labels are common in large-scale medical imaging datasets due to inter-observer variability and ambiguous cases. We propose a statistically grounded and task-agnostic framework, Standardized Loss Aggregation (SLA), for detecting noisy labels at the sample level. SLA quantifies label reliability by aggregating standardized fold-level validation losses across repeated cross-validation runs. This formulation generalizes discrete hard-counting schemes into a continuous estimator that captures both the frequency and magnitude of performance deviations, yielding interpretable and statistically stable noisiness scores. Experiments on a public fundus dataset demonstrate that SLA consistently outperforms the hard-counting baseline across all noise levels and converges substantially faster, especially under low noise ratios where subtle loss variations are informative. Samples with high SLA scores indicate potentially ambiguous or mislabeled cases, guiding efficient re-annotation and improving dataset reliability for any classification task.

citation-role summary

background 1

citation-polarity summary

fields

cs.CV 1

years

2026 1

verdicts

UNVERDICTED 1

roles

background 1

polarities

background 1

representative citing papers

Task-Agnostic Noisy Label Detection via Standardized Loss Aggregation

cs.CV · 2026-05-11 · unverdicted · novelty 4.0 · 2 refs

SLA detects noisy labels task-agnostically by standardizing and aggregating validation losses across repeated cross-validation folds, generalizing hard-counting into a continuous estimator that outperforms baselines on fundus data.

citing papers explorer

Showing 1 of 1 citing paper.

  • Task-Agnostic Noisy Label Detection via Standardized Loss Aggregation cs.CV · 2026-05-11 · unverdicted · none · ref 1 · 2 links · internal anchor

    SLA detects noisy labels task-agnostically by standardizing and aggregating validation losses across repeated cross-validation folds, generalizing hard-counting into a continuous estimator that outperforms baselines on fundus data.