pith. machine review for the scientific record. sign in

arxiv: 2408.07066 · v4 · submitted 2024-08-13 · 📊 stat.ME

Recognition: unknown

Conformal prediction after data-dependent model selection

Authors on Pith no claims yet
classification 📊 stat.ME
keywords modelpredictionconformalselectionsetswidthdatamethods
0
0 comments X
read the original abstract

Given a family of pretrained models and a hold-out set, how can we construct a valid conformal prediction set while selecting a model that minimizes the width of the set? If we use the same hold-out data set both to select a model (the model that yields the smallest conformal prediction sets) and then to construct a conformal prediction set based on that selected model, we suffer a loss of coverage due to selection bias. Alternatively, we could further split the data to perform selection and calibration separately, but this comes at a steep cost if the size of the dataset is limited. In this paper, we address the challenge of constructing a valid prediction set after data-dependent model selection -- commonly, selecting the model that minimizes the width of the resulting prediction sets. Our novel methods can be implemented efficiently and admit finite-sample validity guarantees without invoking additional sample-splitting. We show that our methods yield prediction sets with asymptotically optimal width under certain notion of regularity for the model class. The improvement in the width of the prediction sets constructed by our methods are further demonstrated through applications to synthetic datasets in various settings and a real data example.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 7 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Online Localized Conformal Prediction

    cs.LG 2026-05 unverdicted novelty 7.0

    OLCP combines online adaptation with covariate-based localization to achieve valid long-run coverage and narrower intervals than global baselines in non-exchangeable settings.

  2. A Unified Theory of Conditional Coverage in Conformal Prediction with Applications

    stat.ME 2026-05 unverdicted novelty 6.0

    A unified framework derives non-asymptotic bounds on conditional miscoverage in conformal prediction via pointwise and L_p routes and gives a common view of existing methods.

  3. Learning Polyhedral Conformal Sets for Robust Optimization

    cs.LG 2026-05 unverdicted novelty 6.0

    Learns polyhedral uncertainty sets for robust optimization using a decision-aware conformal prediction approach that minimizes robust loss while ensuring statistical coverage.

  4. Learning Polyhedral Conformal Sets for Robust Optimization

    cs.LG 2026-05 unverdicted novelty 6.0

    A new conformal framework learns polyhedral uncertainty sets tailored to robust optimization objectives, minimizing decision loss while preserving coverage via calibration and independent re-calibration.

  5. Online Localized Conformal Prediction

    cs.LG 2026-05 unverdicted novelty 6.0

    OLCP and OLCP-Hedge achieve long-run valid coverage in non-exchangeable online settings with narrower prediction sets by localizing conformal prediction to covariates and selecting bandwidth via online convex optimization.

  6. Conformalized Percentile Interval: Finite Sample Validity and Improved Conditional Performance

    stat.ML 2026-05 unverdicted novelty 6.0

    A PIT-calibrated percentile interval method delivers finite-sample marginal coverage, asymptotic conditional coverage, and shorter intervals than prior conformal approaches.

  7. Stable Localized Conformal Prediction via Transduction

    stat.ME 2026-05 unverdicted novelty 6.0

    StCP leverages transfer learning to stabilize the size of conformal prediction sets without additional target labels.