A Deployment Audit of Release-Side Risk in Conformal Triage under Prevalence Shift

Chengze Li; Chunyu Miao; Haiyang Peng; Hanrong Zhang; Huanhuan Ma; Philip Yu; Qichao Zhou; Xiangrong Qi; Xiao Liu; Yanghao Ruan

arxiv: 2605.20956 · v1 · pith:NDWM7C4Rnew · submitted 2026-05-20 · 💻 cs.LG · cs.CY

A Deployment Audit of Release-Side Risk in Conformal Triage under Prevalence Shift

Chengze Li , Xiao Liu , Hanrong Zhang , Haiyang Peng , Yanghao Ruan , Huanhuan Ma , Chunyu Miao , Qichao Zhou

show 2 more authors

Xiangrong Qi Philip Yu

This is my paper

Pith reviewed 2026-05-21 06:10 UTC · model grok-4.3

classification 💻 cs.LG cs.CY

keywords conformal triageprevalence shiftdeployment auditrelease safetyNSCLC pilotconformal predictionhuman review rateevent-positive release

0 comments

The pith

Prevalence-corrected conformal triage lowers review rates by releasing some event-positive patients without review.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a deployment audit that directly measures whether conformal triage clears patients who truly experience the target event without human review, a risk that marginal coverage and review-rate summaries can hide under prevalence shift. It does so by first assigning subjects to three non-overlapping roles—prevalence correction, conformal calibration, and held-out release-safety evaluation—so that the audit can count released event-positive cases without leakage. In a retrospective NSCLC pilot, the audit finds that the pooled conformal branch achieves lower review by clearing additional patients, some of whom are event-positive, while the classwise branch shows the pilot has too few event labels to certify that low-review operation is safe. A reader would care because triage decisions in clinical settings trade workload against the chance of missing critical cases, and the audit supplies concrete numbers on that trade-off.

Core claim

By partitioning subjects into prevalence-correction, conformal-calibration, and held-out evaluation sets, the audit can evaluate release actions directly: after prevalence correction the pooled conformal branch reduces human review volume yet releases additional event-positive patients without review, and the classwise branch functions as a scarcity diagnostic revealing that the pilot lacks sufficient event labels to certify safe low-review deployment.

What carries the argument

Leakage-aware deployment audit that assigns target subjects to three non-overlapping roles—prevalence correction, conformal calibration, and held-out release-safety evaluation—to measure event-positive releases without review.

If this is right

After prevalence correction the pooled conformal branch reduces review volume by releasing more patients, some of whom are event-positive.
The classwise conformal branch diagnoses when a pilot has too few event labels to certify safe low-review release.
Standard marginal coverage and review-rate summaries miss the safety question of whether true event-positive cases are cleared without review.
Role separation supports unbiased evaluation of release safety under prevalence shift.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same audit structure could be used in other clinical prediction tasks to check for hidden release of high-risk cases after prevalence adjustment.
Conformal triage deployments may need explicit checks for both prevalence shift and label sufficiency before moving to low-review operation.
Pilots aiming for reduced review should first verify they hold enough event-positive labels in the calibration partition.

Load-bearing premise

Partitioning subjects into three non-overlapping roles for prevalence correction, calibration, and evaluation prevents leakage and permits unbiased counting of event-positive patients released without review.

What would settle it

Re-applying the audit to the same NSCLC pilot data and counting zero event-positive patients released without review in the held-out set under the prevalence-corrected pooled conformal branch would show the reported risk does not appear.

Figures

Figures reproduced from arXiv: 2605.20956 by Chengze Li, Chunyu Miao, Haiyang Peng, Hanrong Zhang, Huanhuan Ma, Philip Yu, Qichao Zhou, Xiangrong Qi, Xiao Liu, Yanghao Ruan.

**Figure 2.** Figure 2: Leakage-aware pilot audit pipeline. The target cohort is split into three non-overlapping [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗

**Figure 3.** Figure 3: Event-coverage collapse under pooled prevalence correction at [PITH_FULL_IMAGE:figures/full_fig_p009_3.png] view at source ↗

**Figure 4.** Figure 4: Conformal triage operating curves. Curves trace event-release risk versus human-review [PITH_FULL_IMAGE:figures/full_fig_p012_4.png] view at source ↗

read the original abstract

Conformal triage converts predictive scores into deployment actions that either release a case, flag it for urgent attention, or defer it to human review. Under prevalence shift, however, the usual summaries of marginal coverage and human-review rate can miss the safety-critical question of whether patients who truly experience the target event are released without review. To address this gap, we introduce a leakage-aware deployment audit for release-side conformal triage. It first assigns target subjects to three non-overlapping roles: prevalence correction, conformal calibration, and held-out release-safety evaluation. This separation then lets the audit evaluate release directly: how many event-positive patients are cleared without review, whether the pilot has enough event labels for calibration, and how the safety-review trade-off shifts. Applying this audit to a retrospective NSCLC pilot shows why lower review can be misleading: after prevalence correction, the pooled conformal branch lowers review by releasing more patients, some of whom are event-positive. Within the audit, the classwise branch acts as a scarcity diagnostic: the pilot has too few event labels to certify safe low-review release.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper introduces a three-role split to audit whether conformal triage releases event-positive cases without review after prevalence correction, but the NSCLC pilot's label scarcity leaves the safety findings underpowered.

read the letter

The main takeaway is that this work sets up a leakage-aware audit by assigning subjects to non-overlapping groups for prevalence correction, conformal calibration, and held-out release evaluation. That structure lets them check directly whether the system clears true event cases under shift, which standard coverage numbers miss. The NSCLC retrospective pilot illustrates the point: after correction the pooled branch cuts review volume but still releases some event-positives, while the classwise branch flags that there simply are not enough labels to certify the low-review regime as safe. The role separation itself is a useful framing that extends ordinary conformal practice without circularity. It gives practitioners a concrete way to surface the safety trade-off rather than relying on marginal summaries. The central argument holds up on its own terms as a diagnostic tool. The soft spot is exactly the one the stress-test flags. With event labels already scarce in the pilot, the held-out evaluation slice will contain very few positives. The abstract states that some released patients are event-positive but supplies no counts, no intervals, and no check on how the count moves with different partitions. If the positives in that slice are in the single digits, the observation is likely dominated by sampling variation. The paper would be stronger with those numbers and a sensitivity table. This is aimed at researchers and engineers who deploy conformal methods on imbalanced medical data and want explicit release-side checks. A reader working on safety audits or prevalence-robust triage will get practical value from the role-separation idea even if the current pilot is limited. It deserves a serious referee because the audit structure is a reasonable addition to the conformal toolkit and the concern it raises is real. Send it for review and ask for the held-out counts plus robustness checks on the partition.

Referee Report

1 major / 2 minor

Summary. The paper introduces a leakage-aware deployment audit for conformal triage under prevalence shift. Target subjects are partitioned into three non-overlapping roles (prevalence correction, conformal calibration, and held-out release-safety evaluation) to directly assess whether event-positive patients are released without review. In a retrospective NSCLC pilot, the audit shows that after prevalence correction the pooled conformal branch lowers review rates by releasing additional patients including some event-positives, while the classwise branch diagnoses insufficient event labels to certify safe low-review release.

Significance. If the audit holds, the work fills an important gap by focusing on release-side safety for true event-positives rather than marginal coverage or review rates alone. The explicit role separation provides a practical, leakage-controlled framework for auditing conformal methods in shifted medical settings, and the pilot illustrates how lower review can mask risk when labels are scarce. This could inform safer deployment protocols and encourage more targeted evaluation of conformal triage systems.

major comments (1)

[Results / NSCLC pilot application] In the retrospective NSCLC application, the manuscript reports that the pooled conformal branch releases additional event-positive patients without review but supplies no counts of such releases, no confidence intervals, and no sensitivity checks across partitions. Given the classwise branch's scarcity diagnostic and the expectation of very few positives in the held-out set, this leaves the central safety claim vulnerable to sampling variation rather than a stable signal.

minor comments (2)

[Audit procedure] Clarify in the methods section how the three-role partition sizes are chosen in practice and whether any power analysis supports reliable detection of released positives.
[Abstract] The abstract would benefit from a brief quantitative qualifier on the pilot findings (or their absence) to better align reader expectations with the reported results.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback and for recognizing the importance of a leakage-aware release-side audit for conformal triage. We address the single major comment below and will revise the manuscript accordingly to strengthen the empirical presentation of the NSCLC pilot.

read point-by-point responses

Referee: [Results / NSCLC pilot application] In the retrospective NSCLC application, the manuscript reports that the pooled conformal branch releases additional event-positive patients without review but supplies no counts of such releases, no confidence intervals, and no sensitivity checks across partitions. Given the classwise branch's scarcity diagnostic and the expectation of very few positives in the held-out set, this leaves the central safety claim vulnerable to sampling variation rather than a stable signal.

Authors: We agree that the current presentation of the NSCLC pilot results would benefit from greater transparency on the exact release counts for event-positive cases, uncertainty quantification, and checks for partition sensitivity. The pilot is deliberately small and retrospective, with the classwise branch explicitly diagnosing label scarcity in the held-out set; the pooled branch is shown to release additional event-positives precisely to illustrate how reduced review rates can mask release-side risk under prevalence shift. To make this illustration more robust, we will add: (i) the raw counts of released event-positive patients under each conformal branch, (ii) bootstrap or exact confidence intervals on the release proportions where the small sample permits, and (iii) sensitivity results obtained by re-running the audit under alternative random partitions and different prevalence-correction splits. These additions will be placed in a new subsection of the results and will not alter the core methodological contribution or the scarcity diagnostic. revision: yes

Circularity Check

0 steps flagged

No circularity: audit defined by explicit role separation on held-out data

full rationale

The paper introduces a leakage-aware deployment audit by partitioning target subjects into three non-overlapping roles (prevalence correction, conformal calibration, held-out release-safety evaluation). This partition is presented as a methodological definition to enable direct evaluation of released event-positives, and the procedure is then applied to retrospective NSCLC pilot data. No equations, fitted parameters, or derivations are shown that reduce by construction to the inputs; the role separation is an external design choice rather than a self-referential quantity. The analysis remains self-contained against external data splits and does not rely on self-citation chains or ansatzes smuggled from prior author work.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The approach rests on standard conformal prediction assumptions after prevalence correction and introduces no new free parameters or invented entities beyond the audit structure itself.

axioms (1)

domain assumption Standard conformal prediction assumptions such as exchangeability hold after prevalence correction is applied.
Implicit foundation for the conformal calibration step described in the abstract.

pith-pipeline@v0.9.0 · 5749 in / 1217 out tokens · 34749 ms · 2026-05-21T06:10:41.700776+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

42 extracted references · 42 canonical work pages

[1]

Vovk, Vladimir and Gammerman, Alex and Shafer, Glenn , title =

work page
[2]

Classification with Valid and Adaptive Coverage , booktitle =

Romano, Yaniv and Sesia, Matteo and Cand. Classification with Valid and Adaptive Coverage , booktitle =

work page
[3]

and Bates, Stephen , title =

Angelopoulos, Anastasios N. and Bates, Stephen , title =. Foundations and Trends in Machine Learning , volume =. 2023 , doi =

work page 2023
[4]

and Bates, Stephen and Fisch, Adam and Lei, Lihua and Schuster, Tal , title =

Angelopoulos, Anastasios N. and Bates, Stephen and Fisch, Adam and Lei, Lihua and Schuster, Tal , title =. International Conference on Learning Representations (

work page
[5]

and Lei, Lihua and Malik, Jitendra and Jordan, Michael I

Bates, Stephen and Angelopoulos, Anastasios N. and Lei, Lihua and Malik, Jitendra and Jordan, Michael I. , title =. Journal of the. 2021 , doi =

work page 2021
[6]

Deployment of Image Analysis Algorithms under Prevalence Shifts , booktitle =

Godau, Patrick and Kalinowski, Piotr and Christodoulou, Evanthia and Reinke, Annika and Tizabi, Mehdi and Ferrer, Lucia and J. Deployment of Image Analysis Algorithms under Prevalence Shifts , booktitle =

work page
[7]

, title =

Guo, Chuan and Pleiss, Geoff and Sun, Yu and Weinberger, Kilian Q. , title =. Proceedings of the 34th International Conference on Machine Learning (

work page
[8]

Scientific Data , volume =

Bakr, Shaimaa and Gevaert, Olivier and Echegaray, Salvador and Ayers, Kelsey and others , title =. Scientific Data , volume =. 2018 , doi =

work page 2018
[9]

Conformal Prediction beyond Exchangeability , journal =

Barber, Rina Foygel and Cand. Conformal Prediction beyond Exchangeability , journal =. 2023 , doi =

work page 2023
[10]

Proceedings of the 37th Conference on Uncertainty in Artificial Intelligence (

Podkopaev, Alexander and Ramdas, Aaditya , title =. Proceedings of the 37th Conference on Uncertainty in Artificial Intelligence (

work page
[11]

Neural Computation , volume =

Saerens, Marco and Latinne, Patrice and Decaestecker, Christine , title =. Neural Computation , volume =. 2002 , doi =

work page 2002
[12]

Aerts, Hugo J. W. L. , title =. 2014 , doi =

work page 2014
[13]

2017 , doi =

Bakr, Shaimaa and Gevaert, Olivier and Echegaray, Salvador and Ayers, Kelsey and others , title =. 2017 , doi =

work page 2017
[14]

and Cand

Gibbs, Isaac and Cherian, John J. and Cand. Conformal Prediction with Conditional Guarantees , journal =. 2025 , doi =

work page 2025
[15]

and Pomerantz, Stuart R

Angelopoulos, Anastasios N. and Pomerantz, Stuart R. and Do, Synho and Bates, Stephen and others , title =. 2024 , doi =

work page 2024
[16]

, title =

Cox, David R. , title =. Journal of the Royal Statistical Society: Series B (Methodological) , volume =. 1972 , doi =

work page 1972
[17]

Journal of Open Source Software , volume =

Davidson-Pilon, Cameron , title =. Journal of Open Source Software , volume =. 2019 , doi =

work page 2019
[18]

Advances in Neural Information Processing Systems (

Geifman, Yonatan and El-Yaniv, Ran , title =. Advances in Neural Information Processing Systems (

work page
[19]

and Barber, Rina Foygel and Cand

Tibshirani, Ryan J. and Barber, Rina Foygel and Cand. Conformal Prediction under Covariate Shift , booktitle =

work page
[20]

and Bates, Stephen and Jordan, Michael I

Ding, Tiffany and Angelopoulos, Anastasios N. and Bates, Stephen and Jordan, Michael I. and Tibshirani, Ryan J. , title =. Advances in Neural Information Processing Systems (

work page
[21]

Journal of Machine Learning Research , volume =

El-Yaniv, Ran and Wiener, Yair , title =. Journal of Machine Learning Research , volume =

work page
[22]

Dataset Shift in Machine Learning , publisher =

Qui. Dataset Shift in Machine Learning , publisher =

work page
[23]

International Conference on Machine Learning (

Lipton, Zachary and Wang, Yu-Xiang and Smola, Alexander , title =. International Conference on Machine Learning (

work page
[24]

Journal of Machine Learning Research , volume =

Shafer, Glenn and Vovk, Vladimir , title =. Journal of Machine Learning Research , volume =

work page
[25]

Chow, C. K. , title =. IEEE Transactions on Information Theory , volume =

work page
[26]

International Journal of Computer Vision , volume =

Yang, Jingkang and Zhou, Kaiyang and Li, Yixuan and Liu, Ziwei , title =. International Journal of Computer Vision , volume =. 2024 , doi =

work page 2024
[27]

International Conference on Learning Representations (

Hendrycks, Dan and Gimpel, Kevin , title =. International Conference on Learning Representations (

work page
[28]

Advances in Neural Information Processing Systems , volume =

Liu, Weitang and Wang, Xiaoyun and Owens, John and Li, Yixuan , title =. Advances in Neural Information Processing Systems , volume =

work page
[29]

Li, Yucen Lily and Lu, Daohan and Kirichenko, Polina and Qiu, Shikai and Rudner, Tim G. J. and Bruss, C. Bayan and Wilson, Andrew Gordon , title =. International Conference on Machine Learning (

work page
[30]

Medical Image Computing and Computer Assisted Intervention -- MICCAI 2023 , editor =

Zhan, Chenlu and Peng, Peng and Zhang, Hanrong and Sun, Haiyue and Shang, Chunnan and Chen, Tao and Wang, Hongsen and Wang, Gaoang and Wang, Hongwei , title =. Medical Image Computing and Computer Assisted Intervention -- MICCAI 2023 , editor =. 2023 , isbn =

work page 2023
[31]

Biometrika , volume =

Classification with Confidence , author =. Biometrika , volume =. 2014 , doi =

work page 2014
[32]

Journal of the American Statistical Association , volume =

Least Ambiguous Set-Valued Classifiers With Bounded Error Levels , author =. Journal of the American Statistical Association , volume =. 2019 , doi =

work page 2019
[33]

Proceedings of the AAAI Conference on Artificial Intelligence , volume =

Fair Conformal Predictors for Applications in Medical Imaging , author =. Proceedings of the AAAI Conference on Artificial Intelligence , volume =. 2022 , doi =

work page 2022
[34]

Algorithmic Learning Theory , series =

Learning with Rejection , author =. Algorithmic Learning Theory , series =. 2016 , doi =

work page 2016
[35]

2019 , editor =

Geifman, Yonatan and El-Yaniv, Ran , booktitle =. 2019 , editor =

work page 2019
[36]

Proceedings of the 37th International Conference on Machine Learning , pages =

Consistent Estimators for Learning to Defer to an Expert , author =. Proceedings of the 37th International Conference on Machine Learning , pages =. 2020 , editor =

work page 2020
[37]

Pattern Recognition , volume =

A Unifying View on Dataset Shift in Classification , author =. Pattern Recognition , volume =. 2012 , doi =

work page 2012
[38]

Proceedings of the 34th International Conference on Machine Learning , pages =

On Calibration of Modern Neural Networks , author =. Proceedings of the 34th International Conference on Machine Learning , pages =. 2017 , editor =

work page 2017
[39]

Advances in Neural Information Processing Systems , volume =

Can You Trust Your Model's Uncertainty? Evaluating Predictive Uncertainty Under Dataset Shift , author =. Advances in Neural Information Processing Systems , volume =

work page
[40]

Nature Medicine , volume =

Do No Harm: A Roadmap for Responsible Machine Learning for Health Care , author =. Nature Medicine , volume =. 2019 , doi =

work page 2019
[41]

npj Digital Medicine , volume =

Clinical Artificial Intelligence Quality Improvement: Towards Continual Monitoring and Updating of AI Algorithms in Healthcare , author =. npj Digital Medicine , volume =. 2022 , doi =

work page 2022
[42]

Advances in Neural Information Processing Systems , volume =

Conformal Prediction Under Covariate Shift , author =. Advances in Neural Information Processing Systems , volume =

work page

[1] [1]

Vovk, Vladimir and Gammerman, Alex and Shafer, Glenn , title =

work page

[2] [2]

Classification with Valid and Adaptive Coverage , booktitle =

Romano, Yaniv and Sesia, Matteo and Cand. Classification with Valid and Adaptive Coverage , booktitle =

work page

[3] [3]

and Bates, Stephen , title =

Angelopoulos, Anastasios N. and Bates, Stephen , title =. Foundations and Trends in Machine Learning , volume =. 2023 , doi =

work page 2023

[4] [4]

and Bates, Stephen and Fisch, Adam and Lei, Lihua and Schuster, Tal , title =

Angelopoulos, Anastasios N. and Bates, Stephen and Fisch, Adam and Lei, Lihua and Schuster, Tal , title =. International Conference on Learning Representations (

work page

[5] [5]

and Lei, Lihua and Malik, Jitendra and Jordan, Michael I

Bates, Stephen and Angelopoulos, Anastasios N. and Lei, Lihua and Malik, Jitendra and Jordan, Michael I. , title =. Journal of the. 2021 , doi =

work page 2021

[6] [6]

Deployment of Image Analysis Algorithms under Prevalence Shifts , booktitle =

Godau, Patrick and Kalinowski, Piotr and Christodoulou, Evanthia and Reinke, Annika and Tizabi, Mehdi and Ferrer, Lucia and J. Deployment of Image Analysis Algorithms under Prevalence Shifts , booktitle =

work page

[7] [7]

, title =

Guo, Chuan and Pleiss, Geoff and Sun, Yu and Weinberger, Kilian Q. , title =. Proceedings of the 34th International Conference on Machine Learning (

work page

[8] [8]

Scientific Data , volume =

Bakr, Shaimaa and Gevaert, Olivier and Echegaray, Salvador and Ayers, Kelsey and others , title =. Scientific Data , volume =. 2018 , doi =

work page 2018

[9] [9]

Conformal Prediction beyond Exchangeability , journal =

Barber, Rina Foygel and Cand. Conformal Prediction beyond Exchangeability , journal =. 2023 , doi =

work page 2023

[10] [10]

Proceedings of the 37th Conference on Uncertainty in Artificial Intelligence (

Podkopaev, Alexander and Ramdas, Aaditya , title =. Proceedings of the 37th Conference on Uncertainty in Artificial Intelligence (

work page

[11] [11]

Neural Computation , volume =

Saerens, Marco and Latinne, Patrice and Decaestecker, Christine , title =. Neural Computation , volume =. 2002 , doi =

work page 2002

[12] [12]

Aerts, Hugo J. W. L. , title =. 2014 , doi =

work page 2014

[13] [13]

2017 , doi =

Bakr, Shaimaa and Gevaert, Olivier and Echegaray, Salvador and Ayers, Kelsey and others , title =. 2017 , doi =

work page 2017

[14] [14]

and Cand

Gibbs, Isaac and Cherian, John J. and Cand. Conformal Prediction with Conditional Guarantees , journal =. 2025 , doi =

work page 2025

[15] [15]

and Pomerantz, Stuart R

Angelopoulos, Anastasios N. and Pomerantz, Stuart R. and Do, Synho and Bates, Stephen and others , title =. 2024 , doi =

work page 2024

[16] [16]

, title =

Cox, David R. , title =. Journal of the Royal Statistical Society: Series B (Methodological) , volume =. 1972 , doi =

work page 1972

[17] [17]

Journal of Open Source Software , volume =

Davidson-Pilon, Cameron , title =. Journal of Open Source Software , volume =. 2019 , doi =

work page 2019

[18] [18]

Advances in Neural Information Processing Systems (

Geifman, Yonatan and El-Yaniv, Ran , title =. Advances in Neural Information Processing Systems (

work page

[19] [19]

and Barber, Rina Foygel and Cand

Tibshirani, Ryan J. and Barber, Rina Foygel and Cand. Conformal Prediction under Covariate Shift , booktitle =

work page

[20] [20]

and Bates, Stephen and Jordan, Michael I

Ding, Tiffany and Angelopoulos, Anastasios N. and Bates, Stephen and Jordan, Michael I. and Tibshirani, Ryan J. , title =. Advances in Neural Information Processing Systems (

work page

[21] [21]

Journal of Machine Learning Research , volume =

El-Yaniv, Ran and Wiener, Yair , title =. Journal of Machine Learning Research , volume =

work page

[22] [22]

Dataset Shift in Machine Learning , publisher =

Qui. Dataset Shift in Machine Learning , publisher =

work page

[23] [23]

International Conference on Machine Learning (

Lipton, Zachary and Wang, Yu-Xiang and Smola, Alexander , title =. International Conference on Machine Learning (

work page

[24] [24]

Journal of Machine Learning Research , volume =

Shafer, Glenn and Vovk, Vladimir , title =. Journal of Machine Learning Research , volume =

work page

[25] [25]

Chow, C. K. , title =. IEEE Transactions on Information Theory , volume =

work page

[26] [26]

International Journal of Computer Vision , volume =

Yang, Jingkang and Zhou, Kaiyang and Li, Yixuan and Liu, Ziwei , title =. International Journal of Computer Vision , volume =. 2024 , doi =

work page 2024

[27] [27]

International Conference on Learning Representations (

Hendrycks, Dan and Gimpel, Kevin , title =. International Conference on Learning Representations (

work page

[28] [28]

Advances in Neural Information Processing Systems , volume =

Liu, Weitang and Wang, Xiaoyun and Owens, John and Li, Yixuan , title =. Advances in Neural Information Processing Systems , volume =

work page

[29] [29]

Li, Yucen Lily and Lu, Daohan and Kirichenko, Polina and Qiu, Shikai and Rudner, Tim G. J. and Bruss, C. Bayan and Wilson, Andrew Gordon , title =. International Conference on Machine Learning (

work page

[30] [30]

Medical Image Computing and Computer Assisted Intervention -- MICCAI 2023 , editor =

Zhan, Chenlu and Peng, Peng and Zhang, Hanrong and Sun, Haiyue and Shang, Chunnan and Chen, Tao and Wang, Hongsen and Wang, Gaoang and Wang, Hongwei , title =. Medical Image Computing and Computer Assisted Intervention -- MICCAI 2023 , editor =. 2023 , isbn =

work page 2023

[31] [31]

Biometrika , volume =

Classification with Confidence , author =. Biometrika , volume =. 2014 , doi =

work page 2014

[32] [32]

Journal of the American Statistical Association , volume =

Least Ambiguous Set-Valued Classifiers With Bounded Error Levels , author =. Journal of the American Statistical Association , volume =. 2019 , doi =

work page 2019

[33] [33]

Proceedings of the AAAI Conference on Artificial Intelligence , volume =

Fair Conformal Predictors for Applications in Medical Imaging , author =. Proceedings of the AAAI Conference on Artificial Intelligence , volume =. 2022 , doi =

work page 2022

[34] [34]

Algorithmic Learning Theory , series =

Learning with Rejection , author =. Algorithmic Learning Theory , series =. 2016 , doi =

work page 2016

[35] [35]

2019 , editor =

Geifman, Yonatan and El-Yaniv, Ran , booktitle =. 2019 , editor =

work page 2019

[36] [36]

Proceedings of the 37th International Conference on Machine Learning , pages =

Consistent Estimators for Learning to Defer to an Expert , author =. Proceedings of the 37th International Conference on Machine Learning , pages =. 2020 , editor =

work page 2020

[37] [37]

Pattern Recognition , volume =

A Unifying View on Dataset Shift in Classification , author =. Pattern Recognition , volume =. 2012 , doi =

work page 2012

[38] [38]

Proceedings of the 34th International Conference on Machine Learning , pages =

On Calibration of Modern Neural Networks , author =. Proceedings of the 34th International Conference on Machine Learning , pages =. 2017 , editor =

work page 2017

[39] [39]

Advances in Neural Information Processing Systems , volume =

Can You Trust Your Model's Uncertainty? Evaluating Predictive Uncertainty Under Dataset Shift , author =. Advances in Neural Information Processing Systems , volume =

work page

[40] [40]

Nature Medicine , volume =

Do No Harm: A Roadmap for Responsible Machine Learning for Health Care , author =. Nature Medicine , volume =. 2019 , doi =

work page 2019

[41] [41]

npj Digital Medicine , volume =

Clinical Artificial Intelligence Quality Improvement: Towards Continual Monitoring and Updating of AI Algorithms in Healthcare , author =. npj Digital Medicine , volume =. 2022 , doi =

work page 2022

[42] [42]

Advances in Neural Information Processing Systems , volume =

Conformal Prediction Under Covariate Shift , author =. Advances in Neural Information Processing Systems , volume =

work page