Sada: Safe and adaptive inference with multiple black-box predictions. arXiv preprint arXiv:2509.21707

Jiawei Shan, Yiming Dong, Jiwei Zhao · 2025 · stat.ML · arXiv 2509.21707

2 Pith papers cite this work. Polarity classification is still indexing.



abstract

Semi-supervised learning (SSL) arises in practice when labeled data are scarce or expensive to obtain, while large quantities of unlabeled data are readily available. With the growing adoption of machine learning techniques, it has become increasingly feasible to generate multiple predicted labels using a variety of models and algorithms, including deep learning, large language models, and generative AI. In this paper, we propose a novel approach that safely and adaptively aggregates multiple black-box predictions of uncertain quality for both inference and prediction tasks. Our method provides two key guarantees: (i) it never performs worse than using the labeled data alone, regardless of the quality of the predictions; and (ii) if any one of the predictions (without knowing which one) perfectly fits the ground truth, the algorithm adaptively exploits this to achieve either a faster convergence rate or the semiparametric efficiency bound. We demonstrate the effectiveness of the proposed algorithm through small-scale simulations and two real-data analyses with distinct scientific goals. A user-friendly R package, sada, is provided to facilitate practical implementation.
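The two guarantees in the abstract can be illustrated on the simplest target, a population mean. The sketch below is not the SADA algorithm and does not use the sada R package; it is a generic control-variate (prediction-powered style) estimator, and the function name `aggregate_mean` and its arguments are hypothetical. It assumes the labeled and unlabeled samples come from the same distribution. Setting the weights to zero recovers the labeled-only mean, so the fitted weights cannot do worse asymptotically; and if one prediction column equals the true label exactly, the estimate collapses to the mean over all labeled and unlabeled points.

```python
import numpy as np


def aggregate_mean(y_lab, f_lab, f_unlab):
    """Illustrative estimator of E[Y] from labels plus K black-box predictions.

    y_lab:   (n,)   observed labels
    f_lab:   (n, K) predictions on the labeled points
    f_unlab: (N, K) predictions on the unlabeled points
    Returns (theta, w): the estimate and the weight on each prediction.
    """
    y_lab = np.asarray(y_lab, dtype=float)
    f_lab = np.asarray(f_lab, dtype=float)
    f_unlab = np.asarray(f_unlab, dtype=float)
    n, N = len(y_lab), len(f_unlab)

    # Regress centered labels on centered predictions (labeled data only).
    # lstsq returns the minimum-norm solution, so useless or collinear
    # prediction columns do not break the fit.
    fc = f_lab - f_lab.mean(axis=0)
    yc = y_lab - y_lab.mean()
    beta, *_ = np.linalg.lstsq(fc, yc, rcond=None)

    # Variance-minimizing shrinkage for the correction term:
    # Var = Var(Y)/n - 2 w'c/n + w'S w (1/n + 1/N)  =>  w = beta * N/(n+N).
    w = beta * N / (n + N)

    # Labeled mean, corrected by the gap between labeled and unlabeled
    # prediction means. With w = 0 this is just the labeled-only mean.
    theta = y_lab.mean() - w @ (f_lab.mean(axis=0) - f_unlab.mean(axis=0))
    return theta, w


# Simulated example: column 0 is a perfect predictor, column 1 is pure noise.
rng = np.random.default_rng(0)
n, N = 200, 5000
y_all = rng.normal(loc=1.0, scale=2.0, size=n + N)
f_all = np.column_stack([y_all, rng.normal(size=n + N)])
theta, w = aggregate_mean(y_all[:n], f_all[:n], f_all[n:])
print("labeled-only mean:", y_all[:n].mean())
print("aggregated estimate:", theta)
print("weights:", w)
```

Shrinking the regression coefficients by N/(n+N) accounts for the unlabeled prediction mean being itself estimated from a finite sample. The paper's method targets general inference and prediction tasks and may differ substantially from this sketch.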

representative citing papers

Semiparametric semi-supervised learning for general targets under distribution shift and decaying overlap

math.ST · 2025-05-09 · unverdicted · novelty 5.0

Introduces D2S3 semiparametric framework that extends AIPW estimators to semi-supervised settings with MAR labeling, distribution shift, and decaying overlap, supplying corrected asymptotic rates instead of root-n convergence.

Industrializing Prediction-Powered Inference: The GLIDE Library for Reliable GenAI and Agentic Systems Evaluation

cs.AI · 2026-05-29 · unverdicted · novelty 3.0

GLIDE is a Python library that packages multiple PPI estimators and samplers for reliable GenAI evaluation and reports annotation savings in an agentic case study.

citing papers explorer

Showing 2 of 2 citing papers.

Semiparametric semi-supervised learning for general targets under distribution shift and decaying overlap

math.ST · 2025-05-09 · unverdicted · none · ref 10 · internal anchor

Industrializing Prediction-Powered Inference: The GLIDE Library for Reliable GenAI and Agentic Systems Evaluation

cs.AI · 2026-05-29 · unverdicted · none · ref 10 · internal anchor
