pith. sign in

hub Mixed citations

PPI++: Efficient Prediction-Powered Inference

Mixed citation behavior. Most common role is method (50%).

30 Pith papers citing it
Method 50% of classified citations
abstract

We present PPI++: a computationally lightweight methodology for estimation and inference based on a small labeled dataset and a typically much larger dataset of machine-learning predictions. The methods automatically adapt to the quality of available predictions, yielding easy-to-compute confidence sets -- for parameters of any dimensionality -- that always improve on classical intervals using only the labeled data. PPI++ builds on prediction-powered inference (PPI), which targets the same problem setting, improving its computational and statistical efficiency. Real and synthetic experiments demonstrate the benefits of the proposed adaptations.

hub tools

citation-role summary

method 3 background 2 dataset 1

citation-polarity summary

clear filters

representative citing papers

Online Pandora's Box for Contextual LLM Cascading

cs.AI · 2026-06-05 · unverdicted · novelty 7.0

Introduces a parametric reservation-index policy with GMM estimation and UCB exploration for contextual LLM cascading under output-mediated feedback, claiming dimension-dependent square-root regret.

Prediction-powered Inference by Mixture of Experts

stat.ML · 2026-04-30 · unverdicted · novelty 7.0

An MOE-powered PPI framework adaptively blends multiple predictors to achieve minimal variance and a best-expert guarantee for semi-supervised mean estimation, linear regression, quantile estimation, and M-estimation, supported by non-asymptotic coverage bounds.

Bootstrapping with AI/ML-generated labels

econ.EM · 2026-04-26 · unverdicted · novelty 7.0

A coupled-label bootstrap provides valid inference for OLS regressions that use AI/ML-generated binary labels despite misclassification errors, unlike standard fixed-label bootstraps.

Calibeating Prediction-Powered Inference

stat.ML · 2026-04-23 · unverdicted · novelty 7.0

Post-hoc calibration of miscalibrated black-box predictions on a labeled sample improves efficiency of prediction-powered inference for semisupervised mean estimation.

Multi-Armed Bandits With Machine Learning-Generated Surrogate Rewards

math.ST · 2025-06-20 · unverdicted · novelty 7.0

The MLA-UCB algorithm uses ML-generated surrogate rewards from auxiliary data to provably lower cumulative regret in multi-armed bandits, achieving asymptotic optimality under joint Gaussian assumptions without requiring knowledge of the true-surrogate covariance.

Active Statistical Inference

stat.ML · 2024-03-05 · unverdicted · novelty 7.0

Active inference adapts label collection via ML uncertainty to deliver valid statistical inference with substantially fewer samples than standard non-adaptive methods across any data distribution.

The Warrant Gap: Claim-Conditioned Re-scoring for Fact-Checking

cs.CL · 2026-06-23 · unverdicted · novelty 6.0

Introduces claim-conditioned re-scoring (SIFT) and warranted supports proportion (WSP) metric, reporting accuracy recovery up to 27.6 points and WSP calibration at AUC 0.92 on FEVER, SciFact and other benchmarks.

Multi-Source Prediction-Powered Inference

stat.ME · 2026-06-19 · unverdicted · novelty 6.0

Multi-source prediction-powered inference aggregates multiple pseudo-labeled datasets via weights chosen to minimize asymptotic confidence-region volume, with asymptotic normality and comparisons to single-source and target-only baselines shown for both homogeneous and heterogeneous (covariate/domai

Valid Inference with Synthetic Data via Task Exchangeability

stat.ME · 2026-06-11 · unverdicted · novelty 6.0

Proposes task exchangeability as a condition for valid inference when using synthetic data in scientific research, with methods and extensions demonstrated on surveys and AI evaluations.

Learning U-Statistics with Active Inference

stat.ML · 2026-05-12 · unverdicted · novelty 6.0

Active inference framework for U-statistics using augmented IPW to optimize label queries and minimize variance under budget constraints.

Supercharging Bayesian Inference with Reliable AI-Informed Priors

stat.ML · 2026-05-11 · unverdicted · novelty 6.0

Rectified AI priors, obtained by correcting AI-induced data laws before embedding them in techniques like Dirichlet process priors, reduce bias, improve credible interval coverage, and boost performance in tasks like skin disease classification.

Empirical Bayes Rebiasing

stat.ME · 2026-05-08 · unverdicted · novelty 6.0

Empirical Bayes rebiasing learns the bias distribution from paired noisy estimates to produce shorter calibrated intervals than full debiasing while maintaining coverage.

Bias and Uncertainty in LLM-as-a-Judge Estimation

cs.LG · 2026-05-07 · unverdicted · novelty 6.0

Bias-corrected LLM-as-a-Judge estimators can reverse true model orderings under shared calibration, and the paper supplies judge quality J and cross-model instability ΔJ as practical diagnostics for when such estimates are unreliable.

Debiased neural operators for estimating functionals

cs.LG · 2026-04-21 · unverdicted · novelty 6.0

DOPE is a Neyman-orthogonal one-step semiparametric estimator that removes first-order bias in functional estimates from neural operators by learning weights via Riesz regression.

Semi-Supervised Hypothesis Testing by Betting on Predictions

cs.LG · 2026-05-27 · unverdicted · novelty 5.0

A new e-statistic enables anytime-valid sequential testing by betting on predictions from unlabeled data, with non-trivial power for binary outcomes even under inaccurate predictions and label or concept shift.

citing papers explorer

Showing 4 of 4 citing papers after filters.