Optimal Deterministic Multicalibration and Omniprediction

Aaron Roth; Georgy Noarov

arxiv: 2606.20557 · v1 · pith:FBKOA47Inew · submitted 2026-06-18 · 💻 cs.LG · math.ST· stat.ML· stat.TH

Optimal Deterministic Multicalibration and Omniprediction

Georgy Noarov , Aaron Roth This is my paper

Pith reviewed 2026-06-26 18:24 UTC · model grok-4.3

classification 💻 cs.LG math.STstat.MLstat.TH

keywords multicalibrationdeterministic predictorssample complexityoutcome indistinguishabilityomnipredictioncalibration

0 comments

The pith

A deterministic algorithm achieves the minimax-optimal sample complexity for multicalibration.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper constructs a deterministic predictor that is multicalibrated with respect to a collection of groups at the optimal rate of roughly epsilon to the minus three. Earlier methods that reached this rate used randomization, leaving open whether randomness was required for efficiency. The new construction shows a deterministic version suffices. The same approach extends to outcome indistinguishability for finite collections of tests and therefore supplies deterministic omnipredictors and panpredictors at the same optimal rate. Readers care because multicalibration guarantees that predictions remain unbiased inside every specified subgroup, a basic requirement for reliable downstream use.

Core claim

The central claim is that there exists a deterministic predictor attaining ε-multicalibration with sample complexity matching the known minimax lower bound of Õ(ε^{-3}). The algorithm is then lifted to produce deterministic predictors satisfying outcome indistinguishability for any finite or finitely covered collection of tests, which immediately yields deterministic omnipredictors and panpredictors with the same optimal sample complexity.

What carries the argument

The deterministic multicalibration algorithm that iteratively enforces calibration constraints across the given groups without introducing randomness.

If this is right

Deterministic omnipredictors achieve the optimal sample complexity previously known only for randomized versions.
Outcome indistinguishability holds deterministically for any finite collection of tests at the minimax rate.
Panpredictors likewise admit deterministic constructions with optimal sample complexity.
Multicalibration and its generalizations no longer require randomization to attain best-known efficiency.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Reproducibility improves in applications that cannot tolerate randomized outputs.
The finite-test restriction may still allow practical coverage of many fairness or calibration constraints via suitable discretization.
Similar derandomization techniques could be tested on related notions such as multiaccuracy or other conditional unbiasedness properties.

Load-bearing premise

The collection of groups or tests is finite or admits a finite cover.

What would settle it

A distribution over contexts and outcomes together with a finite group collection such that every deterministic predictor requires more than Õ(ε^{-3}) samples to achieve ε-multicalibration would falsify the claim.

Figures

Figures reproduced from arXiv: 2606.20557 by Aaron Roth, Georgy Noarov.

**Figure 2.** Figure 2: The learner splits its sample into three independent parts. The confidence sample [PITH_FULL_IMAGE:figures/full_fig_p009_2.png] view at source ↗

read the original abstract

A model is multicalibrated on a collection of group weights $G$ if it is calibrated -- i.e. unbiased even conditional on its prediction -- not just overall, but also after reweighting contexts by each $g \in G$. It is a useful property for many downstream applications and is a basic desideratum of trustworthy machine learning. Before this work, all predictors known to attain the minimax-optimal $\widetilde O(\varepsilon^{-3})$ sample complexity rate for $\varepsilon$-multicalibration were randomized, while deterministic predictors were known only with substantially worse sample complexity. Whether randomization is necessary for optimal sample complexity in multicalibration was explicitly asked by [CLNR26] and implicitly in several prior works. We resolve this open problem by giving a minimax-optimal multicalibration algorithm that outputs a deterministic predictor. We then generalize the algorithm to produce optimal deterministic predictors that satisfy outcome indistinguishability (OI) with respect to finite or finitely covered collections of tests. As an application, this also gives deterministic omnipredictors and panpredictors with optimal sample complexity, resolving open problems posed by [OKK25] and [BHHLZ25].

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper gives a deterministic algorithm matching the optimal multicalibration sample complexity and extends it to deterministic omniprediction.

read the letter

The main takeaway is that they construct a deterministic predictor achieving the minimax-optimal ilde O(\varepsilon^{-3}) rate for \varepsilon-multicalibration, which settles the explicit open question from prior work on whether randomization was required.

The paper does two things cleanly. First, it delivers the deterministic algorithm for multicalibration itself. Second, it generalizes the same approach to outcome indistinguishability over finite or finitely covered test collections, which immediately yields deterministic omnipredictors and panpredictors at the same optimal rate. Both steps address stated open problems in the cited papers.

The result is new because earlier deterministic constructions had worse dependence on \varepsilon. The abstract states the limitation to finite collections upfront, so that boundary is not hidden. No circularity or invented entities appear in the claim.

The soft spot is that the abstract gives no proof sketch or runtime details, so any gaps in the actual construction or hidden assumptions on the group weights G cannot be checked from the summary alone. The finite-collection restriction for the OI extension is real and may limit some applications, but it matches what the paper claims to solve. Nothing in the given material suggests the central argument fails on its own terms.

This is for people working on calibration, fairness, and prediction with guarantees in learning theory. A reader following multicalibration or omniprediction will get direct value from seeing the deterministic construction at optimal rates.

It deserves peer review. The result targets a known open question with a concrete algorithmic fix, so referees should check the details.

Referee Report

0 major / 2 minor

Summary. The paper claims to resolve the open question of whether randomization is required to achieve the minimax-optimal ̃O(ε^{-3}) sample complexity for ε-multicalibration by constructing a deterministic algorithm that attains this rate. It further extends the approach to produce deterministic predictors satisfying outcome indistinguishability (OI) for finite or finitely covered test collections, and derives from this optimal deterministic omnipredictors and panpredictors, addressing open problems in prior work.

Significance. If the algorithmic construction and its analysis are correct, the result is significant: it shows that deterministic predictors can match the optimal sample complexity previously known only for randomized predictors in multicalibration, OI, and omniprediction settings. This strengthens the toolkit for trustworthy ML by removing the need to trade determinism for statistical efficiency and directly resolves multiple cited open problems.

minor comments (2)

The abstract states the existence of the algorithm and its optimality but does not include even a high-level sketch of the construction or the key technical step that enables determinism while preserving the ̃O(ε^{-3}) rate; adding one sentence on the high-level idea would improve accessibility.
Notation for the collection of group weights is introduced as G in the abstract; ensure consistent use of this symbol (and any related notation for tests in the OI generalization) throughout the manuscript and that all symbols are defined before first use.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their positive summary of the paper and for recommending minor revision. The report correctly captures the main contributions and the resolution of the cited open questions. No major comments appear in the report, so there are no specific points requiring a point-by-point response.

Circularity Check

0 steps flagged

No significant circularity identified

full rationale

The paper claims to resolve an open problem by constructing a new minimax-optimal deterministic multicalibration algorithm (and its generalization to outcome indistinguishability for finite/finitely covered tests). The provided abstract and description contain no self-definitional reductions, no fitted parameters renamed as predictions, no load-bearing self-citations that justify uniqueness or ansatzes, and no renaming of known results. The central contribution is an explicit algorithmic construction whose correctness is independent of the cited open-problem statements; those citations identify the target rather than supply the derivation. The result is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Review performed on abstract only; no free parameters, axioms, or invented entities are extractable.

pith-pipeline@v0.9.1-grok · 5737 in / 845 out tokens · 26204 ms · 2026-06-26T18:24:13.109070+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

18 extracted references · 8 canonical work pages · 1 internal anchor

[1]

S´ ılvia Casacuberta, Cynthia Dwork, and Salil Vadhan

URLhttps://arxiv.org/abs/2510.27638. S´ ılvia Casacuberta, Cynthia Dwork, and Salil Vadhan. Complexity-theoretic implications of multicalibration. InProceedings of the 56th Annual ACM Symposium on Theory of Computing, pages 1071–1082,

arXiv
[2]

Optimal Lower Bounds for Online Multicalibration

Natalie Collina, Ira Globus-Harris, Surbhi Goel, Varun Gupta, Aaron Roth, and Mirah Shi. Collaborative prediction: Tractable information aggregation via agreement. InProceedings of the 2026 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA). SIAM, 2026a. Natalie Collina, Jiuyao Lu, Georgy Noarov, and Aaron Roth. Optimal lower bounds for online multic...

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2604.21923 2026
[3]

A Philip Dawid

doi: 10.1145/3717823.3718178. A Philip Dawid. The well-calibrated bayesian.Journal of the American Statistical Association, 77 (379):605–610,

work page doi:10.1145/3717823.3718178
[4]

A Stochastic Approximat ion Method

doi: 10.1214/aoms/1177729689. URLhttps://doi.org/10.1214/aoms/1177729689. Cynthia Dwork and Pranay Tankala. Supersimulators.arXiv preprint arXiv:2509.17994,

work page doi:10.1214/aoms/1177729689
[5]

Oracle efficient online multi- calibration and omniprediction

Sumegha Garg, Christopher Jung, Omer Reingold, and Aaron Roth. Oracle efficient online multi- calibration and omniprediction. InProceedings of the 2024 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 2725–2792. Society for Industrial and Applied Mathematics,

2024
[6]

Om- nipredictors

Parikshit Gopalan, Adam Tauman Kalai, Omer Reingold, Vatsal Sharan, and Udi Wieder. Om- nipredictors. In13th Innovations in Theoretical Computer Science Conference (ITCS 2022), pages 79:1–79:21. Schloss Dagstuhl – Leibniz-Zentrum f¨ ur Informatik,

2022
[7]

Loss minimization through the lens of outcome indistinguishability

Parikshit Gopalan, Lunjia Hu, Michael P Kim, Omer Reingold, and Udi Wieder. Loss minimization through the lens of outcome indistinguishability. In14th Innovations in Theoretical Computer Science Conference (ITCS 2023), 2023a. Parikshit Gopalan, Michael Kim, and Omer Reingold. Swap agnostic learning, or characterizing omniprediction via multicalibration. I...

2023
[8]

Ursula Hebert-Johnson, Michael Kim, Omer Reingold, and Guy Rothblum

doi: 10.1016/ 0890-5401(92)90010-D. Ursula Hebert-Johnson, Michael Kim, Omer Reingold, and Guy Rothblum. Multicalibration: Calibration for the (computationally-identifiable) masses. InProceedings of the 35th International Conference on Machine Learning, pages 1939–1948. PMLR,

1939
[9]

Comparative Learning: A Sample Complexity Theory for Two Hypothesis Classes

Lunjia Hu and Charlotte Peale. Comparative Learning: A Sample Complexity Theory for Two Hypothesis Classes. In Yael Tauman Kalai, editor,14th Innovations in Theoretical Computer Science Conference (ITCS 2023), volume 251 ofLeibniz International Proceedings in Informatics (LIPIcs), pages 72:1–72:30, Dagstuhl, Germany,

2023
[10]

ISBN 978-3-95977-263-1

Schloss Dagstuhl – Leibniz- Zentrum f¨ ur Informatik. ISBN 978-3-95977-263-1. doi: 10.4230/LIPIcs.ITCS.2023.72. URL https://drops.dagstuhl.de/entities/document/10.4230/LIPIcs.ITCS.2023.72. 31 Lunjia Hu, Charlotte Peale, and Omer Reingold. Metric entropy duality and the sample complexity of Outcome Indistinguishability. In Sanjoy Dasgupta and Nika Haghtala...

work page doi:10.4230/lipics.itcs.2023.72 2023
[11]

Michael Kearns, Aaron Roth, and Emily Ryu

URL https: //proceedings.mlr.press/v167/hu22a.html. Michael Kearns, Aaron Roth, and Emily Ryu. Networked information aggregation via machine learning. InProceedings of the 2026 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 4799–4845. SIAM,

2026
[12]

Derandomizing multi-distribution learning

Kasper Green Larsen, Omar Montasser, and Nikita Zhivotovskiy. Derandomizing multi-distribution learning. InAdvances in Neural Information Processing Systems 37 (NeurIPS 2024),

2024
[13]

URLhttps://arxiv.org/abs/2409.17567

doi: 10.52202/079017-2989. URLhttps://arxiv.org/abs/2409.17567. Daniel Lee, Georgy Noarov, Mallesh Pai, and Aaron Roth. Online minimax multiobjective opti- mization: Multicalibeating and other applications.Advances in Neural Information Processing Systems, 35:29051–29063,

work page doi:10.52202/079017-2989
[14]

Harrison Rosenberg, Robi Bhattacharjee, Kassem Fawaz, and Somesh Jha

doi: 10.1145/3406325.3451050. Harrison Rosenberg, Robi Bhattacharjee, Kassem Fawaz, and Somesh Jha. An exploration of multicalibration uniform convergence bounds.arXiv preprint arXiv:2202.04530,

work page doi:10.1145/3406325.3451050
[15]

URLhttps://doi.org/10.1137/S089548019223872X

doi: 10.1137/S089548019223872X. URLhttps://doi.org/10.1137/S089548019223872X. Eliran Shabat, Lee Cohen, and Yishay Mansour. Sample complexity of uniform convergence for multicalibration. InAdvances in Neural Information Processing Systems, volume 33, pages 13331–13340,

work page doi:10.1137/s089548019223872x
[16]

URLhttps://msp.org/pjm/1958/8-1/pjm-v8-n1-p14-s.pdf

doi: 10.2140/pjm.1958.8.171. URLhttps://msp.org/pjm/1958/8-1/pjm-v8-n1-p14-s.pdf. Zihan Zhang, Wenhao Zhan, Yuxin Chen, Simon S Du, and Jason D Lee. Optimal multi-distribution learning. InThe Thirty Seventh Annual Conference on Learning Theory, pages 5220–5223. PMLR,

work page doi:10.2140/pjm.1958.8.171 1958
[17]

Since T contains both signs of each test in A, this is exactly OIErrP (Q; A)

On the intersection of the two events, the population residual is bounded by the claimed quantity for every signed test in T . Since T contains both signs of each test in A, this is exactly OIErrP (Q; A). Finally, M≤ 2|A| and supp(Qx) ⊆ Λx because each online action has this support. E Deterministic panprediction from OI tests We use the notation of Balak...

2026
[18]

We use the standard Chernoff–Hoeffding moment bound for bounded k-wise independent sums [Schmidt et al., 1995]: if ZC are mean-zero, k-wise independent, |ZC| ≤b C, and B2 =P C b2 C, then for evenk, P X C ZC > C1B √ k ! ≤exp(−k). For the simple form used here, this follows directly from the usualkth-moment proof for independent bounded sums, since the kth ...

1995

[1] [1]

S´ ılvia Casacuberta, Cynthia Dwork, and Salil Vadhan

URLhttps://arxiv.org/abs/2510.27638. S´ ılvia Casacuberta, Cynthia Dwork, and Salil Vadhan. Complexity-theoretic implications of multicalibration. InProceedings of the 56th Annual ACM Symposium on Theory of Computing, pages 1071–1082,

arXiv

[2] [2]

Optimal Lower Bounds for Online Multicalibration

Natalie Collina, Ira Globus-Harris, Surbhi Goel, Varun Gupta, Aaron Roth, and Mirah Shi. Collaborative prediction: Tractable information aggregation via agreement. InProceedings of the 2026 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA). SIAM, 2026a. Natalie Collina, Jiuyao Lu, Georgy Noarov, and Aaron Roth. Optimal lower bounds for online multic...

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2604.21923 2026

[3] [3]

A Philip Dawid

doi: 10.1145/3717823.3718178. A Philip Dawid. The well-calibrated bayesian.Journal of the American Statistical Association, 77 (379):605–610,

work page doi:10.1145/3717823.3718178

[4] [4]

A Stochastic Approximat ion Method

doi: 10.1214/aoms/1177729689. URLhttps://doi.org/10.1214/aoms/1177729689. Cynthia Dwork and Pranay Tankala. Supersimulators.arXiv preprint arXiv:2509.17994,

work page doi:10.1214/aoms/1177729689

[5] [5]

Oracle efficient online multi- calibration and omniprediction

Sumegha Garg, Christopher Jung, Omer Reingold, and Aaron Roth. Oracle efficient online multi- calibration and omniprediction. InProceedings of the 2024 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 2725–2792. Society for Industrial and Applied Mathematics,

2024

[6] [6]

Om- nipredictors

Parikshit Gopalan, Adam Tauman Kalai, Omer Reingold, Vatsal Sharan, and Udi Wieder. Om- nipredictors. In13th Innovations in Theoretical Computer Science Conference (ITCS 2022), pages 79:1–79:21. Schloss Dagstuhl – Leibniz-Zentrum f¨ ur Informatik,

2022

[7] [7]

Loss minimization through the lens of outcome indistinguishability

Parikshit Gopalan, Lunjia Hu, Michael P Kim, Omer Reingold, and Udi Wieder. Loss minimization through the lens of outcome indistinguishability. In14th Innovations in Theoretical Computer Science Conference (ITCS 2023), 2023a. Parikshit Gopalan, Michael Kim, and Omer Reingold. Swap agnostic learning, or characterizing omniprediction via multicalibration. I...

2023

[8] [8]

Ursula Hebert-Johnson, Michael Kim, Omer Reingold, and Guy Rothblum

doi: 10.1016/ 0890-5401(92)90010-D. Ursula Hebert-Johnson, Michael Kim, Omer Reingold, and Guy Rothblum. Multicalibration: Calibration for the (computationally-identifiable) masses. InProceedings of the 35th International Conference on Machine Learning, pages 1939–1948. PMLR,

1939

[9] [9]

Comparative Learning: A Sample Complexity Theory for Two Hypothesis Classes

Lunjia Hu and Charlotte Peale. Comparative Learning: A Sample Complexity Theory for Two Hypothesis Classes. In Yael Tauman Kalai, editor,14th Innovations in Theoretical Computer Science Conference (ITCS 2023), volume 251 ofLeibniz International Proceedings in Informatics (LIPIcs), pages 72:1–72:30, Dagstuhl, Germany,

2023

[10] [10]

ISBN 978-3-95977-263-1

Schloss Dagstuhl – Leibniz- Zentrum f¨ ur Informatik. ISBN 978-3-95977-263-1. doi: 10.4230/LIPIcs.ITCS.2023.72. URL https://drops.dagstuhl.de/entities/document/10.4230/LIPIcs.ITCS.2023.72. 31 Lunjia Hu, Charlotte Peale, and Omer Reingold. Metric entropy duality and the sample complexity of Outcome Indistinguishability. In Sanjoy Dasgupta and Nika Haghtala...

work page doi:10.4230/lipics.itcs.2023.72 2023

[11] [11]

Michael Kearns, Aaron Roth, and Emily Ryu

URL https: //proceedings.mlr.press/v167/hu22a.html. Michael Kearns, Aaron Roth, and Emily Ryu. Networked information aggregation via machine learning. InProceedings of the 2026 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 4799–4845. SIAM,

2026

[12] [12]

Derandomizing multi-distribution learning

Kasper Green Larsen, Omar Montasser, and Nikita Zhivotovskiy. Derandomizing multi-distribution learning. InAdvances in Neural Information Processing Systems 37 (NeurIPS 2024),

2024

[13] [13]

URLhttps://arxiv.org/abs/2409.17567

doi: 10.52202/079017-2989. URLhttps://arxiv.org/abs/2409.17567. Daniel Lee, Georgy Noarov, Mallesh Pai, and Aaron Roth. Online minimax multiobjective opti- mization: Multicalibeating and other applications.Advances in Neural Information Processing Systems, 35:29051–29063,

work page doi:10.52202/079017-2989

[14] [14]

Harrison Rosenberg, Robi Bhattacharjee, Kassem Fawaz, and Somesh Jha

doi: 10.1145/3406325.3451050. Harrison Rosenberg, Robi Bhattacharjee, Kassem Fawaz, and Somesh Jha. An exploration of multicalibration uniform convergence bounds.arXiv preprint arXiv:2202.04530,

work page doi:10.1145/3406325.3451050

[15] [15]

URLhttps://doi.org/10.1137/S089548019223872X

doi: 10.1137/S089548019223872X. URLhttps://doi.org/10.1137/S089548019223872X. Eliran Shabat, Lee Cohen, and Yishay Mansour. Sample complexity of uniform convergence for multicalibration. InAdvances in Neural Information Processing Systems, volume 33, pages 13331–13340,

work page doi:10.1137/s089548019223872x

[16] [16]

URLhttps://msp.org/pjm/1958/8-1/pjm-v8-n1-p14-s.pdf

doi: 10.2140/pjm.1958.8.171. URLhttps://msp.org/pjm/1958/8-1/pjm-v8-n1-p14-s.pdf. Zihan Zhang, Wenhao Zhan, Yuxin Chen, Simon S Du, and Jason D Lee. Optimal multi-distribution learning. InThe Thirty Seventh Annual Conference on Learning Theory, pages 5220–5223. PMLR,

work page doi:10.2140/pjm.1958.8.171 1958

[17] [17]

Since T contains both signs of each test in A, this is exactly OIErrP (Q; A)

On the intersection of the two events, the population residual is bounded by the claimed quantity for every signed test in T . Since T contains both signs of each test in A, this is exactly OIErrP (Q; A). Finally, M≤ 2|A| and supp(Qx) ⊆ Λx because each online action has this support. E Deterministic panprediction from OI tests We use the notation of Balak...

2026

[18] [18]

We use the standard Chernoff–Hoeffding moment bound for bounded k-wise independent sums [Schmidt et al., 1995]: if ZC are mean-zero, k-wise independent, |ZC| ≤b C, and B2 =P C b2 C, then for evenk, P X C ZC > C1B √ k ! ≤exp(−k). For the simple form used here, this follows directly from the usualkth-moment proof for independent bounded sums, since the kth ...

1995