Statistical Guarantees in the Search for Less Discriminatory Algorithms

Ben Laufer; Chris Hays; Manish Raghavan; Solon Barocas

arxiv: 2512.23943 · v2 · pith:NUQ3NKIAnew · submitted 2025-12-30 · 💻 cs.CY · cs.LG· stat.ME

Statistical Guarantees in the Search for Less Discriminatory Algorithms

Chris Hays , Ben Laufer , Solon Barocas , Manish Raghavan This is my paper

Pith reviewed 2026-05-21 16:21 UTC · model grok-4.3

classification 💻 cs.CY cs.LGstat.ME

keywords less discriminatory algorithmsdisparate impactoptimal stoppingmodel multiplicitystatistical guaranteesalgorithmic fairnessdiscrimination law

0 comments

The pith

An adaptive stopping algorithm supplies high-probability bounds showing when further retraining is unlikely to produce meaningfully less discriminatory models.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper frames the search for less discriminatory algorithms as an optimal stopping problem in which a developer keeps retraining models with new random seeds until evidence shows that additional runs are unlikely to reduce disparate impact much further while preserving accuracy. It introduces an adaptive algorithm that computes a statistical upper bound on the best possible improvement still attainable from continued search. A reader would care because U.S. discrimination law can require firms to demonstrate good-faith efforts to find equally effective but less discriminatory alternatives, and this method offers a concrete way to certify that the search has gone far enough. The work also shows that stronger assumptions about how models are distributed can tighten the bounds and validates the procedure on real credit and housing data.

Core claim

We formalize LDA search under model multiplicity as an optimal stopping problem and present an adaptive stopping algorithm that returns a high-probability upper bound on the best disparate-impact gains still attainable by continued retraining, allowing developers to certify that further search is unlikely to help.

What carries the argument

Adaptive stopping algorithm that produces a high-probability upper bound on remaining disparate-impact improvement attainable through further retraining.

If this is right

Developers can produce evidence for regulators or courts that additional retraining is unlikely to yield meaningful fairness gains.
Tighter bounds become available when stronger distributional assumptions are imposed on the space of possible models.
The same stopping logic applies to real-world credit and housing datasets without requiring exhaustive search.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The approach could be adapted to other fairness metrics such as equalized odds by replacing the disparate-impact objective.
Firms might integrate the bound into routine model-selection pipelines to limit compute spent on bias audits.
The method highlights that model multiplicity can be turned into a statistical resource rather than an exhaustive enumeration problem.

Load-bearing premise

Models obtained by retraining with different random seeds can be treated as draws from a distribution that permits concentration inequalities or other probabilistic bounds on future improvement.

What would settle it

Run the stopping algorithm on a dataset where the true minimum attainable disparate impact across all possible retrainings is known in advance and check whether the reported upper bound is ever violated.

Figures

Figures reproduced from arXiv: 2512.23943 by Ben Laufer, Chris Hays, Manish Raghavan, Solon Barocas.

**Figure 2.** Figure 2: Algorithm 1 run on several datasets and models. Panel rows are ML methods and panel [PITH_FULL_IMAGE:figures/full_fig_p015_2.png] view at source ↗

**Figure 3.** Figure 3: Version of Figure 1 using the fairlearn methods. The disparate impact distribution is shifted [PITH_FULL_IMAGE:figures/full_fig_p020_3.png] view at source ↗

**Figure 4.** Figure 4: Algorithm 2 run using the same setup as for Figure 2. For comparison, the upper bound from [PITH_FULL_IMAGE:figures/full_fig_p021_4.png] view at source ↗

**Figure 5.** Figure 5: Algorithm 1 run on fairness-aware methods with the same datasets and base methods as in [PITH_FULL_IMAGE:figures/full_fig_p022_5.png] view at source ↗

**Figure 6.** Figure 6: Selection rate disparities for each dataset and model class. Reported number is the mean over [PITH_FULL_IMAGE:figures/full_fig_p022_6.png] view at source ↗

**Figure 7.** Figure 7: Accuracy for each dataset and model class. Reported number is the mean over all runs. [PITH_FULL_IMAGE:figures/full_fig_p023_7.png] view at source ↗

**Figure 8.** Figure 8: Miscoverage for each dataset and model class. Reported number is the average number of [PITH_FULL_IMAGE:figures/full_fig_p024_8.png] view at source ↗

**Figure 9.** Figure 9: Selection effect difference (EP0 [Ut − Uˆ t − Ut+1 + Uˆ t+1 | Uˆ t ]) versus the percentile of Qˆ t . 25 [PITH_FULL_IMAGE:figures/full_fig_p025_9.png] view at source ↗

**Figure 10.** Figure 10: Version of Figure 9 with Fairlearn model classes. [PITH_FULL_IMAGE:figures/full_fig_p026_10.png] view at source ↗

read the original abstract

U.S. discrimination law can impose liability on firms that fail to adopt a less discriminatory alternative (LDA): a decision policy that achieves the same business objectives while reducing disparate impact on legally protected groups. Recent scholarship argues that this doctrine has direct implications for algorithmic decision-making in high-stakes domains such as employment, lending, and housing, potentially obligating firms to search for "less discriminatory algorithms" (Black et al., 2024). Regulators have at times encouraged proactive LDA searches, reinforcing the expectation of a good-faith effort to identify equally performant models with lower disparate impact. Model multiplicity makes such searches plausible: retraining with different random seeds can yield models with comparable predictive performance but materially different disparate impacts. Yet firms cannot retrain indefinitely, raising a central question: when is the search sufficient to demonstrate good faith? We formalize LDA search under multiplicity as an optimal stopping problem in which a developer seeks to produce evidence that further search is unlikely to yield meaningful improvements. Our main contribution is an adaptive stopping algorithm that provides a high-probability upper bound on the best disparate-impact gains attainable through continued retraining, enabling developers to certify (e.g., to a court) that additional search is unlikely to help. We also show how stronger distributional assumptions over the model space can yield tighter bounds, and we validate the approach on real-world credit and housing datasets.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper turns LDA search into an optimal stopping problem and supplies a high-probability bound on remaining gains from more retraining, which could help with compliance claims, but the bound rests on treating random-seed models as samples from a suitable distribution.

read the letter

The punchline is that this work gives developers a concrete statistical rule for stopping an LDA search while still being able to claim they have evidence that further effort is unlikely to produce meaningful disparate-impact improvement. That framing directly responds to the good-faith search expectation that has come up in recent legal writing on algorithmic discrimination.

Referee Report

2 major / 1 minor

Summary. The paper formalizes the search for less discriminatory algorithms (LDAs) as an optimal stopping problem under model multiplicity, where retraining with different random seeds can produce models with similar performance but varying disparate impact. The main contribution is an adaptive stopping algorithm that supplies a high-probability upper bound on the best attainable disparate-impact gains from further retraining, allowing developers to certify that additional search is unlikely to yield meaningful improvements. Stronger distributional assumptions over the model space are shown to yield tighter bounds, and the method is validated on credit and housing datasets.

Significance. If the probabilistic bounds are correctly derived, the work offers a timely statistical framework for demonstrating good-faith LDA searches in domains governed by U.S. discrimination law. It connects optimal stopping theory to algorithmic fairness and provides a practical certification tool for firms. The real-data validation on credit and housing datasets is a positive feature that grounds the claims, though the result's utility depends on the defensibility of treating random-seed retrainings as samples from a distribution amenable to concentration inequalities.

major comments (2)

[Abstract and optimal stopping formulation] Abstract, paragraph on optimal stopping formulation: the high-probability upper bound on remaining disparate-impact gains is derived by treating the sequence of models from different random seeds as draws from a (possibly unknown) distribution permitting concentration inequalities. This modeling choice is load-bearing for the guarantee yet receives only brief mention; the manuscript must explicitly state the minimal assumptions (e.g., exchangeability or i.i.d.), derive the specific tail bound used, and include a sensitivity analysis showing how bound validity degrades under plausible misspecification.
[Adaptive stopping algorithm] Section describing the adaptive stopping algorithm and bound computation: no explicit concentration inequality or step-by-step derivation of how the observed sequence of disparate-impact values is turned into the reported high-probability upper bound appears in the provided description. Without this, the central claim that the algorithm 'certifies that additional search is unlikely to help' cannot be evaluated for correctness.

minor comments (1)

[Notation and definitions] Clarify notation for 'disparate-impact gains' versus raw disparate impact; ensure the stopping criterion is defined with an explicit equation rather than prose only.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed feedback. The comments highlight opportunities to improve the clarity of our statistical derivations and assumptions, which we have addressed through targeted revisions. We respond to each major comment below.

read point-by-point responses

Referee: [Abstract and optimal stopping formulation] Abstract, paragraph on optimal stopping formulation: the high-probability upper bound on remaining disparate-impact gains is derived by treating the sequence of models from different random seeds as draws from a (possibly unknown) distribution permitting concentration inequalities. This modeling choice is load-bearing for the guarantee yet receives only brief mention; the manuscript must explicitly state the minimal assumptions (e.g., exchangeability or i.i.d.), derive the specific tail bound used, and include a sensitivity analysis showing how bound validity degrades under plausible misspecification.

Authors: We agree that greater explicitness on the modeling assumptions will strengthen the paper. In the revised manuscript we have expanded the optimal stopping formulation section to state the minimal assumptions: the sequence of disparate-impact values obtained from random-seed retrainings is treated as exchangeable (hence amenable to concentration via de Finetti-type representations), with the i.i.d. case as a leading special case. We now derive the bound explicitly from Hoeffding’s inequality applied to the empirical maximum, and we have added an appendix containing a sensitivity analysis that quantifies bound degradation under controlled violations such as mild dependence or heavier tails. revision: yes
Referee: [Adaptive stopping algorithm] Section describing the adaptive stopping algorithm and bound computation: no explicit concentration inequality or step-by-step derivation of how the observed sequence of disparate-impact values is turned into the reported high-probability upper bound appears in the provided description. Without this, the central claim that the algorithm 'certifies that additional search is unlikely to help' cannot be evaluated for correctness.

Authors: We appreciate the referee’s call for a clearer derivation. Although the original manuscript contained the core steps, they were not presented with sufficient detail or separation. The revised Section 3 now includes a dedicated subsection that (i) states the observed sequence, (ii) recalls the relevant concentration inequality, (iii) shows the algebraic steps converting the empirical maximum into a high-probability upper bound on the attainable improvement, and (iv) presents the resulting adaptive stopping rule in both mathematical and algorithmic form. We believe this makes the certification claim fully evaluable. revision: yes

Circularity Check

0 steps flagged

No circularity: bound derived from external concentration inequalities on assumed model distribution

full rationale

The paper's central derivation formalizes LDA search as an optimal stopping problem and constructs a high-probability upper bound on remaining disparate-impact gains by applying concentration inequalities to the sequence of retrained models, which are modeled as draws from a (possibly unknown) distribution over model space. This modeling choice and the resulting tail bounds are external probabilistic machinery rather than a self-referential fit or definition; the bound is not shown to reduce by the paper's own equations to a quantity defined in terms of the same stopping data. No self-citation chains, ansatzes smuggled via prior work, or renaming of known results appear load-bearing in the abstract or described contribution. The approach remains self-contained against external benchmarks once the distributional assumption is granted.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Only the abstract is available; the ledger is therefore limited to elements explicitly invoked there. The central claim rests on the existence of a distribution over retrained models that supports concentration bounds and on the validity of the optimal-stopping reduction.

axioms (1)

domain assumption Models obtained by retraining with different random seeds behave as draws from a distribution permitting high-probability bounds on remaining disparate-impact improvement.
Invoked in the optimal-stopping formulation described in the abstract.

pith-pipeline@v0.9.0 · 5787 in / 1249 out tokens · 36351 ms · 2026-05-21T16:21:36.259367+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

4 extracted references · 4 canonical work pages

[1]

For HMDA, we use the cleaned dataset given in Cooper et al

The prediction target is whether the individual is employed. For HMDA, we use the cleaned dataset given in Cooper et al. (2024) for New York in 2017. The prediction target is whether the home mortgage was originated. Across all datasets, the reference group is all individuals designated White and the protected group is all individuals not designated White...

work page 2024
[2]

First, show that it suffices to consider the case where theXt are uniform on [0,1] via the probability integral transform

work page
[3]

Then, we show that it suffices to provide an anytime-valid upper bound on the running minimum of the sequence

work page
[4]

method of mixtures

Finally, we show that ¯pt as defined above yields such a bound. We begin by using the probability integral transform to “convert” ourX t’s into uniform random vari- ables. LetF X be the CDF ofX t. Define F −1 X (u) = inf{x:F X(x)≥u}. Let{U t}∞ t=1 be iid uniform random variables on [0,1], defined onP. LetV t = min s∈[t] Us. Then, it holds, by e.g. Ch. 6, ...

work page 2000

[1] [1]

For HMDA, we use the cleaned dataset given in Cooper et al

The prediction target is whether the individual is employed. For HMDA, we use the cleaned dataset given in Cooper et al. (2024) for New York in 2017. The prediction target is whether the home mortgage was originated. Across all datasets, the reference group is all individuals designated White and the protected group is all individuals not designated White...

work page 2024

[2] [2]

First, show that it suffices to consider the case where theXt are uniform on [0,1] via the probability integral transform

work page

[3] [3]

Then, we show that it suffices to provide an anytime-valid upper bound on the running minimum of the sequence

work page

[4] [4]

method of mixtures

Finally, we show that ¯pt as defined above yields such a bound. We begin by using the probability integral transform to “convert” ourX t’s into uniform random vari- ables. LetF X be the CDF ofX t. Define F −1 X (u) = inf{x:F X(x)≥u}. Let{U t}∞ t=1 be iid uniform random variables on [0,1], defined onP. LetV t = min s∈[t] Us. Then, it holds, by e.g. Ch. 6, ...

work page 2000