When Bits Break Recourse: Counterfactual-Faithful Quantization

Chaymae Yahyati; Ibrahim Ouahbi; Ismail Lamaakal; Khalid El Makkaoui

arXiv: 2605.17160 · v1 · pith:LYZO6TGSnew · submitted 2026-05-16 · 💻 cs.LG · cs.AI · cs.CV

Pith reviewed 2026-05-20 14:39 UTC · model grok-4.3

classification 💻 cs.LG · cs.AI · cs.CV

keywords quantization · algorithmic recourse · counterfactual explanations · model compression · machine learning deployment · fairness · credit scoring

The pith

Counterfactual-Faithful Quantization keeps recourse valid and low-cost after bit reduction while matching full-precision accuracy.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that standard quantization preserves predictive accuracy on tabular datasets yet often invalidates or inflates the cost of recourse actions that worked on the original model. It introduces Counterfactual-Faithful Quantization to jointly optimize quantizer parameters and mixed-precision bit allocation so that recourse points from the full-precision teacher remain correctly classified under a global bit budget. A margin-based argument supplies a sufficient condition under which bounded quantization perturbations transfer validity, cost, and direction stability from teacher to student. Experiments on Adult, German Credit, and COMPAS data show accuracy-matched baselines degrade recourse metrics while the proposed method improves them across bit widths.

Core claim

Quantization perturbs model outputs enough to invalidate many recourse actions that worked on the full-precision model. Counterfactual-Faithful Quantization (CFQ) solves this by jointly learning quantization parameters and bit allocations so that the quantized model still classifies the recourse points from the teacher model correctly, under a fixed total bit budget. This is supported by a sufficient condition derived from margin analysis that guarantees stability when perturbations are bounded.
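The objective described in this claim can be sketched in a few lines. This is a minimal illustration only, assuming a linear logistic student, a symmetric uniform quantizer, a hinge penalty on the target logit at the teacher recourse point, and a favorable target class of 1. The names `quantize`, `cfq_loss`, `lam`, and `margin` are hypothetical and are not taken from the paper.

```python
import math

def quantize(weights, bits, scale):
    """Symmetric uniform quantizer: snap each weight to a signed integer grid."""
    qmax = 2 ** (bits - 1) - 1
    return [scale * max(-qmax - 1, min(qmax, round(w / scale))) for w in weights]

def logit(weights, bias, x):
    return sum(w * xi for w, xi in zip(weights, x)) + bias

def cfq_loss(w_fp, bias, bits, scale, batch, lam=1.0, margin=0.1):
    """Task loss plus a penalty when the quantized model rejects x + delta_fp.

    Each batch item is (x, y, delta_fp); delta_fp is a fixed teacher action
    (assumed form of the counterfactual term, not the paper's exact loss).
    """
    w_q = quantize(w_fp, bits, scale)
    task, cf = 0.0, 0.0
    for x, y, delta_fp in batch:
        z = logit(w_q, bias, x)
        # Numerically stable binary cross-entropy on the logit z.
        task += max(z, 0.0) - y * z + math.log1p(math.exp(-abs(z)))
        # Hinge on the target logit at the teacher recourse point.
        x_cf = [xi + di for xi, di in zip(x, delta_fp)]
        cf += max(0.0, margin - logit(w_q, bias, x_cf))
    return (task + lam * cf) / len(batch)

batch = [([1.0, 1.0], 0, [0.5, 0.0])]
fine = cfq_loss([1.0, -1.0], 0.0, bits=8, scale=0.01, batch=batch)
coarse = cfq_loss([1.0, -1.0], 0.0, bits=2, scale=2.0, batch=batch)
```

In this toy case the factual prediction, and therefore the task loss, is identical for both quantizers. Only the counterfactual term separates them, because the coarse quantizer no longer accepts the teacher recourse point.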

What carries the argument

Counterfactual-Faithful Quantization (CFQ), which enforces the target outcome at teacher recourse points during quantizer training and mixed-precision allocation.
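The Figure 1 caption states that the teacher recourse action is computed with a small number of projected gradient steps and then held constant. A rough sketch of such a teacher step follows, assuming a linear teacher logit, plain gradient ascent on that logit, and a projection onto a per-feature box with immutable features zeroed. The function name and the `lr`, `max_change`, and `mutable` parameters are illustrative assumptions, not the paper's solver.

```python
def teacher_recourse(w, b, x, steps=5, lr=0.5, max_change=1.0, mutable=None):
    """K-step projected gradient ascent on a linear teacher logit w.x + b."""
    mutable = mutable if mutable is not None else [True] * len(x)
    delta = [0.0] * len(x)
    for _ in range(steps):
        z = sum(wi * (xi + di) for wi, xi, di in zip(w, x, delta)) + b
        if z > 0:  # the teacher already returns the favorable outcome
            break
        # For a linear teacher the gradient of the logit w.r.t. delta is w.
        delta = [d + lr * wi for d, wi in zip(delta, w)]
        # Projection: clip to the action box and freeze immutable features.
        delta = [max(-max_change, min(max_change, d)) if m else 0.0
                 for d, m in zip(delta, mutable)]
    return delta  # treated as a constant when training the quantized model

delta_fp = teacher_recourse([1.0, -2.0], -0.8, [0.0, 0.0], mutable=[True, False])
```

Returning `delta` as a plain list mirrors the stop-gradient treatment: the student never differentiates through the teacher's search.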

If this is right

Accuracy can stay comparable to full-precision models while validity drop and counterfactual recourse gap improve across bit budgets.
Recourse stability holds for the tested tabular datasets when the global bit constraint is enforced during training.
Mixed-precision allocation guided by counterfactual fidelity outperforms uniform accuracy-focused quantization on stability metrics.
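To make the idea of a global bit budget concrete, the sketch below compares two per-layer allocations with the same total cost. It assumes the cost is the sum of parameter count times bit-width per layer and that the normalized budget is taken relative to 32-bit storage; both definitions and the layer sizes are assumptions for illustration.

```python
def bit_cost(param_counts, bitwidths):
    """Total storage in bits for a per-layer bit-width assignment."""
    return sum(n * b for n, b in zip(param_counts, bitwidths))

def normalized_budget(param_counts, bitwidths, full_bits=32):
    """Bit cost relative to storing every parameter at full precision."""
    return bit_cost(param_counts, bitwidths) / (full_bits * sum(param_counts))

layers = [1000, 500, 10]   # hypothetical parameter counts per layer
uniform = [4, 4, 4]        # accuracy-centric uniform INT4
mixed = [3, 6, 4]          # same total bits, shifted toward the second layer

uniform_cost = bit_cost(layers, uniform)
mixed_cost = bit_cost(layers, mixed)
ratio = normalized_budget(layers, mixed)
```

Both assignments cost the same number of bits, so any difference in recourse stability between them would come from where the precision is spent rather than from how much is spent.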

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Deployers of quantized models in lending or criminal justice may need to adopt counterfactual-aware training to keep explanations actionable after compression.
The same bounded-perturbation logic could apply to other compression methods such as pruning if analogous margin conditions can be derived.
Future low-bit training pipelines might treat recourse metrics as first-class objectives alongside accuracy.

Load-bearing premise

The margin analysis assumes quantization perturbations remain bounded so that recourse transfers from the full-precision teacher to the quantized student.
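The premise can be written as a short worked argument. This is a sketch of the standard margin reasoning consistent with the summary above, not a reproduction of the paper's proposition; the symbols $m$ and $\varepsilon$ and the use of a scalar target-class logit are assumptions.

```latex
% x' = x + \delta_{fp} is the teacher recourse point;
% f and f_q are the target-class logits of the teacher and the quantized student.
\begin{align*}
  &\text{Assume } f(x') = m > 0 \quad\text{and}\quad |f_q(x') - f(x')| \le \varepsilon. \\
  &\text{Then } f_q(x') \;\ge\; f(x') - \varepsilon \;=\; m - \varepsilon, \\
  &\text{so } \varepsilon < m \;\Longrightarrow\; f_q(x') > 0 .
\end{align*}
```

The condition is sufficient only: when $\varepsilon \ge m$ the recourse may still be valid, but the argument no longer guarantees it.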

What would settle it

Measure whether recourse actions valid on the full-precision model remain valid on the quantized version once the observed quantization error exceeds the margin bound used in the proof.
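One way to run this check is sketched below, assuming the target-class margins at each teacher recourse point are available for both models. Points are split by whether the observed margin change stays below the teacher margin, and validity on the quantized model is reported for each group. The function name and the sample margins are hypothetical.

```python
def split_by_margin_bound(margins_fp, margins_q):
    """Group recourse points by whether |m_q - m_fp| < m_fp, then report validity."""
    covered, uncovered = [], []
    for m_fp, m_q in zip(margins_fp, margins_q):
        error = abs(m_q - m_fp)
        group = covered if error < m_fp else uncovered
        group.append(m_q > 0)  # valid on the quantized model?

    def rate(flags):
        return sum(flags) / len(flags) if flags else None

    return {
        "n_covered": len(covered),
        "covered_valid_rate": rate(covered),
        "n_uncovered": len(uncovered),
        "uncovered_valid_rate": rate(uncovered),
    }

report = split_by_margin_bound(
    margins_fp=[0.5, 0.2, 0.1, 0.3],   # made-up teacher margins
    margins_q=[0.4, 0.25, -0.05, 0.7],  # made-up quantized margins
)
```

Covered points are valid by construction, so the informative number is the validity rate among uncovered points, where only the empirical objective can explain any transfer.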

Figures

Figures reproduced from arXiv: 2605.17160 by Chaymae Yahyati, Ibrahim Ouahbi, Ismail Lamaakal, Khalid El Makkaoui.

**Figure 1.** CFQ pipeline. A teacher recourse action δ_fp is computed on the full-precision model using a small number of projected gradient steps and is treated as a constant via stop-gradient. The quantized model f_q is trained to preserve task accuracy and to keep x + δ_fp a valid counterfactual under quantization, while mixed-precision bit allocation and quantizer parameters are optimized under a bit budget. We do no…

**Figure 2.** Same accuracy, different recourse. Quantization can shift local decision geometry: a minimal actionable recourse δ⋆_f that flips the full-precision model may fail for the quantized model, requiring a different or larger action δ⋆_{f_q} even when predictive accuracy is unchanged.

**Figure 3.** CFQ improves recourse validity across compression regimes. Validity Drop (VD) as a function of the normalized bit budget. Lower VD indicates that full-precision recourse actions transfer more reliably to the quantized model. CFQ consistently yields lower VD than accuracy-centric quantization baselines, with the largest gains in low-bit regimes where quantization perturbations are strongest. Relative to th…

**Figure 4.** [caption not recoverable from the source]

**Figure 5.** Recourse cost inflation vs. compression budget. Each panel plots CRG as a function of normalized bit budget. CRG measures the relative change in minimal recourse cost induced by quantization. CFQ consistently reduces cost inflation, indicating that quantization is less likely to increase the effort required to achieve the favorable outcome.

**Figure 6.** Subgroup visualization for VD. Bar plot of subgroup VD for ADULT split by sex at a matched budget. VD measures whether FP32 recourse actions remain valid after quantization. CFQ reduces VD for each subgroup and improves worst-group VD, indicating more reliable transfer of recourse under compression.

**Figure 7.** Margin diagnostics at recourse points. Empirical CDFs of the target margin evaluated at the FP32 recourse point x + δ⋆_f. Shifting the quantized margin distribution toward larger positive values corresponds to a lower probability that quantization flips the decision at the recourse point, providing a mechanistic explanation for reductions in VD under CFQ.

**Figure 8.** Example learned bit allocation under a fixed budget. Per-layer bitwidths (x-axis: layer index; y-axis: assigned bitwidth) selected by a standard mixed-precision baseline and by CFQ. CFQ tends to allocate higher precision to layers that most affect decision-boundary geometry near recourse points, consistent with improvements in VD/CRG at the same global BitCost.

**Figure 9.** Cost–robustness trade-off under deployment variability. Each point shows robust success versus mean recourse cost for ADULT. The robust solver increases robustness by optimizing against multiple sampled deployment variants, but it increases the recourse cost. CFQ shifts the trade-off by making the deployed quantized model intrinsically less sensitive at recourse points, improving robust success at near-bas…

**Figure 10.** Validity Drop on ADULT under increasingly recourse-aware quantization. Standard PTQ INT4 produces the largest recourse failure rate. Mixed-precision PTQ reduces VD by allocating bits based on factual sensitivity, but it still ignores teacher recourse points. CF-PTQ calibration lowers VD by calibrating scales and clipping thresholds on both factual and counterfactual inputs. Counterfactual sensitivity allo…

**Figure 11.** Counterfactual Recourse Gap on ADULT. CRG measures whether quantization increases the minimum action cost required to achieve the target outcome. The decreasing trend from PTQ INT4 to CF-PTQ and CFQ-QAT shows that counterfactual-aware calibration reduces the tendency of the quantized model to “move the goalpost” after deployment. This complements VD: an action may remain valid in some cases but become sig…

**Figure 12.** Cost–stability trade-off on ADULT. The horizontal axis reports relative calibration or training overhead normalized by standard PTQ. CF-PTQ occupies the middle regime between PTQ and full CFQ-QAT: it gives a substantial reduction in VD with lower cost than QAT. This figure directly motivates CF-PTQ as a practical deployment variant when full QAT is too expensive.

**Figure 13.** Layer-wise sensitivity profiles used for mixed-precision PTQ on …

**Figure 14.** Effect of teacher PGD steps on recourse stability. Increasing the number of teacher …

**Figure 15.** Predictive accuracy under different teacher-recoursing budgets. Accuracy is unchanged …

**Figure 16.** Effect of teacher-action noise on VD and CRG. Both metrics increase smoothly as the …

**Figure 17.** Training-time overhead as a function of the teacher-recoursing budget. The relative …

**Figure 18.** Validity Drop under different action-constraint regimes. CFQ consistently reduces the …

**Figure 19.** Counterfactual Recourse Gap under different action-constraint regimes. CFQ reduces …

**Figure 20.** Feasible Recourse Rate of the full-precision model under different action constraints. FRR …

**Figure 21.** Validity Drop under distribution shift. CFQ consistently reduces recourse failure relative …

**Figure 22.** Counterfactual Recourse Gap under distribution shift. CFQ lowers the relative increase …

**Figure 23.** Subgroup Validity Drop across datasets. CFQ reduces post-quantization recourse failure …

**Figure 24.** Subgroup Counterfactual Recourse Gap across datasets. CFQ reduces the relative increase …

**Figure 25.** Rate–recourse curves on non-tabular datasets. Validity Drop increases as the average …

**Figure 26.** Rate–CRG curves on non-tabular datasets. Lower bitwidths increase the relative recourse …

Original abstract

Quantization can preserve predictive accuracy under low-bit deployment while silently breaking algorithmic recourse: an actionable change that flips a decision before quantization may fail after quantization, or become substantially more costly. We formalize counterfactual sensitivity under quantization through validity, cost, and direction stability, and introduce two metrics: Validity Drop (VD) and Counterfactual Recourse Gap (CRG) that reveal recourse failures invisible to accuracy. We propose Counterfactual-Faithful Quantization (CFQ), which trains quantizer parameters and mixed-precision bit allocation to preserve counterfactual behavior by enforcing the target outcome at teacher recourse points under a global bit budget. A margin-based analysis gives a sufficient condition for recourse transfer under bounded quantization perturbations. Experiments on Adult, German Credit, and COMPAS show that accuracy-matched baselines can significantly degrade recourse stability, while CFQ maintains accuracy and substantially improves VD and CRG across bit budgets.
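The two metrics can be illustrated with a small sketch. The exact definitions are not given on this page, so the forms below are assumptions based on the figure captions: VD as the share of teacher-valid recourse actions that fail on the quantized model, and CRG as the mean relative change in minimal recourse cost. All sample values are made up.

```python
def validity_drop(valid_fp, valid_q):
    """Share of recourse actions valid on the teacher that fail after quantization."""
    transferred = [q for f, q in zip(valid_fp, valid_q) if f]
    if not transferred:
        return 0.0
    return sum(1 for q in transferred if not q) / len(transferred)

def counterfactual_recourse_gap(cost_fp, cost_q):
    """Mean relative change in minimal recourse cost (assumed form of CRG)."""
    gaps = [(cq - cf) / cf for cf, cq in zip(cost_fp, cost_q) if cf > 0]
    return sum(gaps) / len(gaps) if gaps else 0.0

vd = validity_drop(
    valid_fp=[True, True, True, False],
    valid_q=[True, False, True, True],
)
crg = counterfactual_recourse_gap(cost_fp=[1.0, 2.0], cost_q=[1.5, 2.0])
```

Neither quantity depends on the labels of the factual inputs, which is why both can move while test accuracy stays flat.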

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note (private letter to a colleague)

The paper flags a real problem with quantization breaking recourse and gives a training fix plus two new metrics, but the formal margin bound looks too loose for the low-bit cases they test.

The core takeaway is that standard quantization can wreck recourse stability without touching accuracy, and CFQ tries to fix that by baking counterfactual preservation into the quantizer training and bit allocation. They define Validity Drop and Counterfactual Recourse Gap to catch these failures, then optimize the quantizer so that teacher-model recourse points stay valid under the quantized model within a global bit budget. That combination of new metrics and the joint optimization is the actual novelty here, not just another quantization trick.

The experiments on Adult, German Credit, and COMPAS show accuracy-matched baselines hurting VD and CRG while CFQ holds accuracy and improves both metrics across bit widths. That part lands cleanly and gives concrete evidence the issue exists.

The margin-based sufficient condition is the soft spot. It only guarantees transfer when quantization error stays smaller than the decision margin at the recourse point. At 4 bits the worst-case error can exceed typical margins on these datasets, especially near boundaries, so the formal claim does not actually cover the reported regimes. The gains then rest on the empirical objective rather than the analysis. No error bars or per-dataset numbers appear in the abstract, which makes it harder to judge how consistent the improvements are.

This work is aimed at people doing recourse or fairness in low-resource settings. A serious referee should see it because the problem is practical and the metrics are straightforward to check, even if the bound needs tightening or the experiments need more statistical detail.

Referee Report

1 major / 3 minor

Summary. The paper formalizes how quantization can degrade algorithmic recourse (validity, cost, and direction stability of counterfactuals), introduces Validity Drop (VD) and Counterfactual Recourse Gap (CRG) metrics, proposes Counterfactual-Faithful Quantization (CFQ) that jointly optimizes quantizer parameters and mixed-precision bit allocation to enforce target outcomes at teacher recourse points under a global bit budget, supplies a margin-based sufficient condition for recourse transfer under bounded perturbations, and reports experiments on Adult, German Credit, and COMPAS showing that accuracy-matched baselines degrade recourse stability while CFQ preserves accuracy and improves VD/CRG across bit budgets.

Significance. If the empirical gains hold under the stated conditions, the work identifies a practically relevant failure mode for quantized models in recourse-sensitive applications and supplies a targeted mitigation that does not sacrifice predictive accuracy. The introduction of VD and CRG provides concrete, falsifiable ways to measure the phenomenon beyond accuracy, and the margin analysis offers a starting point for theoretical guarantees. The experiments across three standard datasets strengthen the case that the issue is not isolated.

major comments (1)

[Margin-based analysis (abstract and §4)] The sufficient condition for validity/cost/direction stability requires quantization perturbations to remain strictly smaller than the decision margin at each teacher recourse point. For the 4-bit regime tested, worst-case ||q(x)-x|| can exceed typical margins on Adult/German Credit/COMPAS near boundaries or in low-bit regions; when this occurs the formal transfer guarantee does not apply, so the reported VD/CRG improvements rest on the empirical objective rather than the stated analysis. The manuscript should either verify the bound holds on the evaluated points or qualify the analysis as applying only above a minimum bit-width.
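The scale of the concern can be illustrated with a generic uniform quantizer. The sketch assumes values clipped to a symmetric range [-r, r] and round-to-nearest on an evenly spaced grid, which gives a per-element worst-case error of half a step. It is not the paper's quantizer, and it bounds element-wise error rather than the norm in the comment above.

```python
def step_size(r, bits):
    """Spacing of a uniform grid with 2**bits levels over [-r, r]."""
    return 2.0 * r / (2 ** bits - 1)

def quantize_uniform(v, r, bits):
    """Clip to [-r, r] and round to the nearest grid level."""
    step = step_size(r, bits)
    v = max(-r, min(r, v))
    return -r + round((v + r) / step) * step

def worst_case_error(r, bits):
    """Half a step: the largest rounding error for in-range values."""
    return step_size(r, bits) / 2.0

# Empirical maximum error on a dense grid of inputs in [-1, 1].
grid = [-1.0 + i / 500.0 for i in range(1001)]
observed = {b: max(abs(quantize_uniform(v, 1.0, b) - v) for v in grid)
            for b in (4, 8)}
```

Under these assumptions the 4-bit bound is 1/15 of the range half-width against 1/255 at 8 bits, a factor of 17, which is the kind of gap that can push the error past a small decision margin.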

minor comments (3)

[Results] Results section: report dataset-specific numerical values for VD and CRG (with standard deviations over runs) rather than qualitative statements of 'substantial improvement'; include the exact bit-allocation schedules chosen by CFQ versus baselines.
[Method] Notation: define the teacher recourse point generation procedure and the precise form of the CFQ loss (including how the global bit budget is enforced) before the margin analysis; the current abstract-level description leaves the optimization target ambiguous.
[Experiments] Figure clarity: ensure recourse stability plots distinguish between validity drop and cost/direction components; add a table summarizing per-dataset accuracy, VD, and CRG at each bit-width for direct comparison.

Simulated Author's Rebuttal

1 response · 0 unresolved

We thank the referee for the constructive feedback and for highlighting the distinction between the sufficient condition in our margin analysis and the empirical performance of CFQ. We address the major comment below and have prepared revisions to qualify the analysis appropriately while preserving the core contributions.

Referee: Margin-based analysis (abstract and §4): the sufficient condition for validity/cost/direction stability requires quantization perturbations to remain strictly smaller than the decision margin at each teacher recourse point. For the 4-bit regime tested, worst-case ||q(x)-x|| can exceed typical margins on Adult/German Credit/COMPAS near boundaries or in low-bit regions; when this occurs the formal transfer guarantee does not apply, so the reported VD/CRG improvements rest on the empirical objective rather than the stated analysis. The manuscript should either verify the bound holds on the evaluated points or qualify the analysis as applying only above a minimum bit-width.

Authors: We agree that the margin-based result is a sufficient (not necessary) condition and that, for 4-bit quantization, the worst-case perturbation norm can exceed the decision margin at some recourse points near boundaries. In such cases the formal transfer guarantee does not apply, and the reported gains in VD and CRG are attributable to the joint optimization objective that directly enforces the teacher recourse outcome under the global bit budget. We will revise the abstract and §4 to explicitly qualify the analysis as holding only when the quantization perturbation bound is strictly smaller than the margin at each evaluated recourse point. We will also add a short discussion (with illustrative margin-versus-error estimates on the three datasets) clarifying the bit-width regimes in which the sufficient condition is expected to be satisfied. This change does not alter the empirical claims or the practical utility of CFQ.

Revision: yes

Circularity Check

0 steps flagged

No circularity: optimization targets external teacher recourse points and metrics are independently defined

full rationale

The paper defines CFQ as training quantizer parameters and bit allocation to enforce the target outcome specifically at recourse points obtained from a separate full-precision teacher model, under a global bit budget. Validity Drop (VD) and Counterfactual Recourse Gap (CRG) are defined directly from the stability of validity, cost, and direction between teacher and student models. The margin-based analysis supplies only a sufficient condition under an explicit bounded-perturbation assumption and is not used to derive the training objective or the reported empirical gains. No step reduces a claimed result to a self-fit, self-citation chain, or renaming of the input; the central experimental comparison against accuracy-matched baselines therefore remains independent of the method's own outputs.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axiom · 0 invented entities

The central claim rests on the existence of a bounded quantization perturbation and on the availability of reliable teacher recourse points; both are domain assumptions rather than derived quantities.

axioms (1)

Domain assumption: quantization perturbations remain bounded.
Invoked by the margin-based sufficient condition for recourse transfer.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean · washburn_uniqueness_aczel · unclear
Paper passage: A margin-based analysis gives a sufficient condition for recourse transfer under bounded quantization perturbations.

IndisputableMonolith/Foundation/AlphaCoordinateFixation.lean · J_uniquely_calibrated_via_higher_derivative · unclear
Paper passage: Proposition 5.1 (Margin robustness at the recourse point)

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

42 extracted references · 42 canonical work pages · 11 internal anchors

[1]

Arthur Asuncion and David J. Newman. Uci machine learning repository. https://archive. ics.uci.edu/, 2007

work page 2007
[2]

Post-training 4-bit quantization of convolution networks for rapid-deployment, 2019

Ron Banner, Yury Nahshan, Elad Hoffer, and Daniel Soudry. Post-training 4-bit quantization of convolution networks for rapid-deployment, 2019. URL https://arxiv.org/abs/1810. 05723

work page 2019
[3]

Census income (adult) data set

Barry Becker and Ronny Kohavi. Census income (adult) data set. https://archive.ics. uci.edu/dataset/2/adult, 1996. URL https://archive.ics.uci.edu/dataset/2/ adult

work page 1996
[4]

Model multiplicity: Opportunities, concerns, and solutions

Emily Black, Manish Raghavan, and Solon Barocas. Model multiplicity: Opportunities, concerns, and solutions. InProceedings of the 2022 ACM Conference on Fairness, Ac- countability, and Transparency (FAccT), 2022. doi: 10.1145/3531146.3533149. URL https://dl.acm.org/doi/10.1145/3531146.3533149

work page doi:10.1145/3531146.3533149 2022
[5]

PACT: Parameterized clipping activation for quantized neural networks

Jungwook Choi, Zhuo Wang, Swagath Venkataramani, Pierce I-Jen Chuang, Vijayalakshmi Srinivasan, and Kailash Gopalakrishnan. PACT: Parameterized clipping activation for quantized neural networks. InInternational Conference on Learning Representations (ICLR), 2018. URL https://openreview.net/forum?id=By5ugjyCb

work page 2018
[6]

Mahoney, and Kurt Keutzer

Zhen Dong, Zhewei Yao, Danish Arfeen, Amir Gholami, Michael W. Mahoney, and Kurt Keutzer. Hawq: Hessian aware quantization of neural networks with mixed-precision. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019. URL https://openaccess.thecvf.com/content_ICCV_2019/html/Dong_HAWQ_Hessian_ Aware_Quantization_of_Neural...

work page 2019
[7]

Mahoney, and Kurt Keutzer

Zhen Dong, Zhewei Yao, Daiyaan Arfeen, Amir Gholami, Michael W. Mahoney, and Kurt Keutzer. HAWQ-V2: Hessian aware trace-weighted quantization of neural net- works. InAdvances in Neural Information Processing Systems (NeurIPS), 2020. doi: 10.48550/arXiv.1911.03852. URL https://proceedings.neurips.cc/paper_files/ paper/2020/hash/d77c703536718b95308130ff2e5c...

work page doi:10.48550/arxiv.1911.03852 2020
[8]

The accuracy, fairness, and limits of predicting recidivism

Julia Dressel and Hany Farid. The accuracy, fairness, and limits of predicting recidivism. Science Advances, 4(1), 2018. doi: 10.1126/sciadv.aao5580. URL https://www.science. org/doi/10.1126/sciadv.aao5580. 10

work page doi:10.1126/sciadv.aao5580 2018
[9]

Esser, Jeffrey L

Steven K. Esser, Jeffrey L. McKinstry, Deepika Bablani, Rathinakumar Appuswamy, and Dharmendra S. Modha. Learned step size quantization, 2020. URL https://arxiv.org/ abs/1902.08153

work page arXiv 2020
[10]

GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers

Elias Frantar and Dan Alistarh. Gptq: Accurate post-training quantization for generative pre-trained transformers, 2022. URLhttps://arxiv.org/abs/2210.17323

work page internal anchor Pith review Pith/arXiv arXiv 2022
[11]

Mahoney and Kurt Keutzer , year=

Amir Gholami, Sehoon Kim, Zhen Dong, Zhewei Yao, Michael W. Mahoney, and Kurt Keutzer. A survey of quantization methods for efficient neural network inference, 2021. URL https: //arxiv.org/abs/2103.13630

work page arXiv 2021
[12]

Song Han, Huizi Mao, and William J. Dally. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding, 2016. URL https://arxiv. org/abs/1510.00149

work page internal anchor Pith review Pith/arXiv arXiv 2016
[13]

Deep Residual Learning for Image Recognition

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition, 2015. URLhttps://arxiv.org/abs/1512.03385

work page internal anchor Pith review Pith/arXiv arXiv 2015
[14]

Distilling the knowledge in a neural network,

Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. Distilling the knowledge in a neural network,

work page
[15]

URLhttps://arxiv.org/abs/1503.02531

work page internal anchor Pith review Pith/arXiv arXiv
[16]

Statlog (german credit data) data set

Hans Hofmann. Statlog (german credit data) data set. https://archive.ics.uci.edu/ dataset/144/statlog+german+credit+data, 1994

work page 1994
[17]

Quantization and training of neural networks for efficient integer-arithmetic-only inference

Benoit Jacob, Skirmantas Kligys, Bo Chen, Menglong Zhu, Matthew Tang, Andrew Howard, Hartwig Adam, and Dmitry Kalenichenko. Quantization and training of neural networks for efficient integer-arithmetic-only inference. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018. doi: 10.1109/CVPR.2018. 00286. URL https:...

work page doi:10.1109/cvpr.2018 2018
[18]

Towards Realistic Individual Recourse and Actionable Explanations in Black-Box Decision Making Systems

Shalmali Joshi, Oluwasanmi Koyejo, Warut Vijitbenjaronk, Been Kim, and Joydeep Ghosh. Towards realistic individual recourse and actionable explanations in black-box decision making systems, 2019. URLhttps://arxiv.org/abs/1907.09615

work page internal anchor Pith review Pith/arXiv arXiv 2019
[19]

doi:10.1145/3527848 , keywords =

Amir-Hossein Karimi, Gilles Barthe, Bernhard Schölkopf, and Isabel Valera. A survey of algorithmic recourse: Contrastive explanations and consequential recommendations.ACM Comput. Surv., 55(5), 2022. ISSN 0360-0300. URLhttps://doi.org/10.1145/3527848

work page doi:10.1145/3527848 2022
[20]

Auto-Encoding Variational Bayes

Diederik P Kingma and Max Welling. Auto-encoding variational bayes, 2022. URL https: //arxiv.org/abs/1312.6114

work page internal anchor Pith review Pith/arXiv arXiv 2022
[21]

Quantizing deep convolutional networks for efficient inference: A whitepaper

Raghuraman Krishnamoorthi. Quantizing deep convolutional networks for efficient inference: A whitepaper, 2018. URLhttps://arxiv.org/abs/1806.08342

work page internal anchor Pith review Pith/arXiv arXiv 2018
[22]

Lecun, L

Y . Lecun, L. Bottou, Y . Bengio, and P. Haffner. Gradient-based learning applied to document recognition.Proceedings of the IEEE, 86(11):2278–2324, 1998

work page 1998
[23]

Counterfactual explanations and model multiplicity: a relational verification view

Francesco Leofante, Elena Botoeva, and Vineet Rajani. Counterfactual explanations and model multiplicity: a relational verification view. InProceedings of the 20th International Conference on Principles of Knowledge Representation and Reasoning (KR), 2023. URL https://proceedings.kr.org/2023/78/kr2023-0078-leofante-et-al.pdf

work page 2023
[24]

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Ji Lin, Jiaming Tang, Haotian Tang, Shang Yang, Wei-Ming Chen, Wei-Chen Wang, Guangxuan Xiao, Xingyu Dang, Chuang Gan, and Song Han. Awq: Activation-aware weight quantization for llm compression and acceleration, 2024. URLhttps://arxiv.org/abs/2306.00978

work page internal anchor Pith review Pith/arXiv arXiv 2024
[25]

Deep Learning Face Attributes in the Wild

Ziwei Liu, Ping Luo, Xiaogang Wang, and Xiaoou Tang. Deep learning face attributes in the wild, 2015. URLhttps://arxiv.org/abs/1411.7766

work page internal anchor Pith review Pith/arXiv arXiv 2015
[26]

Preserving causal constraints in counterfactual explanations for machine learning classifiers, 2020

Divyat Mahajan, Chenhao Tan, and Amit Sharma. Preserving causal constraints in counterfactual explanations for machine learning classifiers, 2020. URL https://arxiv.org/abs/1912. 03277. 11

work page 2020
[27]

Marx, Flavio du Pin Calmon, and Berk Ustun

Charles T. Marx, Flavio du Pin Calmon, and Berk Ustun. Predictive multiplicity in classification. InProceedings of the 37th International Conference on Machine Learning (ICML), volume 119 ofProceedings of Machine Learning Research, 2020. URL https://proceedings.mlr. press/v119/marx20a.html

work page 2020
[28]

Vera Liao, and Rachel K

Ramaravind K. Mothilal, Amit Sharma, and Chenhao Tan. Explaining machine learning classifiers through diverse counterfactual explanations. InProceedings of the 2020 Conference on Fairness, Accountability, and Transparency, page 607–617. ACM, January 2020. doi: 10.1145/3351095.3372850. URLhttp://dx.doi.org/10.1145/3351095.3372850

work page doi:10.1145/3351095.3372850 2020
[29]

Up or down? adaptive rounding for post-training quantization

Markus Nagel, Rana Ali Amjad, Mart van Baalen, Christos Louizos, and Tijmen Blankevoort. Up or down? adaptive rounding for post-training quantization. InProceedings of the 37th International Conference on Machine Learning (ICML), volume 119 ofProceedings of Machine Learning Research, pages 7197–7206, 2020. URL https://proceedings.mlr.press/ v119/nagel20a.html

work page 2020
[30]

Distributionally robust recourse action

Duy Nguyen, Ngoc Bui, and Viet Anh Nguyen. Distributionally robust recourse action. InInternational Conference on Learning Representations (ICLR), 2023. URL https: //openreview.net/pdf?id=E3ip6qBLF7

work page 2023
[31]

Learning model-agnostic coun- terfactual explanations for tabular data

Martin Pawelczyk, Klaus Broelemann, and Gjergji Kasneci. Learning model-agnostic coun- terfactual explanations for tabular data. InProceedings of The Web Conference 2020, page 3126–3132. ACM, April 2020. doi: 10.1145/3366423.3380087. URL http://dx.doi.org/ 10.1145/3366423.3380087

[32] Martin Pawelczyk, Klaus Broelemann, and Gjergji Kasneci. On counterfactual explanations under predictive multiplicity. In Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence (UAI), volume 124 of Proceedings of Machine Learning Research, pages 809–818, 2020. URL https://proceedings.mlr.press/v124/pawelczyk20a.html

[34] Rafael Poyiadzi, Kacper Sokol, Raul Santos-Rodriguez, Tijl De Bie, and Peter Flach. FACE: Feasible and actionable counterfactual explanations. In Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2020. doi: 10.1145/3375627.3375850. URL https://dl.acm.org/doi/10.1145/3375627.3375850

[35] ProPublica. ProPublica COMPAS analysis. https://github.com/propublica/compas-analysis, 2016

[36] Kaivalya Rawal, Ece Kamar, and Himabindu Lakkaraju. Algorithmic recourse in the wild: Understanding the impact of data and model shifts, 2020. URL https://arxiv.org/abs/2012.11788

[37] Yujun Shen, Jinjin Gu, Xiaoou Tang, and Bolei Zhou. Interpreting the latent space of GANs for semantic face editing, 2020. URL https://arxiv.org/abs/1907.10786

[38] Sohini Upadhyay, Shalmali Joshi, and Himabindu Lakkaraju. Towards robust and reliable algorithmic recourse. In Advances in Neural Information Processing Systems (NeurIPS), 2021. doi: 10.48550/arXiv.2102.13620. URL https://arxiv.org/abs/2102.13620

[39] Berk Ustun, Alexander Spangher, and Yang Liu. Actionable recourse in linear classification. In Proceedings of the Conference on Fairness, Accountability, and Transparency, pages 10–19. ACM, January 2019. doi: 10.1145/3287560.3287566. URL http://dx.doi.org/10.1145/3287560.3287566

[40] Sandra Wachter, Brent Mittelstadt, and Chris Russell. Counterfactual explanations without opening the black box: Automated decisions and the GDPR, 2018. URL https://arxiv.org/abs/1711.00399

[41] Kuan Wang, Zhijian Liu, Yujun Lin, Ji Lin, and Song Han. HAQ: Hardware-aware automated quantization with mixed precision. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019. doi: 10.48550/arXiv.1811.08886. URL https://openaccess.thecvf.com/content_CVPR_2019/papers/Wang_HAQ_Hardware-Aware_Automated_Quantizatio...

[42] Han Xiao, Kashif Rasul, and Roland Vollgraf. Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms, 2017. URL https://arxiv.org/abs/1708.07747

Appendix Table of Contents

A Recourse Optimization and Constraints ...

