Cumulative Meta-Learning from Active Learning Queries for Robustness to Spurious Correlations

Jingxian Wang; Kin Whye Chew

arxiv: 2605.20771 · v1 · pith:ASVQYX3Ynew · submitted 2026-05-20 · 💻 cs.LG

Cumulative Meta-Learning from Active Learning Queries for Robustness to Spurious Correlations

Kin Whye Chew , Jingxian Wang This is my paper

Pith reviewed 2026-05-21 06:25 UTC · model grok-4.3

classification 💻 cs.LG

keywords active learningmeta-learningspurious correlationsrobustnessinductive biasminority group accuracygeneralization

0 comments

The pith

CAML meta-learns a cumulative inductive bias from active-learning queries to reduce reliance on spurious correlations.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper aims to show that simply adding active-learning queries to the labeled set is not enough in deep learning, because their influence gets diluted and models can memorize spurious patterns instead. CAML turns each querying round into a meta-learning task, using the existing labeled data to adapt the model and the new queries to evaluate how well the adaptation focuses on core features. It keeps and refines one inductive bias across all rounds rather than starting fresh each time, which creates interactions between what was learned earlier and what the new queries require. A sympathetic reader would care because spurious correlations make models unreliable on subgroups that matter for fairness and safety, and this method offers a way to make the queried data more influential without collecting extra annotations.

Core claim

CAML casts each active-learning round as a meta-learning task: the current labeled set serves as meta-train data for adaptation, while the newly queried batch serves as meta-test data for evaluating generalization. Unlike conventional meta-learning, which treats tasks as independent and identically distributed, CAML exploits the sequential dependence between active-learning rounds by maintaining a cumulative inductive bias that is progressively refined. Theoretically, this cumulative formulation introduces interaction terms that couple earlier meta-learned inductive biases with later query-induced objectives, capturing dependencies absent from standard meta-learning.

What carries the argument

the cumulative inductive bias that is progressively refined across active-learning rounds and introduces interaction terms coupling earlier meta-learned biases with later query-induced objectives

If this is right

Minority-group accuracy improves across spurious-correlation benchmarks and acquisition strategies.
Gains reach up to 27.8 percent on Dominoes, 29.9 percent on Waterbirds, 14.3 percent on SpuCo, and 24.0 percent on CivilComments.
The influence of informative samples is preserved instead of being diluted by larger labeled sets because the prior is meta-learned.
Sequential dependencies between rounds are captured through interaction terms that standard meta-learning omits.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The cumulative bias could be carried forward into deployment to guide adaptation on new incoming data streams without restarting the meta-learning process.
Combining CAML with group-aware objectives or reweighting might amplify the separation of core features on datasets where queries alone are insufficient.
The approach suggests a path for active learning in non-stationary environments where the nature of spurious correlations changes over time.

Load-bearing premise

The sequential dependence between active-learning rounds can be productively exploited by maintaining a single progressively refined cumulative inductive bias, and that the queried batch functions as effective meta-test data capable of distinguishing core features from spurious ones without additional modeling choices.

What would settle it

Running the same active-learning loops and benchmarks but replacing the cumulative bias update with an independent meta-learning reset each round and finding no loss in minority-group accuracy gains would show the cumulative mechanism is not required.

Figures

Figures reproduced from arXiv: 2605.20771 by Jingxian Wang, Kin Whye Chew.

**Figure 2.** Figure 2: Minority-group test accuracy (%) across active-learning rounds across spurious-correlation [PITH_FULL_IMAGE:figures/full_fig_p008_2.png] view at source ↗

**Figure 3.** Figure 3: Ablation studies characterizing CAML’s minority-group accuracy gains over the Shuffle [PITH_FULL_IMAGE:figures/full_fig_p009_3.png] view at source ↗

**Figure 4.** Figure 4: Additional CIFAR100 results. Standard test accuracy (%) across active-learning rounds on CIFAR100 using BADGE acquisition. CAML improves standard test accuracy over the corresponding shuffle baseline, suggesting that query-based meta-learning can also provide benefits beyond explicitly spurious-correlation benchmarks. 21 [PITH_FULL_IMAGE:figures/full_fig_p021_4.png] view at source ↗

read the original abstract

Spurious correlations in real-world datasets cause machine learning models to rely on irrelevant patterns, undermining reliability, generalization, and fairness. Active learning offers a promising way to address this failure mode by querying informative samples that distinguish core features from spurious ones. However, standard active-learning methods simply append queried examples to the labeled set, effectively updating only the likelihood term. In deep learning regimes, the influence of these informative samples can be diluted by the larger labeled set and memorized by overparameterized models. We propose Cumulative Active Meta-Learning (CAML), an active-learning framework that uses queried examples to meta-learn the prior, or inductive bias, governing how the model adapts. CAML casts each active-learning round as a meta-learning task: the current labeled set serves as meta-train data for adaptation, while the newly queried batch serves as meta-test data for evaluating generalization. Unlike conventional meta-learning, which treats tasks as independent and identically distributed, CAML exploits the sequential dependence between active-learning rounds by maintaining a cumulative inductive bias that is progressively refined. Theoretically, we show that this cumulative formulation introduces interaction terms that couple earlier meta-learned inductive biases with later query-induced objectives, capturing dependencies absent from standard meta-learning. Empirically, CAML improves minority-group accuracy across spurious-correlation benchmarks and acquisition strategies, with gains of up to 27.8% on Dominoes, 29.9% on Waterbirds, 14.3% on SpuCo, and 24.0% on CivilComments.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

CAML reframes active learning rounds as sequential meta-tasks to accumulate an inductive bias, reporting clear minority accuracy gains on spurious benchmarks but leaving open whether the queried batches truly force core-feature preference.

read the letter

The key point is that this paper turns each active learning round into a meta-learning task where the current labeled set acts as meta-train and the new queries as meta-test, then carries forward a single refined inductive bias instead of starting over or just appending samples. The interaction terms are meant to capture how earlier biases shape later adaptations, which is the main novelty over standard active learning or i.i.d. meta-learning setups.

Referee Report

2 major / 2 minor

Summary. The manuscript proposes Cumulative Active Meta-Learning (CAML), an active-learning framework that meta-learns a cumulative inductive bias from queried examples to improve robustness to spurious correlations. Each active-learning round is cast as a meta-learning task with the current labeled set as meta-train data and the newly queried batch as meta-test data; a single progressively refined prior is maintained across rounds rather than treating tasks as i.i.d. The paper derives interaction terms that couple earlier meta-learned biases with later query-induced objectives and reports empirical gains in minority-group accuracy of up to 27.8% on Dominoes, 29.9% on Waterbirds, 14.3% on SpuCo, and 24.0% on CivilComments across multiple acquisition strategies.

Significance. If the central claim holds, the work provides a principled way to exploit sequential dependence in active learning to refine inductive biases against spurious features, addressing the dilution of informative samples in overparameterized regimes. The combination of a cumulative meta-objective with active querying is a natural extension of existing robustness techniques and could influence practical pipelines on benchmarks where spurious correlations are known to degrade minority-group performance.

major comments (2)

[Theoretical Analysis] §3 (Theoretical Analysis): the derivation of the interaction terms shows coupling between successive meta-objectives, but does not establish that the resulting cumulative bias must penalize spurious-feature reliance; because both the meta-train set and the queried meta-test batch are drawn from the same training distribution that exhibits the spurious correlation by construction, it is not immediate that the meta-test loss distinguishes core from spurious features without additional assumptions on the acquisition function or the form of the adaptation.
[Empirical Evaluation] §5 (Empirical Evaluation): the reported minority-group accuracy improvements are presented without ablations that isolate the cumulative aspect from standard (non-cumulative) meta-learning or from simply appending the queried batch; likewise, no variance across random seeds, statistical significance tests, or sensitivity to the inner-loop adaptation steps are provided, making it difficult to assess whether the gains are robust or attributable to the proposed interaction terms.

minor comments (2)

[Method] Notation for the cumulative prior (e.g., how the meta-parameters are updated across rounds) is introduced without an explicit recurrence or pseudocode; a compact algorithm box would clarify the difference from standard MAML-style updates.
[Experiments] The abstract states gains 'across acquisition strategies' but the main text should tabulate per-strategy results (including random sampling) to substantiate the claim that CAML is acquisition-agnostic.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive feedback on our manuscript. We address each of the major comments below, indicating the revisions we plan to make to strengthen the paper.

read point-by-point responses

Referee: [Theoretical Analysis] §3 (Theoretical Analysis): the derivation of the interaction terms shows coupling between successive meta-objectives, but does not establish that the resulting cumulative bias must penalize spurious-feature reliance; because both the meta-train set and the queried meta-test batch are drawn from the same training distribution that exhibits the spurious correlation by construction, it is not immediate that the meta-test loss distinguishes core from spurious features without additional assumptions on the acquisition function or the form of the adaptation.

Authors: We appreciate this observation. The theoretical analysis in §3 derives the interaction terms arising from the cumulative formulation, which couple the meta-learned bias from previous rounds with the current query-induced objective. While the meta-train and meta-test sets are indeed drawn from the same distribution, the key is that the acquisition function selects batches that are informative with respect to distinguishing core and spurious features. We will revise the manuscript to explicitly state the assumptions on the acquisition function (e.g., that it prioritizes examples where spurious correlations are less predictive) and discuss how the meta-test loss on such batches encourages adaptation that penalizes spurious reliance. This will include a new paragraph clarifying the conditions under which the cumulative bias reduces spurious correlations. revision: yes
Referee: [Empirical Evaluation] §5 (Empirical Evaluation): the reported minority-group accuracy improvements are presented without ablations that isolate the cumulative aspect from standard (non-cumulative) meta-learning or from simply appending the queried batch; likewise, no variance across random seeds, statistical significance tests, or sensitivity to the inner-loop adaptation steps are provided, making it difficult to assess whether the gains are robust or attributable to the proposed interaction terms.

Authors: We agree that additional ablations and statistical reporting would strengthen the empirical evaluation. In the revised manuscript, we will add ablations comparing CAML to (i) standard meta-learning without the cumulative aspect and (ii) simply appending the queried batch to the labeled set without meta-learning. We will report mean and standard deviation of minority-group accuracies over 5 random seeds for all experiments. We will also include statistical significance tests (e.g., paired t-tests) comparing CAML to baselines. Finally, we will provide sensitivity analysis to the number of inner-loop adaptation steps, showing results for varying step counts. These additions will help isolate the contribution of the cumulative interaction terms. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation introduces new cumulative terms independently of inputs

full rationale

The paper defines CAML by casting active-learning rounds as sequential meta-tasks where the labeled set is meta-train and the queried batch is meta-test, then maintains a cumulative inductive bias across rounds. The theoretical contribution is the derivation of interaction terms that couple earlier biases to later objectives; this follows directly from the sequential formulation but does not reduce any performance claim to a fitted quantity or self-referential definition. Empirical gains are reported on external public benchmarks (Dominoes, Waterbirds, etc.) rather than on quantities defined inside the method. No self-citation is load-bearing for the central result, no ansatz is smuggled, and no prediction is statistically forced by construction. The framework is therefore self-contained against external validation.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the domain assumption that active-learning queries can serve as meta-test data to refine an inductive bias that generalizes across rounds; no free parameters or invented entities are explicitly introduced in the abstract.

axioms (1)

domain assumption Active-learning rounds exhibit sequential dependence that can be captured by a single cumulative inductive bias rather than independent tasks.
Invoked when the abstract states that CAML 'exploits the sequential dependence between active-learning rounds by maintaining a cumulative inductive bias that is progressively refined.'

pith-pipeline@v0.9.0 · 5801 in / 1379 out tokens · 61518 ms · 2026-05-21T06:25:18.341934+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

CAML casts each active-learning round as a meta-learning task: the current labeled set serves as meta-train data for adaptation, while the newly queried batch serves as meta-test data for evaluating generalization... introduces interaction terms that couple earlier meta-learned inductive biases with later query-induced objectives
IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

theoretically, we show that this cumulative formulation introduces interaction terms that couple earlier meta-learned inductive biases with later query-induced objectives

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

195 extracted references · 195 canonical work pages · 6 internal anchors

[1]

and Gale, William A

Lewis, David D. and Gale, William A. , editor =. A Sequential Algorithm for Training Text Classifiers , booktitle =. 1994 , pages =

work page 1994
[2]

and Ghahramani, Zoubin and Jordan, Michael I

Cohn, David A. and Ghahramani, Zoubin and Jordan, Michael I. , title =. Journal of Artificial Intelligence Research , volume =. 1996 , doi =

work page 1996
[3]

and Ba, Jimmy , title =

Kingma, Diederik P. and Ba, Jimmy , title =. International Conference on Learning Representations , year =

work page
[4]

International Conference on Learning Representations , year =

Lovering, Charles and Jha, Rohan and Linzen, Tal and Pavlick, Ellie , title =. International Conference on Learning Representations , year =

work page
[5]

A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference , author =. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers) , month = jun, year =. doi:10.18653/v1/N18-1101 , url =

work page internal anchor Pith review doi:10.18653/v1/n18-1101 2018
[6]

Companion Proceedings of The 2019 World Wide Web Conference , year =

Borkan, Daniel and Dixon, Lucas and Sorensen, Jeffrey and Thain, Nithum and Vasserman, Lucy , title =. Companion Proceedings of The 2019 World Wide Web Conference , year =. doi:10.1145/3308560.3317593 , url =

work page doi:10.1145/3308560.3317593 2019
[7]

International Conference on Learning Representations , year =

Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds , author =. International Conference on Learning Representations , year =

work page
[8]

Advances in Neural Information Processing Systems , volume =

On Warm-Starting Neural Network Training , author =. Advances in Neural Information Processing Systems , volume =. 2020 , url =

work page 2020
[9]

International Conference on Learning Representations , year =

Active Learning for Convolutional Neural Networks: A Core-Set Approach , author =. International Conference on Learning Representations , year =

work page
[10]

Proceedings of the 34th International Conference on Machine Learning , pages =

Deep Bayesian Active Learning with Image Data , author =. Proceedings of the 34th International Conference on Machine Learning , pages =. 2017 , editor =

work page 2017
[11]

Proceedings of the AAAI Conference on Artificial Intelligence , volume =

Active Learning by Learning , author =. Proceedings of the AAAI Conference on Artificial Intelligence , volume =. 2015 , doi =

work page 2015
[12]

Advances in Neural Information Processing Systems , volume =

Active Learning by Querying Informative and Representative Examples , author =. Advances in Neural Information Processing Systems , volume =. 2010 , publisher =

work page 2010
[13]

Journal of Machine Learning Research , volume =

Online Choice of Active Learning Algorithms , author =. Journal of Machine Learning Research , volume =. 2004 , url =

work page 2004
[14]

Proceedings of the AAAI Conference on Artificial Intelligence , volume =

Active Discriminative Text Representation Learning , author =. Proceedings of the AAAI Conference on Artificial Intelligence , volume =

work page
[15]

Proceedings of the 23rd International Conference on Machine Learning , pages =

Batch Mode Active Learning and Its Application to Medical Image Classification , author =. Proceedings of the 23rd International Conference on Machine Learning , pages =. 2006 , doi =

work page 2006
[16]

International Conference on Learning Representations , year =

Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations , author =. International Conference on Learning Representations , year =

work page
[17]

Proceedings of the 38th International Conference on Machine Learning , pages =

Just Train Twice: Improving Group Robustness without Training Group Information , author =. Proceedings of the 38th International Conference on Machine Learning , pages =. 2021 , editor =

work page 2021
[18]

Proceedings of the 39th International Conference on Machine Learning , pages =

Correct-N-Contrast: A Contrastive Approach for Improving Robustness to Spurious Correlations , author =. Proceedings of the 39th International Conference on Machine Learning , pages =. 2022 , editor =

work page 2022
[19]

Proceedings of the IEEE International Conference on Computer Vision , pages =

Deep Learning Face Attributes in the Wild , author =. Proceedings of the IEEE International Conference on Computer Vision , pages =. 2015 , url =

work page 2015
[20]

Advances in Neural Information Processing Systems , volume =

Learning from Failure: Training Debiased Classifier from Biased Classifier , author =. Advances in Neural Information Processing Systems , volume =. 2020 , publisher =

work page 2020
[21]

International Conference on Learning Representations , year =

Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-Case Generalization , author =. International Conference on Learning Representations , year =

work page
[22]

Proceedings of the First Conference on Causal Learning and Reasoning , pages =

Simple Data Balancing Achieves Competitive Worst-Group-Accuracy , author =. Proceedings of the First Conference on Causal Learning and Reasoning , pages =. 2022 , editor =

work page 2022
[23]

International Conference on Learning Representations , year =

Reinforcement Learning with Unsupervised Auxiliary Tasks , author =. International Conference on Learning Representations , year =

work page
[24]

Proceedings of the AAAI Conference on Artificial Intelligence , volume =

Learning to Generalize: Meta-Learning for Domain Generalization , author =. Proceedings of the AAAI Conference on Artificial Intelligence , volume =. 2018 , doi =

work page 2018
[25]

and Haque, Imran S

Koh, Pang Wei and Sagawa, Shiori and Marklund, Henrik and Xie, Sang Michael and Zhang, Marvin and Balsubramani, Akshay and Hu, Weihua and Yasunaga, Michihiro and Phillips, Richard Lanas and Gao, Irena and Lee, Tony and David, Etienne and Stavness, Ian and Guo, Wei and Earnshaw, Berton A. and Haque, Imran S. and Beery, Sara M. and Leskovec, Jure and Kundaj...

work page 2021
[26]

2019 , publisher =

Devlin, Jacob and Chang, Ming-Wei and Lee, Kenton and Toutanova, Kristina , booktitle =. 2019 , publisher =

work page 2019
[27]

International Conference on Learning Representations , year =

Decoupled Weight Decay Regularization , author =. International Conference on Learning Representations , year =

work page
[28]

Proceedings of the 34th International Conference on Machine Learning , pages =

Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , author =. Proceedings of the 34th International Conference on Machine Learning , pages =. 2017 , editor =

work page 2017
[29]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages =

Deep Residual Learning for Image Recognition , author =. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages =. 2016 , url =

work page 2016
[30]

Proceedings of the European Conference on Computer Vision (ECCV) , pages =

Recognition in Terra Incognita , author =. Proceedings of the European Conference on Computer Vision (ECCV) , pages =. 2018 , url =

work page 2018
[31]

Proceedings of the 35th International Conference on Machine Learning , pages =

Learning to Reweight Examples for Robust Deep Learning , author =. Proceedings of the 35th International Conference on Machine Learning , pages =. 2018 , editor =

work page 2018
[32]

Advances in Neural Information Processing Systems , volume =

Active Learning Helps Pretrained Models Learn the Intended Task , author =. Advances in Neural Information Processing Systems , volume =. 2022 , publisher =

work page 2022
[33]

Advances in Neural Information Processing Systems , volume =

On Feature Learning in the Presence of Spurious Correlations , author =. Advances in Neural Information Processing Systems , volume =. 2022 , publisher =

work page 2022
[34]

2017 , eprint=

Deep Active Learning over the Long Tail , author=. 2017 , eprint=

work page 2017
[35]

2019 , eprint=

Discriminative Active Learning , author=. 2019 , eprint=

work page 2019
[36]

2018 , eprint=

Adversarial Active Learning for Deep Networks: a Margin Based Approach , author=. 2018 , eprint=

work page 2018
[37]

and Genewein, Tim and Nurnberger, Andreas and Kohler, Jan M

Beluch, William H. and Genewein, Tim and Nurnberger, Andreas and Kohler, Jan M. , booktitle=. The Power of Ensembles for Active Learning in Image Classification , year=

work page
[38]

Margin-Based Active Learning for Structured Output Spaces

Roth, Dan and Small, Kevin. Margin-Based Active Learning for Structured Output Spaces. Machine Learning: ECML 2006. 2006

work page 2006
[39]

Cost-Effective Active Learning for Deep Image Classification , year=

Wang, Keze and Zhang, Dongyu and Li, Ya and Zhang, Ruimao and Lin, Liang , journal=. Cost-Effective Active Learning for Deep Image Classification , year=

work page
[40]

A new active labeling method for deep learning , year=

Wang, Dan and Shang, Yi , booktitle=. A new active labeling method for deep learning , year=

work page
[41]

2011 , eprint=

Bayesian Active Learning for Classification and Preference Learning , author=. 2011 , eprint=

work page 2011
[42]

Active Learning Literature Survey , Type =

Burr Settles , Institution =. Active Learning Literature Survey , Type =

work page
[43]

2016 , eprint=

Coresets and Sketches , author=. 2016 , eprint=

work page 2016
[44]

ACM Comput

Liu, Peng and Wang, Lizhe and Ranjan, Rajiv and He, Guojin and Zhao, Lei , title =. ACM Comput. Surv. , month =. 2022 , issue_date =. doi:10.1145/3510414 , abstract =

work page doi:10.1145/3510414 2022
[45]

Theory of Optimal Experimental Design , volume =

Fedorov, Valerii , year =. Theory of Optimal Experimental Design , volume =. Wiley Interdisciplinary Reviews: Computational Statistics , doi =

work page
[46]

2021 , eprint=

A Survey of Deep Active Learning , author=. 2021 , eprint=

work page 2021
[47]

IEEE Transactions on Medical Imaging , year=

Intelligent Labeling Based on Fisher Information for Medical Image Segmentation Using Deep Learning , author=. IEEE Transactions on Medical Imaging , year=

work page
[48]

2016 , eprint=

Active Learning for Speech Recognition: the Power of Gradients , author=. 2016 , eprint=

work page 2016
[49]

Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics , pages =

Deep Active Learning: Unified and Principled Method for Query and Training , author =. Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics , pages =. 2020 , editor =

work page 2020
[50]

Agnostic active learning , journal =

Maria-Florina Balcan and Alina Beygelzimer and John Langford , keywords =. Agnostic active learning , journal =. 2009 , note =. doi:https://doi.org/10.1016/j.jcss.2008.07.003 , url =

work page doi:10.1016/j.jcss.2008.07.003 2009
[51]

Angluin, Dana , title =. Mach. Learn. , month =. 1988 , issue_date =. doi:10.1023/A:1022821128753 , abstract =

work page doi:10.1023/a:1022821128753 1988
[52]

Nature , year=

Functional genomic hypothesis generation and experimentation by a robot scientist , author=. Nature , year=

work page
[53]

Engelson , abstract =

Ido Dagan and Sean P. Engelson , abstract =. Committee-Based Sampling For Training Probabilistic Classifiers , editor =. Machine Learning Proceedings 1995 , publisher =. 1995 , isbn =. doi:https://doi.org/10.1016/B978-1-55860-377-6.50027-X , url =

work page doi:10.1016/b978-1-55860-377-6.50027-x 1995
[54]

Proceedings of the 40th International Conference on Machine Learning , pages =

Why does Throwing Away Data Improve Worst-Group Error? , author =. Proceedings of the 40th International Conference on Machine Learning , pages =. 2023 , editor =

work page 2023
[55]

What is the Effect of Importance Weighting in Deep Learning?

Byrd, Jonathon and Lipton, Zachary C. , keywords =. What is the Effect of Importance Weighting in Deep Learning? , publisher =. 2018 , copyright =. doi:10.48550/ARXIV.1812.03372 , url =

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1812.03372 2018
[56]

Grad -CAM: Visual Explanations from Deep Networks via Gradient -based Localization

Ramprasaath R. Selvaraju and Michael Cogswell and Abhishek Das and Ramakrishna Vedantam and Devi Parikh and Dhruv Batra , title =. doi:10.1007/s11263-019-01228-7 , url =

work page doi:10.1007/s11263-019-01228-7
[57]

Salient imagenet: How to discover spurious features in deep learning?arXiv preprint arXiv:2110.04301, 2021

Singla, Sahil and Feizi, Soheil , keywords =. Salient ImageNet: How to discover spurious features in Deep Learning? , publisher =. 2021 , copyright =. doi:10.48550/ARXIV.2110.04301 , url =

work page doi:10.48550/arxiv.2110.04301 2021
[58]

Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study

Zech, John R and Badgeley, Marcus A and Liu, Manway and Costa, Anthony B and Titano, Joseph J and Oermann, Eric Karl. Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study. PLoS Med

work page
[59]

CUB-200-2011 , DOI=

Wah, Catherine and Branson, Steve and Welinder, Peter and Perona, Pietro and Belongie, Serge , year=. CUB-200-2011 , DOI=

work page 2011
[60]

The Pitfalls of Simplicity Bias in Neural Networks , url =

Shah, Harshay and Tamuly, Kaustav and Raghunathan, Aditi and Jain, Prateek and Netrapalli, Praneeth , booktitle =. The Pitfalls of Simplicity Bias in Neural Networks , url =

work page
[61]

Approximating CNNs with Bag-of-local-Features models works surprisingly well on ImageNet

Wieland Brendel and Matthias Bethge , title =. CoRR , volume =. 2019 , url =. 1904.00760 , timestamp =

work page internal anchor Pith review Pith/arXiv arXiv 2019
[62]

CoRR abs/1811.12231 (2018), http://arxiv.org/ abs/1811.12231

Robert Geirhos and Patricia Rubisch and Claudio Michaelis and Matthias Bethge and Felix A. Wichmann and Wieland Brendel , title =. CoRR , volume =. 2018 , url =. 1811.12231 , timestamp =

work page arXiv 2018
[63]

2018 , eprint=

Auxiliary Tasks in Multi-task Learning , author=. 2018 , eprint=

work page 2018
[64]

Module-Aware Optimization for Auxiliary Learning , url =

Chen, Hong and Wang, Xin and Liu, Yue and Zhou, Yuwei and Guan, Chaoyu and Zhu, Wenwu , booktitle =. Module-Aware Optimization for Auxiliary Learning , url =

work page
[65]

Adaptive Auxiliary Task Weighting for Reinforcement Learning , url =

Lin, Xingyu and Baweja, Harjatin and Kantor, George and Held, David , booktitle =. Adaptive Auxiliary Task Weighting for Reinforcement Learning , url =

work page
[66]

2020 , eprint=

Auxiliary Task Reweighting for Minimum-data Learning , author=. 2020 , eprint=

work page 2020
[67]

2018 , eprint=

Fine-Grained Visual Categorization using Meta-Learning Optimization with Sample Selection of Auxiliary Data , author=. 2018 , eprint=

work page 2018
[68]

2023 , eprint=

Diversify and Disambiguate: Learning From Underspecified Data , author=. 2023 , eprint=

work page 2023
[69]

Invariant Risk Minimization

Invariant Risk Minimization , author =. arXiv preprint arXiv:1907.02893 , year =

work page internal anchor Pith review Pith/arXiv arXiv 1907
[70]

Multi-branch fusion auxiliary learning for the detection of pneumonia from chest X-ray images , journal =

Jia Liu and Jing Qi and Wei Chen and Yongjian Nian , keywords =. Multi-branch fusion auxiliary learning for the detection of pneumonia from chest X-ray images , journal =. 2022 , issn =. doi:https://doi.org/10.1016/j.compbiomed.2022.105732 , url =

work page doi:10.1016/j.compbiomed.2022.105732 2022
[71]

Gradient Starvation:

Mohammad Pezeshki and S. Gradient Starvation:. CoRR , volume =. 2020 , url =. 2011.09468 , timestamp =

work page arXiv 2020
[72]

2024 , eprint=

Spurious Correlations in Machine Learning: A Survey , author=. 2024 , eprint=

work page 2024
[73]

Shortcut Learning in Deep Neural Networks , journal =

Robert Geirhos and J. Shortcut Learning in Deep Neural Networks , journal =. 2020 , url =. 2004.07780 , timestamp =

work page arXiv 2020
[74]

Journal of Machine Learning Research , volume =

Underspecification Presents Challenges for Credibility in Modern Machine Learning , author =. Journal of Machine Learning Research , volume =. 2022 , url =

work page 2022
[75]

2022 , eprint=

MaskTune: Mitigating Spurious Correlations by Forcing to Explore , author=. 2022 , eprint=

work page 2022
[76]

Proceedings of the 37th International Conference on Machine Learning , pages =

Invariant Risk Minimization Games , author =. Proceedings of the 37th International Conference on Machine Learning , pages =. 2020 , editor =

work page 2020
[77]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , month =

Lin, Yong and Dong, Hanze and Wang, Hao and Zhang, Tong , title =. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , month =. 2022 , pages =

work page 2022
[78]

Proceedings of the 39th International Conference on Machine Learning , pages =

Sparse Invariant Risk Minimization , author =. Proceedings of the 39th International Conference on Machine Learning , pages =. 2022 , editor =

work page 2022
[79]

2021 , eprint=

A Too-Good-to-be-True Prior to Reduce Shortcut Reliance , author=. 2021 , eprint=

work page 2021
[80]

2020 , eprint=

Towards Debiasing NLU Models from Unknown Biases , author=. 2020 , eprint=

work page 2020

Showing first 80 references.

[1] [1]

and Gale, William A

Lewis, David D. and Gale, William A. , editor =. A Sequential Algorithm for Training Text Classifiers , booktitle =. 1994 , pages =

work page 1994

[2] [2]

and Ghahramani, Zoubin and Jordan, Michael I

Cohn, David A. and Ghahramani, Zoubin and Jordan, Michael I. , title =. Journal of Artificial Intelligence Research , volume =. 1996 , doi =

work page 1996

[3] [3]

and Ba, Jimmy , title =

Kingma, Diederik P. and Ba, Jimmy , title =. International Conference on Learning Representations , year =

work page

[4] [4]

International Conference on Learning Representations , year =

Lovering, Charles and Jha, Rohan and Linzen, Tal and Pavlick, Ellie , title =. International Conference on Learning Representations , year =

work page

[5] [5]

A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference , author =. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers) , month = jun, year =. doi:10.18653/v1/N18-1101 , url =

work page internal anchor Pith review doi:10.18653/v1/n18-1101 2018

[6] [6]

Companion Proceedings of The 2019 World Wide Web Conference , year =

Borkan, Daniel and Dixon, Lucas and Sorensen, Jeffrey and Thain, Nithum and Vasserman, Lucy , title =. Companion Proceedings of The 2019 World Wide Web Conference , year =. doi:10.1145/3308560.3317593 , url =

work page doi:10.1145/3308560.3317593 2019

[7] [7]

International Conference on Learning Representations , year =

Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds , author =. International Conference on Learning Representations , year =

work page

[8] [8]

Advances in Neural Information Processing Systems , volume =

On Warm-Starting Neural Network Training , author =. Advances in Neural Information Processing Systems , volume =. 2020 , url =

work page 2020

[9] [9]

International Conference on Learning Representations , year =

Active Learning for Convolutional Neural Networks: A Core-Set Approach , author =. International Conference on Learning Representations , year =

work page

[10] [10]

Proceedings of the 34th International Conference on Machine Learning , pages =

Deep Bayesian Active Learning with Image Data , author =. Proceedings of the 34th International Conference on Machine Learning , pages =. 2017 , editor =

work page 2017

[11] [11]

Proceedings of the AAAI Conference on Artificial Intelligence , volume =

Active Learning by Learning , author =. Proceedings of the AAAI Conference on Artificial Intelligence , volume =. 2015 , doi =

work page 2015

[12] [12]

Advances in Neural Information Processing Systems , volume =

Active Learning by Querying Informative and Representative Examples , author =. Advances in Neural Information Processing Systems , volume =. 2010 , publisher =

work page 2010

[13] [13]

Journal of Machine Learning Research , volume =

Online Choice of Active Learning Algorithms , author =. Journal of Machine Learning Research , volume =. 2004 , url =

work page 2004

[14] [14]

Proceedings of the AAAI Conference on Artificial Intelligence , volume =

Active Discriminative Text Representation Learning , author =. Proceedings of the AAAI Conference on Artificial Intelligence , volume =

work page

[15] [15]

Proceedings of the 23rd International Conference on Machine Learning , pages =

Batch Mode Active Learning and Its Application to Medical Image Classification , author =. Proceedings of the 23rd International Conference on Machine Learning , pages =. 2006 , doi =

work page 2006

[16] [16]

International Conference on Learning Representations , year =

Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations , author =. International Conference on Learning Representations , year =

work page

[17] [17]

Proceedings of the 38th International Conference on Machine Learning , pages =

Just Train Twice: Improving Group Robustness without Training Group Information , author =. Proceedings of the 38th International Conference on Machine Learning , pages =. 2021 , editor =

work page 2021

[18] [18]

Proceedings of the 39th International Conference on Machine Learning , pages =

Correct-N-Contrast: A Contrastive Approach for Improving Robustness to Spurious Correlations , author =. Proceedings of the 39th International Conference on Machine Learning , pages =. 2022 , editor =

work page 2022

[19] [19]

Proceedings of the IEEE International Conference on Computer Vision , pages =

Deep Learning Face Attributes in the Wild , author =. Proceedings of the IEEE International Conference on Computer Vision , pages =. 2015 , url =

work page 2015

[20] [20]

Advances in Neural Information Processing Systems , volume =

Learning from Failure: Training Debiased Classifier from Biased Classifier , author =. Advances in Neural Information Processing Systems , volume =. 2020 , publisher =

work page 2020

[21] [21]

International Conference on Learning Representations , year =

Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-Case Generalization , author =. International Conference on Learning Representations , year =

work page

[22] [22]

Proceedings of the First Conference on Causal Learning and Reasoning , pages =

Simple Data Balancing Achieves Competitive Worst-Group-Accuracy , author =. Proceedings of the First Conference on Causal Learning and Reasoning , pages =. 2022 , editor =

work page 2022

[23] [23]

International Conference on Learning Representations , year =

Reinforcement Learning with Unsupervised Auxiliary Tasks , author =. International Conference on Learning Representations , year =

work page

[24] [24]

Proceedings of the AAAI Conference on Artificial Intelligence , volume =

Learning to Generalize: Meta-Learning for Domain Generalization , author =. Proceedings of the AAAI Conference on Artificial Intelligence , volume =. 2018 , doi =

work page 2018

[25] [25]

and Haque, Imran S

Koh, Pang Wei and Sagawa, Shiori and Marklund, Henrik and Xie, Sang Michael and Zhang, Marvin and Balsubramani, Akshay and Hu, Weihua and Yasunaga, Michihiro and Phillips, Richard Lanas and Gao, Irena and Lee, Tony and David, Etienne and Stavness, Ian and Guo, Wei and Earnshaw, Berton A. and Haque, Imran S. and Beery, Sara M. and Leskovec, Jure and Kundaj...

work page 2021

[26] [26]

2019 , publisher =

Devlin, Jacob and Chang, Ming-Wei and Lee, Kenton and Toutanova, Kristina , booktitle =. 2019 , publisher =

work page 2019

[27] [27]

International Conference on Learning Representations , year =

Decoupled Weight Decay Regularization , author =. International Conference on Learning Representations , year =

work page

[28] [28]

Proceedings of the 34th International Conference on Machine Learning , pages =

Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , author =. Proceedings of the 34th International Conference on Machine Learning , pages =. 2017 , editor =

work page 2017

[29] [29]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages =

Deep Residual Learning for Image Recognition , author =. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages =. 2016 , url =

work page 2016

[30] [30]

Proceedings of the European Conference on Computer Vision (ECCV) , pages =

Recognition in Terra Incognita , author =. Proceedings of the European Conference on Computer Vision (ECCV) , pages =. 2018 , url =

work page 2018

[31] [31]

Proceedings of the 35th International Conference on Machine Learning , pages =

Learning to Reweight Examples for Robust Deep Learning , author =. Proceedings of the 35th International Conference on Machine Learning , pages =. 2018 , editor =

work page 2018

[32] [32]

Advances in Neural Information Processing Systems , volume =

Active Learning Helps Pretrained Models Learn the Intended Task , author =. Advances in Neural Information Processing Systems , volume =. 2022 , publisher =

work page 2022

[33] [33]

Advances in Neural Information Processing Systems , volume =

On Feature Learning in the Presence of Spurious Correlations , author =. Advances in Neural Information Processing Systems , volume =. 2022 , publisher =

work page 2022

[34] [34]

2017 , eprint=

Deep Active Learning over the Long Tail , author=. 2017 , eprint=

work page 2017

[35] [35]

2019 , eprint=

Discriminative Active Learning , author=. 2019 , eprint=

work page 2019

[36] [36]

2018 , eprint=

Adversarial Active Learning for Deep Networks: a Margin Based Approach , author=. 2018 , eprint=

work page 2018

[37] [37]

and Genewein, Tim and Nurnberger, Andreas and Kohler, Jan M

Beluch, William H. and Genewein, Tim and Nurnberger, Andreas and Kohler, Jan M. , booktitle=. The Power of Ensembles for Active Learning in Image Classification , year=

work page

[38] [38]

Margin-Based Active Learning for Structured Output Spaces

Roth, Dan and Small, Kevin. Margin-Based Active Learning for Structured Output Spaces. Machine Learning: ECML 2006. 2006

work page 2006

[39] [39]

Cost-Effective Active Learning for Deep Image Classification , year=

Wang, Keze and Zhang, Dongyu and Li, Ya and Zhang, Ruimao and Lin, Liang , journal=. Cost-Effective Active Learning for Deep Image Classification , year=

work page

[40] [40]

A new active labeling method for deep learning , year=

Wang, Dan and Shang, Yi , booktitle=. A new active labeling method for deep learning , year=

work page

[41] [41]

2011 , eprint=

Bayesian Active Learning for Classification and Preference Learning , author=. 2011 , eprint=

work page 2011

[42] [42]

Active Learning Literature Survey , Type =

Burr Settles , Institution =. Active Learning Literature Survey , Type =

work page

[43] [43]

2016 , eprint=

Coresets and Sketches , author=. 2016 , eprint=

work page 2016

[44] [44]

ACM Comput

Liu, Peng and Wang, Lizhe and Ranjan, Rajiv and He, Guojin and Zhao, Lei , title =. ACM Comput. Surv. , month =. 2022 , issue_date =. doi:10.1145/3510414 , abstract =

work page doi:10.1145/3510414 2022

[45] [45]

Theory of Optimal Experimental Design , volume =

Fedorov, Valerii , year =. Theory of Optimal Experimental Design , volume =. Wiley Interdisciplinary Reviews: Computational Statistics , doi =

work page

[46] [46]

2021 , eprint=

A Survey of Deep Active Learning , author=. 2021 , eprint=

work page 2021

[47] [47]

IEEE Transactions on Medical Imaging , year=

Intelligent Labeling Based on Fisher Information for Medical Image Segmentation Using Deep Learning , author=. IEEE Transactions on Medical Imaging , year=

work page

[48] [48]

2016 , eprint=

Active Learning for Speech Recognition: the Power of Gradients , author=. 2016 , eprint=

work page 2016

[49] [49]

Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics , pages =

Deep Active Learning: Unified and Principled Method for Query and Training , author =. Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics , pages =. 2020 , editor =

work page 2020

[50] [50]

Agnostic active learning , journal =

Maria-Florina Balcan and Alina Beygelzimer and John Langford , keywords =. Agnostic active learning , journal =. 2009 , note =. doi:https://doi.org/10.1016/j.jcss.2008.07.003 , url =

work page doi:10.1016/j.jcss.2008.07.003 2009

[51] [51]

Angluin, Dana , title =. Mach. Learn. , month =. 1988 , issue_date =. doi:10.1023/A:1022821128753 , abstract =

work page doi:10.1023/a:1022821128753 1988

[52] [52]

Nature , year=

Functional genomic hypothesis generation and experimentation by a robot scientist , author=. Nature , year=

work page

[53] [53]

Engelson , abstract =

Ido Dagan and Sean P. Engelson , abstract =. Committee-Based Sampling For Training Probabilistic Classifiers , editor =. Machine Learning Proceedings 1995 , publisher =. 1995 , isbn =. doi:https://doi.org/10.1016/B978-1-55860-377-6.50027-X , url =

work page doi:10.1016/b978-1-55860-377-6.50027-x 1995

[54] [54]

Proceedings of the 40th International Conference on Machine Learning , pages =

Why does Throwing Away Data Improve Worst-Group Error? , author =. Proceedings of the 40th International Conference on Machine Learning , pages =. 2023 , editor =

work page 2023

[55] [55]

What is the Effect of Importance Weighting in Deep Learning?

Byrd, Jonathon and Lipton, Zachary C. , keywords =. What is the Effect of Importance Weighting in Deep Learning? , publisher =. 2018 , copyright =. doi:10.48550/ARXIV.1812.03372 , url =

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1812.03372 2018

[56] [56]

Grad -CAM: Visual Explanations from Deep Networks via Gradient -based Localization

Ramprasaath R. Selvaraju and Michael Cogswell and Abhishek Das and Ramakrishna Vedantam and Devi Parikh and Dhruv Batra , title =. doi:10.1007/s11263-019-01228-7 , url =

work page doi:10.1007/s11263-019-01228-7

[57] [57]

Salient imagenet: How to discover spurious features in deep learning?arXiv preprint arXiv:2110.04301, 2021

Singla, Sahil and Feizi, Soheil , keywords =. Salient ImageNet: How to discover spurious features in Deep Learning? , publisher =. 2021 , copyright =. doi:10.48550/ARXIV.2110.04301 , url =

work page doi:10.48550/arxiv.2110.04301 2021

[58] [58]

Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study

Zech, John R and Badgeley, Marcus A and Liu, Manway and Costa, Anthony B and Titano, Joseph J and Oermann, Eric Karl. Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study. PLoS Med

work page

[59] [59]

CUB-200-2011 , DOI=

Wah, Catherine and Branson, Steve and Welinder, Peter and Perona, Pietro and Belongie, Serge , year=. CUB-200-2011 , DOI=

work page 2011

[60] [60]

The Pitfalls of Simplicity Bias in Neural Networks , url =

Shah, Harshay and Tamuly, Kaustav and Raghunathan, Aditi and Jain, Prateek and Netrapalli, Praneeth , booktitle =. The Pitfalls of Simplicity Bias in Neural Networks , url =

work page

[61] [61]

Approximating CNNs with Bag-of-local-Features models works surprisingly well on ImageNet

Wieland Brendel and Matthias Bethge , title =. CoRR , volume =. 2019 , url =. 1904.00760 , timestamp =

work page internal anchor Pith review Pith/arXiv arXiv 2019

[62] [62]

CoRR abs/1811.12231 (2018), http://arxiv.org/ abs/1811.12231

Robert Geirhos and Patricia Rubisch and Claudio Michaelis and Matthias Bethge and Felix A. Wichmann and Wieland Brendel , title =. CoRR , volume =. 2018 , url =. 1811.12231 , timestamp =

work page arXiv 2018

[63] [63]

2018 , eprint=

Auxiliary Tasks in Multi-task Learning , author=. 2018 , eprint=

work page 2018

[64] [64]

Module-Aware Optimization for Auxiliary Learning , url =

Chen, Hong and Wang, Xin and Liu, Yue and Zhou, Yuwei and Guan, Chaoyu and Zhu, Wenwu , booktitle =. Module-Aware Optimization for Auxiliary Learning , url =

work page

[65] [65]

Adaptive Auxiliary Task Weighting for Reinforcement Learning , url =

Lin, Xingyu and Baweja, Harjatin and Kantor, George and Held, David , booktitle =. Adaptive Auxiliary Task Weighting for Reinforcement Learning , url =

work page

[66] [66]

2020 , eprint=

Auxiliary Task Reweighting for Minimum-data Learning , author=. 2020 , eprint=

work page 2020

[67] [67]

2018 , eprint=

Fine-Grained Visual Categorization using Meta-Learning Optimization with Sample Selection of Auxiliary Data , author=. 2018 , eprint=

work page 2018

[68] [68]

2023 , eprint=

Diversify and Disambiguate: Learning From Underspecified Data , author=. 2023 , eprint=

work page 2023

[69] [69]

Invariant Risk Minimization

Invariant Risk Minimization , author =. arXiv preprint arXiv:1907.02893 , year =

work page internal anchor Pith review Pith/arXiv arXiv 1907

[70] [70]

Multi-branch fusion auxiliary learning for the detection of pneumonia from chest X-ray images , journal =

Jia Liu and Jing Qi and Wei Chen and Yongjian Nian , keywords =. Multi-branch fusion auxiliary learning for the detection of pneumonia from chest X-ray images , journal =. 2022 , issn =. doi:https://doi.org/10.1016/j.compbiomed.2022.105732 , url =

work page doi:10.1016/j.compbiomed.2022.105732 2022

[71] [71]

Gradient Starvation:

Mohammad Pezeshki and S. Gradient Starvation:. CoRR , volume =. 2020 , url =. 2011.09468 , timestamp =

work page arXiv 2020

[72] [72]

2024 , eprint=

Spurious Correlations in Machine Learning: A Survey , author=. 2024 , eprint=

work page 2024

[73] [73]

Shortcut Learning in Deep Neural Networks , journal =

Robert Geirhos and J. Shortcut Learning in Deep Neural Networks , journal =. 2020 , url =. 2004.07780 , timestamp =

work page arXiv 2020

[74] [74]

Journal of Machine Learning Research , volume =

Underspecification Presents Challenges for Credibility in Modern Machine Learning , author =. Journal of Machine Learning Research , volume =. 2022 , url =

work page 2022

[75] [75]

2022 , eprint=

MaskTune: Mitigating Spurious Correlations by Forcing to Explore , author=. 2022 , eprint=

work page 2022

[76] [76]

Proceedings of the 37th International Conference on Machine Learning , pages =

Invariant Risk Minimization Games , author =. Proceedings of the 37th International Conference on Machine Learning , pages =. 2020 , editor =

work page 2020

[77] [77]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , month =

Lin, Yong and Dong, Hanze and Wang, Hao and Zhang, Tong , title =. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , month =. 2022 , pages =

work page 2022

[78] [78]

Proceedings of the 39th International Conference on Machine Learning , pages =

Sparse Invariant Risk Minimization , author =. Proceedings of the 39th International Conference on Machine Learning , pages =. 2022 , editor =

work page 2022

[79] [79]

2021 , eprint=

A Too-Good-to-be-True Prior to Reduce Shortcut Reliance , author=. 2021 , eprint=

work page 2021

[80] [80]

2020 , eprint=

Towards Debiasing NLU Models from Unknown Biases , author=. 2020 , eprint=

work page 2020