Co-occurring associated retained concepts in Diffusion Unlearning

Georu Lee; Hoki Kim; Jinseong Park; Miso Kim; Woojin Lee; Yunji Kim

arxiv: 2606.24192 · v1 · pith:7W5R57OA · submitted 2026-06-23 · cs.CV · cs.AI · cs.CL

Miso Kim, Georu Lee, Yunji Kim, Hoki Kim, Jinseong Park, Woojin Lee

Pith reviewed 2026-06-26 00:49 UTC · model grok-4.3

classification cs.CV · cs.AI · cs.CL

keywords diffusion models · concept unlearning · co-occurring concepts · CARE score · ReCARE · concept erasure · image generation · generative models

The pith

ReCARE erases only target concepts in diffusion models by constructing and using a CARE-set of benign co-occurring tokens.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Diffusion unlearning often removes not only harmful targets such as nudity but also related benign concepts like the presence of people. The paper defines these suppressed elements as CARE and introduces a CARE score to measure their preservation directly. ReCARE builds an automatic vocabulary of benign tokens from the target images and incorporates it into training to protect those associations. Experiments on nudity, Van Gogh style, and Tench object removal show improved balance among erasure strength, overall model utility, and CARE retention compared with prior methods.

Core claim

ReCARE automatically constructs the CARE-set, a curated vocabulary of benign co-occurring tokens extracted from target images, and leverages this vocabulary during training for stable unlearning that safeguards CARE while erasing only the target concept.

What carries the argument

The CARE-set, a vocabulary of benign co-occurring tokens extracted from target images, used to guide training so that only the target concept is removed.
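
The idea of a vocabulary of co-occurring tokens can be illustrated with a minimal sketch. It assumes that text captions of the target images are already available and that the target tokens are given by hand; the function name `build_care_set`, the `min_fraction` threshold, and the toy captions are all hypothetical. This is not the paper's pipeline, which according to the Figure 6 caption involves a clustering step over candidate tokens, and a frequency threshold alone does not guarantee that a token is benign.

```python
from collections import Counter


def build_care_set(captions, target_tokens, min_fraction=0.5):
    """Return tokens that appear in at least `min_fraction` of the captions,
    excluding the target tokens themselves (hypothetical helper)."""
    n = len(captions)
    doc_freq = Counter()
    for caption in captions:
        # Count each token once per caption (document frequency).
        doc_freq.update(set(caption.lower().split()))
    blocked = {t.lower() for t in target_tokens}
    return sorted(
        tok
        for tok, count in doc_freq.items()
        if tok not in blocked and count / n >= min_fraction
    )


# Toy captions, invented for illustration only.
captions = [
    "nude person standing indoors",
    "nude person on beach",
    "person sitting nude",
    "nude statue in garden",
]
care_set = build_care_set(captions, target_tokens=["nude"], min_fraction=0.5)
print(care_set)
```

In this toy input, "person" is the only non-target token that clears the threshold, which mirrors the nudity/person example in the abstract.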

Load-bearing premise

The automatically constructed CARE-set from target images reliably identifies only benign co-occurring concepts that should be preserved without introducing new biases or missing critical associations.

What would settle it

Running ReCARE on a new target concept such as violence and measuring whether the CARE score on held-out co-occurring tokens drops below the scores achieved by existing unlearning baselines.

Figures

Figures reproduced from arXiv: 2606.24192 by Georu Lee, Hoki Kim, Jinseong Park, Miso Kim, Woojin Lee, Yunji Kim.

**Figure 1.** Preserving Co-occurring Concepts in Nudity Unlearning. After unlearning nudity, we present generations from two prompts (“A nude person” and “A person”). Baseline methods (AdvUnlearn, AGE, or STEREO) suppress benign co-occurring concepts person, failing to generate person images. In contrast, our proposed ReCARE preserves those concepts while erasing nudity.

**Figure 2.** Given a removed target, we can extract the co-occurring words from generated images and …

**Figure 3.** Qualitative failure cases in existing …

**Figure 5.** Correlation between CARE score and human-annotated ground truth across different targets and methods (Pearson r = 0.905). Existing evaluation metrics (e.g., FID, CLIP score) fail to measure whether CARE concepts are preserved. This is because they only capture global fidelity or semantic similarity to prompts, without explicitly verifying the presence of specific benign co-occurring concepts. Therefore, w…

**Figure 6.** Overview of ReCARE. (1) Global Clustering groups candidate tokens on the t-SNE pro…

**Figure 7.** Radar chart of Nudity unlearning. Nudity unlearning. Our method achieves the highest RATIO (See …

**Figure 8.** Radar chart of Tench object unlearning. Van Gogh style unlearning. Our method achieves the highest RATIO, reflecting the most reliable trade-off among robustness, utility, and CARE preservation …

**Figure 9.** Qualitative results on three unlearning tasks (Nudity, Tench, and Van Gogh). For each task, we show results under CCE attacks and CARE prompts. Baselines often fail either by still generating the erased concept (top rows) or by suppressing benign CARE concepts such as person, stars, or freshwater (bottom rows). Ours successfully removes the target concept while preserving CARE across all three tasks. Full …

**Figure 10.** Qualitative results on the Nudity unlearning task. Panel labels (as extracted): Attack, CARE; SD v1.4, ESD, FMN, AC, UCE, SPM, MACE, RECE, AdvUnlearn, AGE, STEREO, Ours.

**Figure 11.** Qualitative results on the Van Gogh style unlearning task. Panel labels (as extracted): CARE, Target, Attack; SD v1.4, ESD, FMN, SalUn, EraseDiff, SPM, AdvUnlearn, AGE, STEREO, Ours.

**Figure 12.** Quantitative results on the Tench object unlearning task. Axes: Defense, Utility, CARE (scale 0.0 to 1.0). Legend: OURS, STEREO, AdvUnlearn, AGE.

**Figure 13.** Radar chart of Van Gogh style unlearning.

**Figure 14.** Qualitative results of other artists’ styles (Picasso, Monet, Matisse) from the …

**Figure 15.** Prompt examples for CARE score evaluation.

**Figure 16.** Qualitative results for mixed-concept prompts constructed in the …
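
The Figure 5 caption reports a Pearson correlation (r = 0.905) between the CARE score and human annotations. As a reminder of what that coefficient measures, here is a minimal sketch of the standard Pearson formula. The helper name `pearson_r` and the two score lists are placeholders invented for illustration; they are not data from the paper.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation: covariance divided by the product of standard deviations."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Placeholder values, NOT taken from the paper.
metric_scores = [0.1, 0.4, 0.5, 0.9]
human_scores = [0.2, 0.3, 0.6, 0.8]
r = pearson_r(metric_scores, human_scores)
print(f"r = {r:.3f}")
```

A value close to 1 means the automatic score ranks and spaces the cases much like the human labels do; it says nothing by itself about whether the labelled concepts were the right ones to preserve.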

Original abstract

Unlearning has emerged as a key technique to mitigate harmful content generation in diffusion models. However, existing methods often remove not only the target concept, but also benign co-occurring concepts. As illustrated in Fig.1, unlearning nudity can unintentionally suppress the concept of person, preventing a model from generating images with person. We define these undesirably suppressed co-occurring concepts that must be preserved CARE (Co-occurring Associated REtained concepts). Then, we introduce the CARE score, a general metric that directly quantifies their preservation across unlearning tasks. With this foundation, we propose ReCARE (Robust erasure for CARE), a framework that explicitly safeguards CARE while erasing only the target concept. ReCARE automatically constructs the CARE-set, a curated vocabulary of benign co-occurring tokens extracted from target images, and leverages this vocabulary during training for stable unlearning. Extensive experiments across various target concepts (Nudity, Van Gogh style, and Tench object) demonstrate that ReCARE achieves overall state-of-the-art performance in balancing robust concept erasure, overall utility, and CARE preservation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note

private letter to a colleague

ReCARE gives a direct way to protect benign co-occurring concepts during unlearning, but the abstract reports no validation of its automatic CARE-set extraction.

ReCARE targets the real issue where unlearning a target like nudity also wipes out related concepts such as person. The paper defines CARE as those retained co-occurring concepts, introduces a score to measure their preservation, and builds ReCARE around an automatically extracted vocabulary of benign tokens pulled from the target images themselves.

What stands out as new is the CARE metric and the ReCARE training step that uses this set to guide erasure. The experiments cover three cases—nudity, Van Gogh style, and Tench object—and report better overall balance between strong erasure, model utility, and CARE preservation than prior methods.

The framing is useful and the problem it names is genuine. Existing unlearning work often ignores this side effect, so naming it and trying to fix it with a concrete mechanism is a step forward.

The soft spot is the CARE-set construction. The method depends on the extracted tokens being reliably benign and complete, yet the abstract gives no details on filtering rules, human checks, or sensitivity tests. If the set pulls in concepts that should be erased or misses key associations, both the metric and the SOTA claims become unreliable. The lack of reported baselines, statistical tests, or ablation on the extraction step makes the performance numbers hard to assess from the given summary.

This is for researchers working on diffusion model safety and concept unlearning. A reader already following that literature would see the practical angle and could build on the idea.

It deserves peer review. The core proposal is clear enough and the experiments are broad enough to merit referee input, even if the validation of the set needs more work.

Referee Report

2 major / 1 minor

Summary. The manuscript defines CARE (Co-occurring Associated REtained concepts) as benign co-occurring concepts that unlearning methods should preserve rather than suppress. It introduces the CARE score as a metric to quantify preservation of these concepts and proposes ReCARE, a framework that automatically constructs a CARE-set vocabulary of benign co-occurring tokens from target images and uses it during training to erase only the target concept. Experiments on three targets (Nudity, Van Gogh style, Tench object) claim state-of-the-art performance balancing robust concept erasure, overall utility, and CARE preservation.

Significance. If the CARE-set construction proves reliable and the experimental comparisons are rigorous, the work would usefully highlight and mitigate an under-addressed side-effect of diffusion unlearning. The CARE score offers a concrete, general-purpose evaluation tool that could be adopted by other methods. The automatic extraction approach is novel but its validity is central to all downstream claims.

major comments (2)

[Abstract / Method] Abstract and Method (CARE-set construction): The central SOTA claim rests on the automatically extracted CARE-set containing only concepts that should be retained. No validation (human labeling of associations, sensitivity analysis on extraction rules, or completeness checks) is described, making both the CARE score and reported superiority unreliable if the set includes concepts that ought to be erased or omits critical ones.
[Abstract] Abstract: The state-of-the-art claim on three concepts provides no details on baselines, exact metrics, statistical significance testing, or potential post-hoc selection of results, preventing verification of the balancing performance.

minor comments (1)

[Introduction / Fig. 1] Fig. 1 is referenced but its caption and the surrounding text could more explicitly link the illustrated failure mode to the quantitative CARE score.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address the two major comments below and will revise the manuscript to strengthen the presentation of the CARE-set construction and the abstract claims.

Referee: [Abstract / Method] Abstract and Method (CARE-set construction): The central SOTA claim rests on the automatically extracted CARE-set containing only concepts that should be retained. No validation (human labeling of associations, sensitivity analysis on extraction rules, or completeness checks) is described, making both the CARE score and reported superiority unreliable if the set includes concepts that ought to be erased or omits critical ones.

Authors: We agree that explicit validation of the CARE-set is important for reliability. The current manuscript describes the automatic extraction as selecting tokens that co-occur with the target concept in the input images (Section 3.2), with the assumption that these are benign co-occurring concepts to be retained. However, no human validation, sensitivity analysis on extraction thresholds, or completeness checks are reported. In the revision we will add (i) a detailed description of the extraction rules and hyperparameters, (ii) sensitivity analysis varying vocabulary size and co-occurrence thresholds, and (iii) a small-scale human study confirming that extracted tokens are indeed benign and should be preserved for the three evaluated targets.

revision: yes

Referee: [Abstract] Abstract: The state-of-the-art claim on three concepts provides no details on baselines, exact metrics, statistical significance testing, or potential post-hoc selection of results, preventing verification of the balancing performance.

Authors: The abstract is intentionally concise; the full experimental section (Section 4) reports the baselines (ESD, UCE, FMN, etc.), exact metrics (erasure success rate, FID on MS-COCO, CARE score), and results across the three targets. Statistical significance is assessed via multiple random seeds and reported with standard deviations, but these details are not summarized in the abstract. We will revise the abstract to include the key quantitative improvements and a brief statement on the evaluation protocol, while retaining the overall length constraint.

revision: partial
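
The simulated response refers to results reported over multiple random seeds with standard deviations. A minimal sketch of that kind of summary follows, using Python's standard `statistics` module. The helper name `summarize_over_seeds` and the per-seed scores are placeholders invented for illustration, not results from the paper.

```python
import statistics


def summarize_over_seeds(scores_by_seed):
    """Return (mean, sample standard deviation) of one metric across seeds."""
    values = list(scores_by_seed.values())
    if len(values) < 2:
        raise ValueError("need at least two seeds to estimate a standard deviation")
    return statistics.mean(values), statistics.stdev(values)


# Placeholder per-seed scores for a single method, NOT taken from the paper.
placeholder_scores = {0: 0.80, 1: 0.84, 2: 0.82}
mean_score, std_score = summarize_over_seeds(placeholder_scores)
print(f"{mean_score:.3f} ± {std_score:.3f}")
```

A mean with a standard deviation describes the spread across seeds; a formal significance claim would still need an explicit test between methods, which this sketch does not perform.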

Circularity Check

0 steps flagged

No significant circularity in derivation chain

The paper defines CARE concepts and the CARE score as new constructs, builds the CARE-set automatically as an explicit component of the ReCARE method, and supports its SOTA claim via empirical experiments across multiple target concepts (Nudity, Van Gogh, Tench) that compare erasure, utility, and preservation metrics. No equation, prediction, or central result reduces by construction to a fitted parameter or self-referential input; the performance assertions rest on external comparisons using the introduced metric rather than tautological re-labeling of inputs. The assumption that the extracted CARE-set contains only benign concepts is a correctness/validity concern, not a circularity reduction.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no explicit free parameters, axioms, or invented entities beyond the high-level definitions of CARE and CARE-set; full paper would be needed to audit these.

pith-pipeline@v0.9.1-grok · 5727 in / 970 out tokens · 21067 ms · 2026-06-26T00:49:00.648981+00:00 · methodology


[1] [1]

Scaling Learning Algorithms Towards

Bengio, Yoshua and LeCun, Yann , booktitle =. Scaling Learning Algorithms Towards

[2] [2]

The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=

Unlearning-Aware Minimization , author=. The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=

[3] [3]

Neural computation , volume=

A fast learning algorithm for deep belief nets , author=. Neural computation , volume=. 2006 , publisher=

2006

[4] [4]

2016 , publisher=

Deep learning , author=. 2016 , publisher=

2016

[5] [5]

Proceedings of the IEEE/CVF international conference on computer vision , pages=

Erasing concepts from diffusion models , author=. Proceedings of the IEEE/CVF international conference on computer vision , pages=

[6] [6]

European Conference on Computer Vision , pages=

Reliable and efficient concept erasure of text-to-image diffusion models , author=. European Conference on Computer Vision , pages=. 2024 , organization=

2024

[7] [7]

Advances in neural information processing systems , volume=

Defensive unlearning with adversarial training for robust concept erasure in diffusion models , author=. Advances in neural information processing systems , volume=

[8] [8]

Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

Stereo: A two-stage framework for adversarially robust concept erasing from text-to-image diffusion models , author=. Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

[9] [9]

arXiv preprint arXiv:2501.18950 , year=

Fantastic targets for concept erasure in diffusion models and where to find them , author=. arXiv preprint arXiv:2501.18950 , year=

work page arXiv

[10] [10]

2021 IEEE symposium on security and privacy (SP) , pages=

Machine unlearning , author=. 2021 IEEE symposium on security and privacy (SP) , pages=. 2021 , organization=

2021

[11] [11]

Advances in neural information processing systems , volume=

Making ai forget you: Data deletion in machine learning , author=. Advances in neural information processing systems , volume=

[12] [12]

arXiv preprint arXiv:2210.04610 , year=

Red-teaming the stable diffusion safety filter , author=. arXiv preprint arXiv:2210.04610 , year=

work page arXiv

[13] [13]

arXiv preprint arXiv:2502.08011 , year=

Training-free safe denoisers for safe use of diffusion models , author=. arXiv preprint arXiv:2502.08011 , year=

work page arXiv

[14] [14]

arXiv preprint arXiv:2410.12761 , year=

Safree: Training-free and adaptive guard for safe text-to-image and video generation , author=. arXiv preprint arXiv:2410.12761 , year=

work page arXiv

[15] [15]

Proceedings of the Fifteenth ACM Conference on Data and Application Security and Privacy , pages=

Espresso: Robust concept filtering in text-to-image models , author=. Proceedings of the Fifteenth ACM Conference on Data and Application Security and Privacy , pages=

[16] [16]

Advances in neural information processing systems , volume=

Laion-5b: An open large-scale dataset for training next generation image-text models , author=. Advances in neural information processing systems , volume=

[17] [17]

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers) , pages=

Comclip: Training-free compositional image and text matching , author=. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers) , pages=

2024

[18] [18]

Journal of machine learning research , volume=

Visualizing data using t-SNE , author=. Journal of machine learning research , volume=

[19] [19]

for now , author=

To generate or not? safety-driven unlearned diffusion models are still easy to generate unsafe images... for now , author=. European Conference on Computer Vision , pages=. 2024 , organization=

2024

[20] Muse: Text-to-image generation via masked generative transformers. arXiv preprint arXiv:2301.00704.

[21] High-resolution image synthesis with latent diffusion models. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[22] Multimodal datasets: Misogyny, pornography, and malignant stereotypes. arXiv preprint arXiv:2110.01963.

[23] Pick-a-Pic: An open dataset of user preferences for text-to-image generation. Advances in Neural Information Processing Systems.

[24] Is retain set all you need in machine unlearning? Restoring performance of unlearned models with out-of-distribution images. European Conference on Computer Vision, 2024.

[25] UPCORE: Utility-preserving coreset selection for balanced unlearning. arXiv preprint arXiv:2502.15082.

[26] LLM unlearning reveals a stronger-than-expected coreset effect in current benchmarks. arXiv preprint arXiv:2504.10185.

[27] Unlearning concepts in diffusion model via concept domain correction and concept preserving gradient. Proceedings of the AAAI Conference on Artificial Intelligence.

[28] Towards making systems forget with machine unlearning. 2015 IEEE Symposium on Security and Privacy, 2015.

[29] ImageNet: A large-scale hierarchical image database. 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.

[30] Unified concept editing in diffusion models. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision.

[31] MACE: Mass concept erasure in diffusion models. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[32] One-dimensional adapter to rule them all: Concepts, diffusion models and erasing applications. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[33] Forget-me-not: Learning to forget in text-to-image diffusion models. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[34] Ring-A-Bell! How reliable are concept removal methods for diffusion models? arXiv preprint arXiv:2310.10012.

[35] Circumventing concept erasure methods for text-to-image generative models. arXiv preprint arXiv:2308.01508.

[36] CLIPScore: A reference-free evaluation metric for image captioning. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021.

[37] GANs trained by a two time-scale update rule converge to a local Nash equilibrium. Advances in Neural Information Processing Systems.

[38] SalUn: Empowering machine unlearning via gradient-based weight saliency in both image classification and generation. arXiv preprint arXiv:2310.12508.

[39] EraseDiff: Erasing data influence in diffusion models. arXiv preprint arXiv:2401.05779.

[40] You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.

[41] Safe latent diffusion: Mitigating inappropriate degeneration in diffusion models. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[42] NudeNet: Neural nets for nudity classification, detection and selective censoring. 2019.

[43] Visual transformers: Token-based image representation and processing for computer vision. arXiv preprint arXiv:2006.03677.

[44] Large-scale classification of fine-art paintings: Learning the right metric on the right feature. arXiv preprint arXiv:1505.00855.

[45] Benchmark for compositional text-to-image synthesis. Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 1).

[46] Selective amnesia: A continual learning approach to forgetting in deep generative models. Advances in Neural Information Processing Systems.

[47] Sigmoid loss for language image pre-training. Proceedings of the IEEE/CVF International Conference on Computer Vision.

[48] TIFA: Accurate and interpretable text-to-image faithfulness evaluation with question answering. Proceedings of the IEEE/CVF International Conference on Computer Vision.

[49] Learning LLM-as-a-judge for preference alignment. The Thirteenth International Conference on Learning Representations.

[50] Improving automatic VQA evaluation using large language models. Proceedings of the AAAI Conference on Artificial Intelligence.

[51] HallusionBench: An advanced diagnostic suite for entangled language hallucination and visual illusion in large vision-language models. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[52] FiVE-Bench: A fine-grained video editing benchmark for evaluating emerging diffusion and rectified flow models. Proceedings of the IEEE/CVF International Conference on Computer Vision.

[53] FiVA: Fine-grained visual attribute dataset for text-to-image diffusion models. Advances in Neural Information Processing Systems.

[54] Meta-rewarding language models: Self-improving alignment with LLM-as-a-meta-judge. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025.

[55] Data unlearning beyond uniform forgetting via diffusion time and frequency selection (also listed under the title "Not Every Time and Frequency Need to Be Forgotten in Diffusion Unlearning"). arXiv preprint arXiv:2510.17917.