pith. machine review for the scientific record. sign in

Red-teaming the stable diffusion safety filter

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

years

2026 7

verdicts

UNVERDICTED 7

representative citing papers

Closed-Form Concept Erasure via Double Projections

cs.LG · 2026-04-11 · unverdicted · novelty 6.0

A training-free double-projection linear transformation erases target concepts from generative models by computing a proxy projection then applying a constrained update in the left null space of known directions.

SHIFT: Steering Hidden Intermediates in Flow Transformers

cs.CV · 2026-04-10 · unverdicted · novelty 5.0

SHIFT learns and applies steering vectors to selected layers and timesteps in DiT models to suppress concepts, shift styles, or bias objects while keeping image quality and prompt adherence intact.

citing papers explorer

Showing 7 of 7 citing papers.