UVR is a training-free framework that uses attention modulation based on identified information flow stages in multimodal DiT attention to erase unsafe semantics in image synthesis and editing at 91% and 77% rates while preserving quality.
Freeflux: Un- derstanding and exploiting layer-specific roles in rope- based mmdit for versatile image editing.arXiv preprint arXiv:2503.16153,
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CV 2years
2026 2representative citing papers
citing papers explorer
-
Unified Safe In-context Image Generation in Multimodal Diffusion Transformers via Restricting Unsafe Information Flows
UVR is a training-free framework that uses attention modulation based on identified information flow stages in multimodal DiT attention to erase unsafe semantics in image synthesis and editing at 91% and 77% rates while preserving quality.
- Prompt Reinjection: Alleviating Prompt Forgetting in Multimodal Diffusion Transformers