BitResEdit improves text alignment in VAR image editing by +1.07 CLIP on edited regions via bitwise source-negative guidance and gated residual code injection while preserving background.
Flexvar: Flexible visual autoregressive modeling without residual prediction.arXiv preprint arXiv:2502.20313,
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CV 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
MEPA adds token-routed MoE and residual self-supervised feature alignment to VAR models, reporting better FID on ImageNet 256x256 with half the training epochs and fewer parameters than dense baselines.
citing papers explorer
-
Edit the Bits, Diff the Codes: Bitwise Residual Editing for Visual Autoregressive Models
BitResEdit improves text alignment in VAR image editing by +1.07 CLIP on edited regions via bitwise source-negative guidance and gated residual code injection while preserving background.
-
MEPA: Multi-Scale Representation Alignment for Visual Autoregressive Modeling with Mixture of Experts
MEPA adds token-routed MoE and residual self-supervised feature alignment to VAR models, reporting better FID on ImageNet 256x256 with half the training epochs and fewer parameters than dense baselines.