FlashI2V: Fourier-guided latent shifting prevents conditional image leakage in image-to-video generation

Yunyang Ge, Xinhua Cheng, Chengshu Zhao, Xianyi He, Shenghai Yuan, Bin Lin, Bin Zhu, Li Yuan · 2025 · arXiv 2509.25187

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Rebalancing Reference Frame Dominance to Improve Motion in Image-to-Video Models

cs.CV · 2026-05-19 · unverdicted · novelty 6.0 · 3 refs

Reference-frame dominance in self-attention suppresses motion in image-to-video models; DyMoS rebalances attention from generated frames to the reference during initial denoising steps to improve dynamics while preserving fidelity.

OSP-Next: Efficient High-Quality Video Generation with Sparse Sequence Parallelism, HiF8 Quantization, and Reinforcement Learning

cs.CV · 2026-05-27 · unverdicted · novelty 4.0

OSP-Next reports 83.73% VBench score and up to 2.27x speedup via hybrid sparse attention, SSP parallelism, HiF8 quantization, and Mix-GRPO on diffusion transformers.

citing papers explorer

Rebalancing Reference Frame Dominance to Improve Motion in Image-to-Video Models · cs.CV · 2026-05-19 · unverdicted · none · ref 24 · 3 links
OSP-Next: Efficient High-Quality Video Generation with Sparse Sequence Parallelism, HiF8 Quantization, and Reinforcement Learning · cs.CV · 2026-05-27 · unverdicted · none · ref 8
