ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation

12 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 3 · baseline 1

citation-polarity summary

fields

cs.CV 11 · cs.DC 1

years

2026: 10 · 2025: 2

verdicts

UNVERDICTED 12

representative citing papers

Compositional Video Generation via Inference-Time Guidance

cs.CV · 2026-05-14 · unverdicted · novelty 6.0

CVG improves compositional faithfulness in frozen text-to-video diffusion models by steering early denoising steps with gradients from a classifier trained on the model's own cross-attention features.

Show-o2: Improved Native Unified Multimodal Models

cs.CV · 2025-06-18 · unverdicted · novelty 4.0

Show-o2 unifies text, image, and video understanding and generation in a single autoregressive-plus-flow-matching model built on 3D causal VAE representations.

Image-to-Video Diffusion: From Foundations to Open Frontiers

cs.CV · 2026-05-17 · unverdicted · novelty 3.0

A survey that organizes diffusion-based image-to-video methods into a taxonomy, distills core designs in condition encoding, temporal modeling, noise priors, and upsampling, and discusses applications and open challenges.

citing papers explorer

Showing 12 of 12 citing papers after filters.