Animeagent: Is the multi-agent via image-to-video models a good Disney storytelling artist?
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
fields: cs.CV 3
years: 2026 3
verdicts: UNVERDICTED 3
roles: background 1
polarities: background 1
citing papers explorer
- Real-Time Underwater Image Enhancement via Frequency-Guided Dual-Path Attention
  Proposes FGDPA, a reparameterizable CNN for real-time underwater image enhancement that injects DCT frequency priors and uses frequency-guided dual-path attention to reach SOTA quality at 4.23K parameters and 600+ FPS.
- Hierarchical Textual Knowledge for Enhanced Image Clustering
  KEC constructs hierarchical textual knowledge from LLMs to create knowledge-enhanced image features that improve clustering performance over baselines and zero-shot CLIP on 20 datasets.
- Image-to-Video Diffusion: From Foundations to Open Frontiers
  A survey that organizes diffusion image-to-video methods into a taxonomy, distills core designs in condition encoding, temporal modeling, noise prior, and upsampling, and discusses applications plus challenges.