pith. sign in

hub Baseline reference

Videoscore: Building automatic metrics to simulate fine-grained human feedback for video genera- tion

Baseline reference. 80% of citing Pith papers use this work as a benchmark or comparison.

14 Pith papers citing it
Baseline 80% of classified citations

hub tools

citation-role summary

baseline 2 dataset 2 background 1

citation-polarity summary

fields

cs.CV 13 cs.MM 1

representative citing papers

How Far Are Video Models from True Multimodal Reasoning?

cs.CV · 2026-04-21 · unverdicted · novelty 6.0

Current video models succeed on basic understanding but achieve under 25% success on logically grounded generation and near 0% on interactive generation, exposing gaps in multimodal reasoning.

DanceGRPO: Unleashing GRPO on Visual Generation

cs.CV · 2025-05-12 · unverdicted · novelty 6.0

DanceGRPO applies GRPO to visual generation tasks to achieve stable policy optimization across diffusion models, rectified flows, multiple tasks, and diverse reward models, outperforming prior RL methods.

Improving Video Generation with Human Feedback

cs.CV · 2025-01-23 · unverdicted · novelty 6.0

A human preference dataset and VideoReward model enable Flow-DPO and Flow-NRG to produce smoother, better-aligned videos from text prompts in flow-based generators.

citing papers explorer

Showing 14 of 14 citing papers.