pith. machine review for the scientific record. sign in

arXiv preprint arXiv:2509.06040 (2025) 2, 3

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

fields

cs.CV 5 cs.LG 4

years

2026 9

verdicts

UNVERDICTED 9

representative citing papers

OP-GRPO: Efficient Off-Policy GRPO for Flow-Matching Models

cs.CV · 2026-04-05 · unverdicted · novelty 8.0

OP-GRPO is the first off-policy GRPO method for flow-matching models that reuses trajectories via replay buffer and importance sampling corrections, matching on-policy performance with 34.2% of the training steps.

A Systematic Post-Train Framework for Video Generation

cs.CV · 2026-04-28 · unverdicted · novelty 5.0

A post-training pipeline for video generation models combines SFT, RLHF with novel GRPO, prompt enhancement, and inference optimization to improve visual quality, temporal coherence, and instruction following.

citing papers explorer

Showing 9 of 9 citing papers.