pith. sign in

Visual autoregressive mod- eling: Scalable image generation via next-scale prediction.arXiv preprint arXiv:2404.02905, 2024

19 Pith papers cite this work. Polarity classification is still indexing.

19 Pith papers citing it

citation-role summary

background 2 other 1

citation-polarity summary

polarities

unclear 2 background 1

clear filters

representative citing papers

Normalizing Trajectory Models

cs.CV · 2026-05-08 · unverdicted · novelty 7.0 · 2 refs

NTM models each generative reverse step as a conditional normalizing flow with a hybrid shallow-deep architecture, enabling exact-likelihood training and strong four-step sampling performance on text-to-image tasks.

Amplifying Membership Signal Through Chained Regeneration

cs.LG · 2026-06-30 · unverdicted · novelty 6.0

MADreMIA amplifies membership inference signals by showing that memorized samples maintain higher coherence and slower degradation in chained regeneration trajectories than non-members.

SubdivAR: Autoregressive Next-Scale Prediction for Neural Mesh Subdivision

cs.CV · 2026-06-25 · unverdicted · novelty 6.0

SubdivAR reformulates neural mesh subdivision as autoregressive next-scale coordinate prediction with a topology-aware transformer and reports 18.8% and 14.2% reductions in Hausdorff and Chamfer distance over baselines on a new 40K-mesh dataset.

OmniGen-AR: AutoRegressive Any-to-Image Generation

cs.CV · 2026-06-08 · unverdicted · novelty 6.0

OmniGen-AR is a unified autoregressive framework for any-to-image generation that tokenizes text and visual conditions together and uses disentangled causal attention to support tasks like text-to-image, depth-to-image, image editing, and text-to-video while reporting 0.63 on GenEval and 80.02 on VB

Generative Refinement Networks for Visual Synthesis

cs.CV · 2026-04-14 · unverdicted · novelty 6.0

GRN uses hierarchical binary quantization and entropy-guided refinement to set new ImageNet records of 0.56 rFID for reconstruction and 1.81 gFID for class-conditional generation while releasing code and models.

Autoregressive Video Generation without Vector Quantization

cs.CV · 2024-12-18 · unverdicted · novelty 6.0

NOVA reformulates video generation as non-quantized autoregressive frame-by-frame temporal prediction combined with set-by-set spatial prediction, outperforming prior AR video models and some diffusion models in efficiency and quality.

citing papers explorer

Showing 1 of 1 citing paper after filters.