
VBench: Comprehensive Benchmark Suite for Video Generative Models

Ziqi Huang, Yinan He, Jiashuo Yu, Fan Zhang, Chenyang Si, Yuming Jiang, Yuanhan Zhang, Tianxing Wu, Qingyang Jin, Nattapol Chanpaisit, Yaohui Wang, Xinyuan Chen, Limin Wang, Dahua Lin, Yu Qiao, Ziwei Liu · 2024

Mixed citation behavior. The most common role is background (62% of classified citations).

13 Pith papers cite it.


citation-role summary

background 5 · baseline 1 · dataset 1 · method 1

citation-polarity summary

background 5 · baseline 1 · use dataset 1 · use method 1

representative citing papers

AniMatrix: An Anime Video Generation Model that Thinks in Art, Not Physics

cs.CV · 2026-05-05 · unverdicted · novelty 7.0

AniMatrix generates anime videos by structuring artistic production rules into a controllable taxonomy and training the model to prioritize those rules over physical realism, achieving top scores from professional animators on prompt understanding and artistic motion.

Assessing Pancreatic Ductal Adenocarcinoma Vascular Invasion: the PDACVI Benchmark

cs.CV · 2026-04-30 · accept · novelty 7.0

The CURVAS-PDACVI benchmark supplies a multi-annotated PDAC dataset and shows that uncertainty-aware models yield better-calibrated maps and more robust performance than binary segmentation methods at clinically ambiguous tumor-vessel interfaces.

The Structured Output Benchmark: A Multi-Source Benchmark for Evaluating Structured Output Quality in Large Language Models

cs.CL · 2026-04-28 · accept · novelty 7.0

SOB benchmark shows LLMs achieve near-perfect schema compliance but value accuracy of only 83% on text, 67% on images, and 24% on audio.

Oracle Noise: Faster Semantic Spherical Alignment for Interpretable Latent Optimization

cs.CV · 2026-04-26 · unverdicted · novelty 7.0

Oracle Noise optimizes diffusion model noise on a Riemannian hypersphere guided by key prompt words to preserve the Gaussian prior, eliminate norm inflation, and achieve faster semantic alignment than Euclidean methods.

Distance Field Rasterization for End-to-End Mesh Reconstruction

cs.GR · 2026-04-26 · unverdicted · novelty 7.0

SDFRaster optimizes a continuous SDF on a Delaunay tetrahedral grid, renders it by rasterizing tetrahedra, and integrates differentiable Marching Tetrahedra for end-to-end mesh reconstruction without post-processing.

Toward Visually Realistic Simulation: A Benchmark for Evaluating Robot Manipulation in Simulation

cs.RO · 2026-05-07 · unverdicted · novelty 6.0

VISER is a new visually realistic simulation benchmark for robot manipulation tasks that uses PBR materials and MLLM-assisted asset generation, achieving 0.92 Pearson correlation with real-world policy performance.

Auditing Frontier Vision-Language Models for Trustworthy Medical VQA: Grounding Failures, Format Collapse, and Domain Adaptation

cs.AI · 2026-04-30 · conditional · novelty 6.0

Auditing five frontier VLMs reveals severe grounding failures (max 0.23 IoU, 19.1% Acc@0.5) and format collapse (up to 99% parse failure) in medical VQA; fine-tuning yields 85.5% SLAKE recall but perception remains the primary trustworthiness issue.

Beyond Fidelity: Semantic Similarity Assessment in Low-Level Image Processing

cs.CV · 2026-04-28 · unverdicted · novelty 6.0 · 2 refs

T3S is a new semantic similarity score for processed images that decomposes semantics into foreground entities, background entities, and relations, outperforming fidelity metrics on COCO and SPA-Data.

Evaluation without Generation: Non-Generative Assessment of Harmful Model Specialization with Applications to CSAM

cs.LG · 2026-04-28 · unverdicted · novelty 6.0

Gaussian probing infers harmful model specialization from parameter perturbations and internal representation responses to Gaussian latent ensembles rather than from generated outputs.

ZID-Net: Zero-Inference Diffusion Prior Decoupling Network for Single Image Dehazing

cs.CV · 2026-04-26 · conditional · novelty 6.0

ZID-Net decouples diffusion-based priors into a training-only head to create an efficient feed-forward network for single-image dehazing, reporting 40.75 dB PSNR on RESIDE and 19 ms inference.

DeepSignature: Digitally Signed, Content-Encoding Watermarks for Robust and Transparent Image Authentication

cs.CR · 2026-04-24 · unverdicted · novelty 6.0

DeepSignature embeds digitally signed content-encoding watermarks via neural networks for robust image authentication, source attribution, and latent-space tamper localization.

Parameter-Efficient Multi-View Proficiency Estimation: From Discriminative Classification to Generative Feedback

cs.CV · 2026-05-05 · unverdicted · novelty 5.0

SkillFormer, PATS, and ProfVLM deliver state-of-the-art multi-view proficiency estimation on Ego-Exo4D with up to 20x fewer parameters by combining selective fusion, dense sampling, and generative feedback.

FreeTimeGS++: Secrets of Dynamic Gaussian Splatting and Their Principles

cs.CV · 2026-05-05 · 3 refs

citing papers explorer

Showing 13 of 13 citing papers.

AniMatrix: An Anime Video Generation Model that Thinks in Art, Not Physics

cs.CV · 2026-05-05 · unverdicted · none · ref 35

Assessing Pancreatic Ductal Adenocarcinoma Vascular Invasion: the PDACVI Benchmark

cs.CV · 2026-04-30 · accept · none · ref 28

The Structured Output Benchmark: A Multi-Source Benchmark for Evaluating Structured Output Quality in Large Language Models

cs.CL · 2026-04-28 · accept · none · ref 18

Oracle Noise: Faster Semantic Spherical Alignment for Interpretable Latent Optimization

cs.CV · 2026-04-26 · unverdicted · none · ref 12

Distance Field Rasterization for End-to-End Mesh Reconstruction

cs.GR · 2026-04-26 · unverdicted · none · ref 4

Toward Visually Realistic Simulation: A Benchmark for Evaluating Robot Manipulation in Simulation

cs.RO · 2026-05-07 · unverdicted · none · ref 38

Auditing Frontier Vision-Language Models for Trustworthy Medical VQA: Grounding Failures, Format Collapse, and Domain Adaptation

cs.AI · 2026-04-30 · conditional · none · ref 2

Beyond Fidelity: Semantic Similarity Assessment in Low-Level Image Processing

cs.CV · 2026-04-28 · unverdicted · none · ref 29 · 2 links

Evaluation without Generation: Non-Generative Assessment of Harmful Model Specialization with Applications to CSAM

cs.LG · 2026-04-28 · unverdicted · none · ref 36

ZID-Net: Zero-Inference Diffusion Prior Decoupling Network for Single Image Dehazing

cs.CV · 2026-04-26 · conditional · none · ref 15

DeepSignature: Digitally Signed, Content-Encoding Watermarks for Robust and Transparent Image Authentication

cs.CR · 2026-04-24 · unverdicted · none · ref 41

Parameter-Efficient Multi-View Proficiency Estimation: From Discriminative Classification to Generative Feedback

cs.CV · 2026-05-05 · unverdicted · none · ref 9

FreeTimeGS++: Secrets of Dynamic Gaussian Splatting and Their Principles

cs.CV · 2026-05-05 · unreviewed · ref 5 · 3 links
