hub Canonical reference

Bovik, Hamid R

Wang, Z · 2004 · arXiv 2003.819861

Canonical reference. 75% of citing Pith papers cite this work as background.

76 Pith papers citing it

Background 75% of classified citations

citation-role summary

background 9 · method 2 · dataset 1

citation-polarity summary

background 9 · use method 2 · use dataset 1

representative citing papers

Online TT-ALS for Streaming Tensor Decomposition with Incremental Orthogonalization

math.NA · 2026-06-30 · unverdicted · novelty 7.0

Online TT-ALS achieves exact core updates in streaming TT decomposition with monotonic objective decrease, temporal smoothness, and linear rank complexity.

Animation2Code: Evaluating Temporal Visual Reasoning in Video-to-Code Generation

cs.CV · 2026-06-26 · unverdicted · novelty 7.0

Animation2Code benchmark with 1,069 videos tests VLMs on generating animation code, showing persistent failures in temporal consistency despite good visual matches.

AutoMedBench: Towards Medical AutoResearch with Agentic AI Models

cs.AI · 2026-06-01 · conditional · novelty 7.0

AutoMedBench evaluates AI agents on long-horizon medical workflows across five stages and finds validation and submission as dominant failure points based on thousands of runs.

A Systematic Benchmark of Intraoperative Ultrasound-to-MR Synthesis for Brain Tumour Surgery

cs.CV · 2026-05-30 · conditional · novelty 7.0

On the public ReMIND dataset, a systematic benchmark of six synthesis models across 48 experiments finds LPIPS correlates with downstream segmentation utility while SSIM does not, with SynDiff-2.5D performing best.

DirectorBench: Diagnosing Long-Form Video Generation with Personalized Multi-Agent Evaluation

cs.CL · 2026-05-28 · unverdicted · novelty 7.0

DirectorBench is a profile-aware diagnostic benchmark that localizes bottlenecks in long-form video generation workflows using structured checkpoints and multi-agent evaluation.

Loki: Representation over Architecture for Diffusion-Based Portrait Animation

cs.CV · 2026-05-22 · unverdicted · novelty 7.0

Loki replaces RGB conditioning stacks with identity-orthogonal parametric face encodings rasterized for diffusion, achieving efficient cross-ID portrait animation without cross-ID training data.

Your Neighbors Know: Leveraging Local Neighborhoods for Backdoor Detection in Decentralized Learning

cs.LG · 2026-05-19 · unverdicted · novelty 7.0 · 2 refs

Argus enables backdoor detection in decentralized ML by collaborative neighbor-based validation of triggers, backed by convergence theory and reducing attack success by up to 90% on tested datasets.

PanoPlane: Plane-Aware Panoramic Completion for Sparse-View Indoor 3D Gaussian Splatting

cs.CV · 2026-05-13 · unverdicted · novelty 7.0

PanoPlane achieves up to 17.8% PSNR gains in sparse-view indoor novel view synthesis by using training-free plane-aware panoramic completion to supervise 3D Gaussian Splatting.

GuardMarkGS: Unified Ownership Tracing and Edit Deterrence for 3D Gaussian Splatting

cs.CV · 2026-05-13 · unverdicted · novelty 7.0

GuardMarkGS unifies watermarking and adversarial edit deterrence into a single optimization framework for protecting 3D Gaussian Splatting assets.

SyMTRS: Benchmark Multi-Task Synthetic Dataset for Depth, Domain Adaptation and Super-Resolution in Aerial Imagery

cs.CV · 2026-04-23 · unverdicted · novelty 7.0

A new large-scale synthetic multi-task benchmark dataset supplying pixel-perfect depth, domain-shifted night imagery, and multi-scale low-resolution pairs for aerial remote sensing.

MESA: A Training-Free Multi-Exemplar Deep Framework for Restoring Ancient Inscription Textures

cs.CV · 2026-04-19 · unverdicted · novelty 7.0

MESA restores ancient inscription textures via multi-exemplar style transfer from VGG19 features with per-layer exemplar selection and OCR-derived weights, without any model training.

GeRM: A Generative Rendering Model From Physically Realistic to Photorealistic

cs.CV · 2026-04-10 · unverdicted · novelty 7.0 · 2 refs

GeRM learns a distribution transfer vector field via a multi-condition ControlNet to convert physically-based renders into photorealistic images using text prompts and a 50K expert-curated dataset.

LumaFlux: Lifting 8-Bit Worlds to HDR Reality with Physically-Guided Diffusion Transformers

cs.CV · 2026-04-03 · unverdicted · novelty 7.0

LumaFlux is a physically and perceptually guided diffusion transformer for SDR-to-HDR conversion that introduces PGA, PCM, and HDR Residual Coupler modules plus a new training corpus and benchmark, outperforming prior ITM methods.

SNIC: Synthesized Noisy Images using Calibration

eess.IV · 2025-12-17 · unverdicted · novelty 7.0

A sensor-specific calibration pipeline using dark frames produces synthesized noisy RAW images that close 54-64% of the PSNR gap to real noise versus manufacturer profiles, accompanied by the open SNIC dataset of over 6600 paired images.

Delta Rectified Flow Sampling for Text-to-Image Editing

cs.CV · 2025-09-01 · unverdicted · novelty 7.0

DRFS is a new inversion-free editing technique for rectified flow models that models source-target velocity discrepancies and applies a time-dependent shift to improve fidelity and unify prior methods like DDS and FlowEdit.

Task complexity shapes internal representations and robustness in neural networks

cs.LG · 2025-08-07 · unverdicted · novelty 7.0

Harder classification tasks produce neural representations whose accuracy collapses under binarization and shuffling while easier tasks remain robust, defining task complexity via the performance gap between full-precision and perturbed networks.

PhotIQA: A photoacoustic image data set with image quality ratings

eess.IV · 2025-07-04 · conditional · novelty 7.0

PhotIQA is a new public dataset of 1134 expert-rated photoacoustic images for benchmarking image quality assessment in medical imaging.

SLAM&Render: A Benchmark for the Intersection Between Neural Rendering, Gaussian Splatting and SLAM

cs.RO · 2025-04-18 · unverdicted · novelty 7.0

Presents SLAM&Render, a robot-recorded benchmark dataset with 40 multi-modal sequences for testing SLAM, novel view synthesis, and Gaussian Splatting under controlled variations in lighting, arrangements, and occlusions.

Cyclic 2.5D Perceptual Loss for Cross-Modal 3D Medical Image Synthesis: T1w MRI to Tau PET

eess.IV · 2024-06-18 · unverdicted · novelty 7.0

Proposes a cyclic 2.5D perceptual loss with manufacturer SUVR standardization for T1w MRI to tau PET synthesis, reporting improved regional agreement on ADNI and SCAN cohorts across U-Net, UNETR, SwinUNETR, CycleGAN, and Pix2Pix.

Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels

cs.CV · 2023-12-28 · conditional · novelty 7.0

Q-Align trains LMMs on discrete text-defined levels for visual scoring, achieving SOTA on IQA, IAA, and VQA while unifying the tasks in OneAlign.

PointSplat: Compact Gaussian Splatting via Human-Centric Prediction

cs.CV · 2026-06-30 · unverdicted · novelty 6.0

PointSplat infers compact Gaussian splats directly in 3D space from input point sets via ray casting and Point-Image Transformer to reduce inter-view redundancy and improve novel-view quality for humans.

GRay: Ray Tracing 3D Gaussians Near the Speed of Splats

cs.GR · 2026-06-29 · unverdicted · novelty 6.0

GRay is a ray tracer for 3D Gaussians that exploits dense small primitives for logarithmic scaling, rendering nearly 4x faster and optimizing nearly 10x faster than prior ray tracing while remaining competitive with splatting at somewhat lower quality.

AVTok: 1D Unified Tokenization for Holistic Audio-Video Generation

cs.CV · 2026-06-29 · unverdicted · novelty 6.0

AVTok is a unified tokenizer that converts audio-video pairs into a compact 1D latent representation via dual-stream transformer and hierarchical training for improved reconstruction and cross-modal generation.

Recovering Sharp Conductivity Features in the Finite-Data Calderón Problem with Physics-Informed Neural Networks

cs.LG · 2026-06-26 · unverdicted · novelty 6.0

A PINN framework with separate networks for conductivity and potentials, multiscale wavelet excitations, and FFE recovers dominant conductivity structures from finite DtN data with 3-12% relative error on synthetic tests, with FFE aiding sharp features.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Measuring the Transferability of Adversarial Examples

cs.LG · 2019-07-14 · unverdicted · none · ref 22

Empirical measurement of adversarial example transferability between VGG and Inception model classes with methodological refinements to attack strength selection, perturbation clipping, and evaluation via SSIM.
