Schmon, and Chris G

Hou, Z · 2022 · arXiv 6347.2022

20 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Learning Spectral and Polarimetric Clues for One-to-Multimodal Novel View Synthesis

cs.CV · 2026-07-02 · unverdicted · novelty 7.0 · 2 refs

SPoILeR uses multimodal pre-training to enable accurate novel view synthesis of infrared, polarimetric, and multispectral data from RGB-supervised fine-tuning on new scenes.

AbsoluteDegradation: A Physics-Inspired Synthetic Film-Degradation Pipeline and Archival Film Restoration Benchmark

cs.CV · 2026-07-02 · unverdicted · novelty 7.0

AbsoluteDegradation supplies a physics-inspired synthetic degradation pipeline and a large real-world archival benchmark to train and evaluate film restoration models.

Sparsity-Inducing Divergence Losses for Biometric Verification

cs.CV · 2026-06-30 · unverdicted · novelty 7.0

Q-Margin encodes margin penalties into the reference measure of an alpha-divergence loss to produce sparse discriminative embeddings for face and speaker verification.

Learning to Deny: Action Denial in Multimodal Large Language Models

cs.CV · 2026-06-30 · unverdicted · novelty 7.0

MLLMs drop from over 85% accuracy on action presence to under 50% on matched action-denial videos, exposing a causal verification gap that causal graph prompts partially close.

Physics-Guided Deep Unfolding for Blind Cross-Sensor Spectral Super-Resolution via Learning the Spectral Transformation Function

cs.CV · 2026-06-04 · unverdicted · novelty 7.0 · 2 refs

PGU-Net is a deep unfolding network for blind cross-sensor spectral super-resolution that jointly reconstructs the HSI and learns the spectral transformation function via alternating optimization stages.

AVTrack: Audio-Visual Tracking in Human-centric Complex Scenes

cs.CV · 2026-06-01 · unverdicted · novelty 7.0

Introduces AVTrack dataset for audio-visual tracking in challenging human-centric scenes, demonstrating performance drops in existing methods.

Reviving In-domain Fine-tuning Methods for Source-Free Cross-domain Few-shot Learning

cs.CV · 2026-05-12 · unverdicted · novelty 7.0

LoRA adapters fix collapsed visual CLS token attention in CLIP for superior cross-domain few-shot learning, and the new Semantic Probe framework revives prompt methods to reach state-of-the-art on four benchmarks.

ExpertEdit: Learning Skill-Aware Motion Editing from Expert Videos

cs.CV · 2026-04-12 · unverdicted · novelty 7.0

ExpertEdit edits novice motions to expert skill levels by learning a motion prior from unpaired videos and infilling masked skill-critical spans.

The Inattentional Gap: Task-Conditioned Language and Vision Models Omit the Safety-Critical Signals They Can Otherwise Report

cs.CL · 2026-06-25 · unverdicted · novelty 6.0

Task conditioning suppresses safety-critical signal reporting in language and vision models that unconstrained versions report at higher rates, creating an inattentional gap that decouples benchmark safety from real-world safety.

On the QUEST for Uncertainty Quantification via Highest Density Regions

cs.LG · 2026-06-17 · unverdicted · novelty 6.0

QUEST measures uncertainty via the Lebesgue volume of highest-density regions of a distribution's support, evaluated at robustness parameter alpha, and claims to satisfy UQ axioms while outperforming variance and differential entropy on selective prediction tasks.

MSIQ: Moment-based Scale-Invariant Quality Measure for Single Image Super-Resolution

cs.CV · 2026-05-17 · unverdicted · novelty 6.0

MSIQ is a scale-invariant, model-free quality metric for single image super-resolution using normalized central geometric moments for direct comparison of different-resolution images.

Preventing Latent Rehearsal Decay in Online Continual SSL with SOLAR

cs.LG · 2026-04-12 · unverdicted · novelty 6.0

SOLAR prevents latent rehearsal decay in online continual SSL by adaptively managing replay buffers with deviation proxies and an explicit overlap loss, delivering both fast convergence and state-of-the-art final accuracy on vision benchmarks.

Holi-DETR: Holistic Fashion Item Detection Leveraging Contextual Information

cs.CV · 2025-12-29 · unverdicted · novelty 6.0

Holi-DETR improves fashion item detection by integrating co-occurrence probabilities, inter-item spatial arrangements, and body keypoint relationships into the DETR architecture.

On the notion of missingness for path attribution explainability methods in medical settings: Guiding the selection of medically meaningful baselines

cs.LG · 2025-08-20 · unverdicted · novelty 6.0

Counterfactual baselines for Integrated Gradients yield more faithful and medically relevant attributions than standard baselines across three medical datasets.

On Diffusion Modeling for Anomaly Detection

cs.LG · 2023-05-29 · unverdicted · novelty 6.0

Diffusion models via DDPM work for anomaly detection but are slow; the proposed DTE method estimates diffusion time distribution analytically and with a neural net to deliver faster inference while outperforming DDPM on ADBench for unsupervised and semi-supervised settings.

Automatic Uncertainty-Aware Synthetic Data Bootstrapping for Historical Map Segmentation

cs.CV · 2025-11-19 · unverdicted · novelty 5.0

Synthetic historical maps are generated from modern vector data via style transfer and uncertainty emulation to train segmentation models for historical map corpora.

When Token Compression Breaks: Structural Pruning vs. Token Reduction for Robust ViT Segmentation under High Compression

cs.CV · 2026-07-02 · unverdicted · novelty 4.0

Token compression in ViT segmentation degrades sharply at high ratios due to information loss while structural pruning degrades smoothly; a moderate prune-then-merge pipeline improves the trade-off on ADE20K and Cityscapes under corruption.

Visual Prompting Meets Feature Reconstruction-Based Anomaly Detection with Dual-Teacher Supervision

cs.CV · 2026-06-08 · unverdicted · novelty 4.0

Combines visual prompting, dual-teacher supervision, and diffusion augmentation on an MMR backbone to gain 3.5 percentage points on the AeBAD anomaly detection dataset.

Binary Road Surface Classification Using Machine Learning on Production Vehicle Signals During Cruising

cs.LG · 2026-06-01 · unverdicted · novelty 3.0

Machine learning on vehicle signals enables binary road surface classification into grip or slip conditions during cruising.

AI Driven Soccer Analysis Using Computer Vision

cs.CV · 2026-04-09 · unverdicted · novelty 2.0

A system combining object detection, segmentation, keypoint prediction, and homography transforms soccer video into real-world player positions and tactical statistics.

citing papers explorer

Showing 20 of 20 citing papers.

Learning Spectral and Polarimetric Clues for One-to-Multimodal Novel View Synthesis · cs.CV · 2026-07-02 · unverdicted · none · ref 1 · 2 links
SPoILeR uses multimodal pre-training to enable accurate novel view synthesis of infrared, polarimetric, and multispectral data from RGB-supervised fine-tuning on new scenes.
AbsoluteDegradation: A Physics-Inspired Synthetic Film-Degradation Pipeline and Archival Film Restoration Benchmark · cs.CV · 2026-07-02 · unverdicted · none · ref 54
AbsoluteDegradation supplies a physics-inspired synthetic degradation pipeline and a large real-world archival benchmark to train and evaluate film restoration models.
Sparsity-Inducing Divergence Losses for Biometric Verification · cs.CV · 2026-06-30 · unverdicted · none · ref 4
Q-Margin encodes margin penalties into the reference measure of an alpha-divergence loss to produce sparse discriminative embeddings for face and speaker verification.
Learning to Deny: Action Denial in Multimodal Large Language Models · cs.CV · 2026-06-30 · unverdicted · none · ref 76
MLLMs drop from over 85% accuracy on action presence to under 50% on matched action-denial videos, exposing a causal verification gap that causal graph prompts partially close.
Physics-Guided Deep Unfolding for Blind Cross-Sensor Spectral Super-Resolution via Learning the Spectral Transformation Function · cs.CV · 2026-06-04 · unverdicted · none · ref 22 · 2 links
PGU-Net is a deep unfolding network for blind cross-sensor spectral super-resolution that jointly reconstructs the HSI and learns the spectral transformation function via alternating optimization stages.
AVTrack: Audio-Visual Tracking in Human-centric Complex Scenes · cs.CV · 2026-06-01 · unverdicted · none · ref 51
Introduces AVTrack dataset for audio-visual tracking in challenging human-centric scenes, demonstrating performance drops in existing methods.
Reviving In-domain Fine-tuning Methods for Source-Free Cross-domain Few-shot Learning · cs.CV · 2026-05-12 · unverdicted · none · ref 42
LoRA adapters fix collapsed visual CLS token attention in CLIP for superior cross-domain few-shot learning, and the new Semantic Probe framework revives prompt methods to reach state-of-the-art on four benchmarks.
ExpertEdit: Learning Skill-Aware Motion Editing from Expert Videos · cs.CV · 2026-04-12 · unverdicted · none · ref 8
ExpertEdit edits novice motions to expert skill levels by learning a motion prior from unpaired videos and infilling masked skill-critical spans.
The Inattentional Gap: Task-Conditioned Language and Vision Models Omit the Safety-Critical Signals They Can Otherwise Report · cs.CL · 2026-06-25 · unverdicted · none · ref 53
Task conditioning suppresses safety-critical signal reporting in language and vision models that unconstrained versions report at higher rates, creating an inattentional gap that decouples benchmark safety from real-world safety.
On the QUEST for Uncertainty Quantification via Highest Density Regions · cs.LG · 2026-06-17 · unverdicted · none · ref 54
QUEST measures uncertainty via the Lebesgue volume of highest-density regions of a distribution's support, evaluated at robustness parameter alpha, and claims to satisfy UQ axioms while outperforming variance and differential entropy on selective prediction tasks.
MSIQ: Moment-based Scale-Invariant Quality Measure for Single Image Super-Resolution · cs.CV · 2026-05-17 · unverdicted · none · ref 3
MSIQ is a scale-invariant, model-free quality metric for single image super-resolution using normalized central geometric moments for direct comparison of different-resolution images.
Preventing Latent Rehearsal Decay in Online Continual SSL with SOLAR · cs.LG · 2026-04-12 · unverdicted · none · ref 18
SOLAR prevents latent rehearsal decay in online continual SSL by adaptively managing replay buffers with deviation proxies and an explicit overlap loss, delivering both fast convergence and state-of-the-art final accuracy on vision benchmarks.
Holi-DETR: Holistic Fashion Item Detection Leveraging Contextual Information · cs.CV · 2025-12-29 · unverdicted · none · ref 6
Holi-DETR improves fashion item detection by integrating co-occurrence probabilities, inter-item spatial arrangements, and body keypoint relationships into the DETR architecture.
On the notion of missingness for path attribution explainability methods in medical settings: Guiding the selection of medically meaningful baselines · cs.LG · 2025-08-20 · unverdicted · none · ref 14
Counterfactual baselines for Integrated Gradients yield more faithful and medically relevant attributions than standard baselines across three medical datasets.
On Diffusion Modeling for Anomaly Detection · cs.LG · 2023-05-29 · unverdicted · none · ref 49
Diffusion models via DDPM work for anomaly detection but are slow; the proposed DTE method estimates diffusion time distribution analytically and with a neural net to deliver faster inference while outperforming DDPM on ADBench for unsupervised and semi-supervised settings.
Automatic Uncertainty-Aware Synthetic Data Bootstrapping for Historical Map Segmentation · cs.CV · 2025-11-19 · unverdicted · none · ref 19
Synthetic historical maps are generated from modern vector data via style transfer and uncertainty emulation to train segmentation models for historical map corpora.
When Token Compression Breaks: Structural Pruning vs. Token Reduction for Robust ViT Segmentation under High Compression · cs.CV · 2026-07-02 · unverdicted · none · ref 14
Token compression in ViT segmentation degrades sharply at high ratios due to information loss while structural pruning degrades smoothly; a moderate prune-then-merge pipeline improves the trade-off on ADE20K and Cityscapes under corruption.
Visual Prompting Meets Feature Reconstruction-Based Anomaly Detection with Dual-Teacher Supervision · cs.CV · 2026-06-08 · unverdicted · none · ref 14
Combines visual prompting, dual-teacher supervision, and diffusion augmentation on an MMR backbone to gain 3.5 percentage points on the AeBAD anomaly detection dataset.
Binary Road Surface Classification Using Machine Learning on Production Vehicle Signals During Cruising · cs.LG · 2026-06-01 · unverdicted · none · ref 31
Machine learning on vehicle signals enables binary road surface classification into grip or slip conditions during cruising.
AI Driven Soccer Analysis Using Computer Vision · cs.CV · 2026-04-09 · unverdicted · none · ref 2
A system combining object detection, segmentation, keypoint prediction, and homography transforms soccer video into real-world player positions and tactical statistics.
