The pascal visual object classes (voc) challenge.International Journal of Computer Vision, 88:303–338, 06 2010

Mark Everingham, Luc Van Gool, Christopher K · 2010 · DOI 10.1007/s11263-009-0275-4

14 Pith papers cite this work. Polarity classification is still indexing.

14 Pith papers citing it

open at publisher browse 14 citing papers

citation-role summary

dataset 3

citation-polarity summary

use dataset 3

representative citing papers

AutoMedBench: Towards Medical AutoResearch with Agentic AI Models

cs.AI · 2026-06-01 · conditional · novelty 7.0

AutoMedBench evaluates AI agents on long-horizon medical workflows across five stages and finds validation and submission as dominant failure points based on thousands of runs.

BOOKMARKS: Efficient Active Storyline Memory for Role-playing

cs.CL · 2026-05-13 · unverdicted · novelty 7.0

BOOKMARKS introduces searchable bookmarks as reusable answers to storyline questions, enabling active initialization and passive synchronization for more consistent role-playing agent memory than recurrent summarization.

Computer Vision for MOBA Analytics: A Dataset and Baseline for Visibility Analysis in Dota 2

cs.CV · 2026-06-25 · unverdicted · novelty 6.0

Introduces the Dota2-Vis dataset of 288 videos from 144 TI 2025 matches plus 2,477 annotated minimaps and evaluates YOLO11 variants for player-icon detection to produce visibility curves.

Benchmarking Open-Source Layout Detection Models for Data Snapshot Extraction from Institutional Documents

cs.CL · 2026-06-04 · unverdicted · novelty 6.0

Introduces a benchmark dataset for data snapshot extraction focused on semantically meaningful analytical artifacts in institutional documents and shows open-source layout models struggle to generalize from academic benchmarks.

WildRoadBench: A Wild Aerial Road-Damage Grounding Benchmark for Vision-Language Models and Autonomous Agents

cs.CV · 2026-05-19 · unverdicted · novelty 6.0 · 2 refs

WildRoadBench is a new dual-track benchmark on professionally annotated wild UAV road-damage images showing closed-source VLMs lead but leave over half the AP_50 metric on the table while agents lag and open-source models collapse on small targets.

GAZE: Grounded Agentic Zero-shot Evaluation with Viewer-Level Tools and Literature Retrieval on Rare Brain MRI

cs.LG · 2026-04-25 · unverdicted · novelty 6.0

GAZE framework with viewer tools and literature retrieval achieves 58.2 mAP@0.3 lesion localization and 34.9% top-1 diagnostic accuracy on 906 rare brain MRI cases in zero-shot setting, with larger gains on rarest pathologies.

Variational Feature Compression for Model-Specific Representations

cs.CV · 2026-04-08 · unverdicted · novelty 6.0

A variational latent bottleneck with KL regularization and a dynamic binary mask based on saliency produces model-specific features that keep high accuracy for one classifier but drop others below 2% on CIFAR-100 with over 45x suppression.

MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models

cs.CV · 2026-03-26 · unverdicted · novelty 6.0

MuRF fuses multi-resolution features from frozen vision foundation models at inference time to create stronger representations without any training.

Cross-Domain Few-Shot Segmentation via Ordinary Differential Equations over Time Intervals

cs.CV · 2025-09-01 · unverdicted · novelty 6.0

FSS-TIs models cross-domain few-shot segmentation as an ODE process with Fourier-based spectral perturbations to create domain-agnostic features and enable effective fine-tuning on limited support samples.

MoEIoU: Rethinking Bounding-Box Regression as a Mixture of Experts

cs.CV · 2026-05-30 · unverdicted · novelty 5.0

MoEIoU is a mixture-of-experts IoU loss using log-sum-exp aggregation and curriculum weighting that reports consistent gains over prior IoU losses on PASCAL VOC, HRIPCB, and MS COCO with YOLO models.

XiYOLO: Energy-Aware Object Detection via Iterative Architecture Search and Scaling

cs.CV · 2026-05-07 · unverdicted · novelty 4.0

XiYOLO uses iterative energy-aware neural architecture search and scaling to produce object detectors with stronger accuracy-energy tradeoffs than YOLO baselines on GPUs and NPUs.

GarmNet: Improving Global with Local Perception for Robotic Laundry Folding

cs.RO · 2019-06-30 · unverdicted · novelty 4.0

GarmNet jointly localizes garments and detects grasp landmarks on the CloPeMa dataset, reducing localization error by 24.7% when landmark detection is included.

Generalization Under Scrutiny: Cross-Domain Detection Progresses, Pitfalls, and Persistent Challenges

cs.CV · 2026-04-09 · unverdicted · novelty 3.0

A survey that organizes methods for cross-domain object detection into a taxonomy, analyzes domain shift across detection stages, and outlines persistent challenges.

PipeMFL-240K: A Large-scale Dataset and Benchmark for Object Detection in Pipeline Magnetic Flux Leakage Imaging

cs.CV · 2026-02-04 · 2 refs

citing papers explorer

Showing 14 of 14 citing papers.

AutoMedBench: Towards Medical AutoResearch with Agentic AI Models cs.AI · 2026-06-01 · conditional · none · ref 19
AutoMedBench evaluates AI agents on long-horizon medical workflows across five stages and finds validation and submission as dominant failure points based on thousands of runs.
BOOKMARKS: Efficient Active Storyline Memory for Role-playing cs.CL · 2026-05-13 · unverdicted · none · ref 41
BOOKMARKS introduces searchable bookmarks as reusable answers to storyline questions, enabling active initialization and passive synchronization for more consistent role-playing agent memory than recurrent summarization.
Computer Vision for MOBA Analytics: A Dataset and Baseline for Visibility Analysis in Dota 2 cs.CV · 2026-06-25 · unverdicted · none · ref 30
Introduces the Dota2-Vis dataset of 288 videos from 144 TI 2025 matches plus 2,477 annotated minimaps and evaluates YOLO11 variants for player-icon detection to produce visibility curves.
Benchmarking Open-Source Layout Detection Models for Data Snapshot Extraction from Institutional Documents cs.CL · 2026-06-04 · unverdicted · none · ref 3
Introduces a benchmark dataset for data snapshot extraction focused on semantically meaningful analytical artifacts in institutional documents and shows open-source layout models struggle to generalize from academic benchmarks.
WildRoadBench: A Wild Aerial Road-Damage Grounding Benchmark for Vision-Language Models and Autonomous Agents cs.CV · 2026-05-19 · unverdicted · none · ref 8 · 2 links
WildRoadBench is a new dual-track benchmark on professionally annotated wild UAV road-damage images showing closed-source VLMs lead but leave over half the AP_50 metric on the table while agents lag and open-source models collapse on small targets.
GAZE: Grounded Agentic Zero-shot Evaluation with Viewer-Level Tools and Literature Retrieval on Rare Brain MRI cs.LG · 2026-04-25 · unverdicted · none · ref 8
GAZE framework with viewer tools and literature retrieval achieves 58.2 mAP@0.3 lesion localization and 34.9% top-1 diagnostic accuracy on 906 rare brain MRI cases in zero-shot setting, with larger gains on rarest pathologies.
Variational Feature Compression for Model-Specific Representations cs.CV · 2026-04-08 · unverdicted · none · ref 8
A variational latent bottleneck with KL regularization and a dynamic binary mask based on saliency produces model-specific features that keep high accuracy for one classifier but drop others below 2% on CIFAR-100 with over 45x suppression.
MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models cs.CV · 2026-03-26 · unverdicted · none · ref 4
MuRF fuses multi-resolution features from frozen vision foundation models at inference time to create stronger representations without any training.
Cross-Domain Few-Shot Segmentation via Ordinary Differential Equations over Time Intervals cs.CV · 2025-09-01 · unverdicted · none · ref 55
FSS-TIs models cross-domain few-shot segmentation as an ODE process with Fourier-based spectral perturbations to create domain-agnostic features and enable effective fine-tuning on limited support samples.
MoEIoU: Rethinking Bounding-Box Regression as a Mixture of Experts cs.CV · 2026-05-30 · unverdicted · none · ref 3
MoEIoU is a mixture-of-experts IoU loss using log-sum-exp aggregation and curriculum weighting that reports consistent gains over prior IoU losses on PASCAL VOC, HRIPCB, and MS COCO with YOLO models.
XiYOLO: Energy-Aware Object Detection via Iterative Architecture Search and Scaling cs.CV · 2026-05-07 · unverdicted · none · ref 13
XiYOLO uses iterative energy-aware neural architecture search and scaling to produce object detectors with stronger accuracy-energy tradeoffs than YOLO baselines on GPUs and NPUs.
GarmNet: Improving Global with Local Perception for Robotic Laundry Folding cs.RO · 2019-06-30 · unverdicted · none · ref 4
GarmNet jointly localizes garments and detects grasp landmarks on the CloPeMa dataset, reducing localization error by 24.7% when landmark detection is included.
Generalization Under Scrutiny: Cross-Domain Detection Progresses, Pitfalls, and Persistent Challenges cs.CV · 2026-04-09 · unverdicted · none · ref 95
A survey that organizes methods for cross-domain object detection into a taxonomy, analyzes domain shift across detection stages, and outlines persistent challenges.
PipeMFL-240K: A Large-scale Dataset and Benchmark for Object Detection in Pipeline Magnetic Flux Leakage Imaging cs.CV · 2026-02-04 · unreviewed · ref 5 · 2 links

The pascal visual object classes (voc) challenge.International Journal of Computer Vision, 88:303–338, 06 2010

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer