NMS Strikes Back

NMS Strikes Back · 2022 · arXiv 2212.06137

4 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

baseline 1

citation-polarity summary

baseline 1

representative citing papers

MVDGC: Joint 3D and 2D Multi-view Pedestrian Detection via Dual Geometric Constraints

cs.CV · 2026-06-30 · unverdicted · novelty 7.0

MVDGC unifies BEV and image-view pedestrian localization into one task via 3D cylindrical queries that enforce dual geometric constraints between views.

Simple Supervision Is Hard to Beat: A Bitter Lesson from Sparse Target Labels in Domain-Adaptive Object Detection

cs.CV · 2026-06-29 · unverdicted · novelty 6.0

RTSM improves SFDA-OD by 1.7-18.3 AP50 across methods and detectors, and ten sparse-label feedback plugins give only limited method-dependent gains over it.

MDS-DETR: DETR with Masked Duplicate Suppressor

cs.CV · 2026-05-22 · unverdicted · novelty 6.0

MDS-DETR introduces a masked duplicate suppressor in self-attention to enable one-to-many supervision inside a single decoder, yielding +2.8 mAP over Deformable-DETR on COCO with 5% more training time and outperforming MR.DETR by 0.3 mAP while training 20% faster.

Perception Encoder: The best visual embeddings are not at the output of the network

cs.CV · 2025-04-17 · unverdicted · novelty 6.0

Intermediate layers of a contrastively trained vision-language encoder yield stronger general embeddings than the output layer, enabling state-of-the-art performance across image/video classification, multimodal QA, and dense prediction after simple alignment.

citing papers explorer

MVDGC: Joint 3D and 2D Multi-view Pedestrian Detection via Dual Geometric Constraints · cs.CV · 2026-06-30 · unverdicted · none · ref 70
Simple Supervision Is Hard to Beat: A Bitter Lesson from Sparse Target Labels in Domain-Adaptive Object Detection · cs.CV · 2026-06-29 · unverdicted · none · ref 16
MDS-DETR: DETR with Masked Duplicate Suppressor · cs.CV · 2026-05-22 · unverdicted · none · ref 30
Perception Encoder: The best visual embeddings are not at the output of the network · cs.CV · 2025-04-17 · unverdicted · none · ref 99
