pith. sign in

arxiv: 2106.10409 · v1 · pith:4LYFJONZnew · submitted 2021-06-19 · 💻 cs.CV

AdaZoom: Adaptive Zoom Network for Multi-Scale Object Detection in Large Scenes

classification 💻 cs.CV
keywords detectionadaptiveadazoomfocusnetworkobjectobjectsregions
0
0 comments X
read the original abstract

Detection in large-scale scenes is a challenging problem due to small objects and extreme scale variation. It is essential to focus on the image regions of small objects. In this paper, we propose a novel Adaptive Zoom (AdaZoom) network as a selective magnifier with flexible shape and focal length to adaptively zoom the focus regions for object detection. Based on policy gradient, we construct a reinforcement learning framework for focus region generation, with the reward formulated by object distributions. The scales and aspect ratios of the generated regions are adaptive to the scales and distribution of objects inside. We apply variable magnification according to the scale of the region for adaptive multi-scale detection. We further propose collaborative training to complementarily promote the performance of AdaZoom and the detection network. To validate the effectiveness, we conduct extensive experiments on VisDrone2019, UAVDT, and DOTA datasets. The experiments show AdaZoom brings a consistent and significant improvement over different detection networks, achieving state-of-the-art performance on these datasets, especially outperforming the existing methods by AP of 4.64% on Vis-Drone2019.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. DenseScout: Algorithm-System Co-design for Budgeted Tiny Object Selection on Edge Platforms

    cs.CV 2026-04 unverdicted novelty 6.0

    DenseScout is a lightweight dense-response selector that outperforms detector-based baselines for budgeted tiny-object patch selection on edge platforms, with end-to-end utility depending on both selector quality and ...