pith. machine review for the scientific record. sign in

arxiv: 1504.08083 · v2 · submitted 2015-04-30 · 💻 cs.CV

Recognition: unknown

Fast R-CNN

Ross Girshick

Authors on Pith no claims yet
classification 💻 cs.CV
keywords fastr-cnnfastercomparedconvolutionaldeepdetectionnetwork
0
0 comments X
read the original abstract

This paper proposes a Fast Region-based Convolutional Network method (Fast R-CNN) for object detection. Fast R-CNN builds on previous work to efficiently classify object proposals using deep convolutional networks. Compared to previous work, Fast R-CNN employs several innovations to improve training and testing speed while also increasing detection accuracy. Fast R-CNN trains the very deep VGG16 network 9x faster than R-CNN, is 213x faster at test-time, and achieves a higher mAP on PASCAL VOC 2012. Compared to SPPnet, Fast R-CNN trains VGG16 3x faster, tests 10x faster, and is more accurate. Fast R-CNN is implemented in Python and C++ (using Caffe) and is available under the open-source MIT License at https://github.com/rbgirshick/fast-rcnn.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

    cs.CV 2015-10 conditional novelty 7.0

    A pruning-quantization-Huffman pipeline compresses deep neural networks 35-49x without accuracy loss.

  2. CalibFree: Self-Supervised View Feature Separation for Calibration-Free Multi-Camera Multi-Object Tracking

    cs.CV 2026-05 unverdicted novelty 6.0

    CalibFree enables calibration-free multi-camera tracking via self-supervised feature separation through single-view distillation and cross-view reconstruction, reporting 3% higher accuracy and 7.5% better F1 on tested...

  3. Label-Efficient School Detection from Aerial Imagery via Weakly Supervised Pretraining and Fine-Tuning

    cs.CV 2026-05 unverdicted novelty 4.0

    A two-stage weakly supervised pipeline pretrains on auto-generated school labels from sparse points and fine-tunes on only 50 manual examples to achieve strong detection performance in aerial imagery.

  4. Learning to count small and clustered objects with application to bacterial colonies

    cs.CV 2026-04 unverdicted novelty 4.0

    ACFamNet Pro reaches 9.64% mean normalized absolute error on bacterial colony images under 5-fold cross-validation, beating FamNet by 12.71%.