pith. sign in

DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it
abstract

In this paper, we develop a novel unified framework called DeepText for text region proposal generation and text detection in natural images via a fully convolutional neural network (CNN). First, we propose the inception region proposal network (Inception-RPN) and design a set of text characteristic prior bounding boxes to achieve high word recall with only hundred level candidate proposals. Next, we present a powerful textdetection network that embeds ambiguous text category (ATC) information and multilevel region-of-interest pooling (MLRP) for text and non-text classification and accurate localization. Finally, we apply an iterative bounding box voting scheme to pursue high recall in a complementary manner and introduce a filtering algorithm to retain the most suitable bounding box, while removing redundant inner and outer boxes for each text instance. Our approach achieves an F-measure of 0.83 and 0.85 on the ICDAR 2011 and 2013 robust text detection benchmarks, outperforming previous state-of-the-art results.

fields

cs.CV 1

years

2019 1

verdicts

UNVERDICTED 1

representative citing papers

RFBTD: RFB Text Detector

cs.CV · 2019-07-04 · unverdicted · novelty 2.0

RFBTD applies Receptive Field Blocks to scene text detection for arbitrary orientations and dense text, reporting an F-score of 47.09 on ICDAR2015 at 720p resolution.

citing papers explorer

Showing 1 of 1 citing paper.

  • RFBTD: RFB Text Detector cs.CV · 2019-07-04 · unverdicted · none · ref 4 · internal anchor

    RFBTD applies Receptive Field Blocks to scene text detection for arbitrary orientations and dense text, reporting an F-score of 47.09 on ICDAR2015 at 720p resolution.