Single Shot Text Detector with Regional Attention

Cited by: 205
Authors
He, Pan [1]
Huang, Weilin [2, 3]
He, Tong [3]
Zhu, Qile [1]
Qiao, Yu [3]
Li, Xiaolin [1]
Affiliations
[1] Univ Florida, Natl Sci Fdn, Ctr Big Learning, Gainesville, FL 32611 USA
[2] Univ Oxford, Dept Engn Sci, Oxford, England
[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, Guangdong Prov Key Lab Comp Vis & Virtual Real Te, Shenzhen, Peoples R China
Funding
National Institutes of Health (USA); National Science Foundation (USA); National Natural Science Foundation of China
Keywords
DOI
10.1109/ICCV.2017.331
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a novel single-shot text detector that directly outputs word-level bounding boxes in a natural image. We propose an attention mechanism which roughly identifies text regions via an automatically learned attentional map. This substantially suppresses background interference in the convolutional features, which is the key to producing accurate inference of words, particularly at extremely small sizes. This results in a single model that essentially works in a coarse-to-fine manner. It departs from recent FCN-based text detectors which cascade multiple FCN models to achieve an accurate prediction. Furthermore, we develop a hierarchical inception module which efficiently aggregates multi-scale inception features. This enhances local details, and also encodes strong context information, allowing the detector to work reliably on multi-scale and multi-orientation text with single-scale images. Our text detector achieves an F-measure of 77% on the ICDAR 2015 benchmark, advancing the state-of-the-art results in [18, 28]. Demo is available at: http://sstd.whuang.org/.
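The abstract's central idea, an attentional map that suppresses background interference in the convolutional features, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it assumes the map is a per-pixel text probability obtained from a two-class softmax and applied by element-wise multiplication, and the function name `apply_regional_attention`, the tensor layouts, and the toy logits are all hypothetical.

```python
import numpy as np


def apply_regional_attention(features, text_logits):
    """Weight a feature map by a per-pixel text probability.

    features:    array of shape (C, H, W), convolutional features.
    text_logits: array of shape (2, H, W), background/text scores
                 (hypothetical layout chosen for this sketch).
    """
    # Two-class softmax at each pixel, shifted for numerical stability.
    shifted = text_logits - text_logits.max(axis=0, keepdims=True)
    exp = np.exp(shifted)
    probs = exp / exp.sum(axis=0, keepdims=True)
    attention = probs[1]  # text probability in [0, 1], shape (H, W)
    # Broadcast the (H, W) map over channels to damp background responses.
    attended = features * attention[None, :, :]
    return attended, attention


# Toy input: an 8-channel 4x4 feature map whose central 2x2 block is "text".
rng = np.random.default_rng(0)
features = rng.standard_normal((8, 4, 4))
text_logits = np.zeros((2, 4, 4))
text_logits[1] = -6.0            # text score is low everywhere ...
text_logits[1, 1:3, 1:3] = 6.0   # ... except in the central block

attended, attention = apply_regional_attention(features, text_logits)
```

In this toy case the features inside the central block pass through almost unchanged, while those outside it are scaled close to zero, which is the coarse localization step the abstract describes before fine word-level prediction.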
Pages: 3066-3074
Page count: 9
Related papers
50 in total
  • [1] Regional attention-based single shot detector for SAR ship detection
    Chen Shiqi
    Zhan Ronghui
    Zhang Jun
    [J]. JOURNAL OF ENGINEERING-JOE, 2019, 2019 (21): : 7381 - 7384
  • [2] A Fusion Strategy for the Single Shot Text Detector
    Yu, Zheng
    Lyu, Shujing
    Lu, Yue
    Wang, Patrick S. P.
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3687 - 3691
  • [3] Attention Based Single Shot Multibox Detector
    Zhao Hui
    Li Zhiwei
    Zhang Tianqi
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (07) : 2096 - 2104
  • [4] Single Shot Text Detector with Rotational Prior Boxes
    Zhu, Wei
    Lou, Jing
    Xia, Qingyuan
    Ren, Mingwu
    [J]. NEURAL PROCESSING LETTERS, 2019, 49 (03) : 863 - 877
  • [5] Single Shot Text Detector with Rotational Prior Boxes
    Wei Zhu
    Jing Lou
    Qingyuan Xia
    Mingwu Ren
    [J]. Neural Processing Letters, 2019, 49 : 863 - 877
  • [6] Scale Pyramid Attention for Single Shot MultiBox Detector
    Hao, Jie
    Jiang, Feng
    Zhang, Rufei
    Lin, Xipeng
    Leng, Biao
    Song, Guanglu
    [J]. IEEE ACCESS, 2019, 7 : 138816 - 138824
  • [7] Single Shot Attention-Based Face Detector
    Zhuang, Chubin
    Zhang, Shifeng
    Zhu, Xiangyu
    Lei, Zhen
    Li, Stan Z.
    [J]. BIOMETRIC RECOGNITION, CCBR 2018, 2018, 10996 : 285 - 293
  • [8] A SINGLE-SHOT ORIENTED SCENE TEXT DETECTOR WITH LEARNABLE ANCHORS
    Sheng, Fenfen
    Chen, Zhineng
    Mei, Tao
    Xu, Bo
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1516 - 1521
  • [9] TSSD: Temporal Single-Shot Detector Based on Attention and LSTM
    Chen, Xingyu
    Wu, Zhengxing
    Yu, Junzhi
    [J]. 2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 5758 - 5763
  • [10] TextBoxes++: A Single-Shot Oriented Scene Text Detector
    Liao, Minghui
    Shi, Baoguang
    Bai, Xiang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (08) : 3676 - 3690