Text proposals with location-awareness-attention network for arbitrarily shaped scene text detection and recognition

Cited by: 12
Authors
Zhong, Dajian [1]
Lyu, Shujing [1,2]
Shivakumara, Palaiahankote [3]
Pal, Umapada [4]
Lu, Yue [1,2]
Affiliations
[1] East China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai 200241, Peoples R China
[2] East China Normal Univ, Sch Commun & Elect Engn, Shanghai 200241, Peoples R China
[3] Univ Malaya, Fac Comp Sci & Informat Technol FSKTM, Kuala Lumpur 50603, Malaysia
[4] Indian Stat Inst, CVPR Unit, Kolkata 700108, India
Funding
National Natural Science Foundation of China
Keywords
Scene text detection; Scene text recognition; Text proposal; Attention model; Location-awareness-attention model; NEURAL-NETWORK; IMAGE
DOI
10.1016/j.eswa.2022.117564
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Unlike existing models that address scene text detection and recognition separately, the proposed work addresses both tasks with a single architecture that handles arbitrarily oriented/shaped text. Towards this aim, a novel Text Proposal with Location-Awareness-Attention Network (TPLAANet) for arbitrarily oriented/shaped text detection and recognition is proposed. For text detection, the proposed method explores central mask prediction for locating text instances, a bounding box regression branch for tight bounding boxes, and a mask branch for accurate positions of arbitrarily oriented/shaped text instances. For text recognition, the proposed method explores character information using a Location-Awareness-Attention Network (LAAN), which learns a two-dimensional attention weight and improves the recognition performance. To test the efficacy of the proposed model, we use the common horizontal and multi-oriented natural scene text datasets ICDAR2013 and ICDAR2015, and the arbitrarily shaped scene text datasets Total-Text and CTW1500. Experimental results are provided to validate the effectiveness of the proposed method. The code is available at: https://codeocean.com/capsule/5666319/tree/v1.
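The abstract states that LAAN learns a two-dimensional attention weight. As a generic illustration of what a 2D attention weight is, and not the authors' LAAN, the sketch below scores every spatial position of a small H x W x C feature map against a query vector, normalizes the scores with a softmax over all positions, and forms a weighted "glimpse" vector. The function name `attention_2d`, the dot-product scoring, and the toy inputs are all assumptions made for this example.

```python
import math


def attention_2d(features, query):
    """Generic 2D dot-product attention (illustrative assumption, not LAAN).

    features: H x W x C feature map as nested lists.
    query:    length-C vector.
    Returns (weights, glimpse): an H x W weight map that sums to 1 and
    the attention-weighted sum of feature vectors (length C).
    """
    # Dot-product score for every spatial position.
    scores = [[sum(f * q for f, q in zip(cell, query)) for cell in row]
              for row in features]
    # Softmax jointly over all H*W positions (subtract the max for stability).
    max_score = max(max(row) for row in scores)
    exps = [[math.exp(s - max_score) for s in row] for row in scores]
    total = sum(sum(row) for row in exps)
    weights = [[e / total for e in row] for row in exps]
    # Glimpse: weighted sum of the feature vectors.
    channels = len(query)
    glimpse = [0.0] * channels
    for weight_row, feature_row in zip(weights, features):
        for w, cell in zip(weight_row, feature_row):
            for c in range(channels):
                glimpse[c] += w * cell[c]
    return weights, glimpse


# Toy 2 x 2 feature map with 2 channels; position (1, 1) matches the query best.
features = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.0, 0.0], [5.0, 0.0]],
]
query = [1.0, 0.0]
weights, glimpse = attention_2d(features, query)
```

In this toy case most of the attention mass falls on position (1, 1), so the glimpse is dominated by that position's feature vector. A recognition decoder would typically recompute such a weight map for each output character.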
Pages: 15