Text proposals with location-awareness-attention network for arbitrarily shaped scene text detection and recognition

被引:12
|
作者
Zhong, Dajian [1 ]
Lyu, Shujing [1 ,2 ]
Shivakumara, Palaiahankote [3 ]
Pal, Umapada [4 ]
Lu, Yue [1 ,2 ]
机构
[1] East China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai 200241, Peoples R China
[2] East China Normal Univ, Sch Commun & Elect Engn, Shanghai 200241, Peoples R China
[3] Univ Malaya, Fac Comp Sci & Informat Technol FSKTM, Kuala Lumpur 50603, Malaysia
[4] Indian Stat Inst, CVPR Unit, Kolkata 700108, India
基金
中国国家自然科学基金;
关键词
Scene text detection; Scene text recognition; Text proposal; Attention model; Location-awareness-attention model; NEURAL-NETWORK; IMAGE;
D O I
10.1016/j.eswa.2022.117564
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unlike existing models that aim to address the challenge of scene text detection and recognition separately, the proposed work aims to address both text detection and recognition using a single architecture to deal with arbitrarily oriented/shaped text. Towards this aim, a novel Text Proposal with Location-AwarenessAttention Network (TPLAANet) for arbitrarily oriented/shaped text detection and recognition is proposed. For text detection, the proposed method explores central mask prediction for locating text instances, bounding box regression branch for tight bounding boxes, and mask branch for accurate positions of arbitrarily oriented/shaped text instances. For text recognition, the proposed method explores character information using a Location-Awareness-Attention Network (LAAN), which learns a two-dimensional attention weight and improves the recognition performance. To test the efficacy of the proposed model, we consider the commonly used horizontal and multi-oriented natural scene text datasets, namely, ICDAR2013, ICDAR2015, and the arbitrarily shaped scene text datasets, namely, Total-Text and CTW1500 for experimentation. Experimental results are provided to validate the effectiveness of the proposed method. The code is available at: https: //codeocean.com/capsule/5666319/tree/v1.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Arbitrarily Shaped Scene Text Detection With a Mask Tightness Text Detector
    Liu, Yuliang
    Jin, Lianwen
    Fang, Chuanming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 2918 - 2930
  • [2] Arbitrarily shaped scene text detection with dynamic convolution
    Cai, Ying
    Liu, Yuliang
    Shen, Chunhua
    Jin, Lianwen
    Li, Yidong
    Ergu, Daji
    PATTERN RECOGNITION, 2022, 127
  • [3] weakly supervised text attention network for generating text proposals in scene images
    Li Rong
    En Mengyi
    Li Jianqiang
    Zhang haibin
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 324 - 330
  • [4] SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text Recognition
    Zhong, Dajian
    Lyu, Shujing
    Shivakumara, Palaiahnakote
    Yin, Bing
    Wu, Jiajia
    Pal, Umapada
    Lu, Yue
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 464 - 480
  • [5] Scene Text Recognition with Cascade Attention Network
    Zhang, Min
    Ma, Meng
    Wang, Ping
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 385 - 393
  • [6] CHARACTER REGION AWARENESS NETWORK FOR SCENE TEXT RECOGNITION
    Shang, Mingyu
    Gao, Jie
    Sun, Jun
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [7] Gaussian Constrained Attention Network for Scene Text Recognition
    Qiao, Zhi
    Qin, Xugong
    Zhou, Yu
    Yang, Fei
    Wang, Weiping
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3328 - 3335
  • [8] Scene Text Recognition by Attention Network with Gated Embedding
    Wang, Cong
    Liu, Cheng-Lin
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [9] DENSE CHAINED ATTENTION NETWORK FOR SCENE TEXT RECOGNITION
    Gao, Yunze
    Chen, Yingying
    Wang, Jinqiao
    Tang, Ming
    Lu, Hanqing
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 679 - 683
  • [10] Spatial attention contrastive network for scene text recognition
    Wang, Fan
    Yin, Dong
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)