Toward Arbitrary-Shaped Text Spotting Based on End-to-End

被引:3
|
作者
Wei, Guangcun [1 ,2 ]
Rong, Wansheng [1 ]
Liang, Yongquan [1 ]
Xiao, Xinguang [1 ]
Liu, Xiang [1 ]
机构
[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Intelligent Equipment, Tai An 271019, Shandong, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷 / 08期
关键词
Text recognition; Feature extraction; Task analysis; Detectors; Optimization; Convolution; Optical character recognition software; Natural scene text spotting; SA-BiLSTM; end-to-end; joint optimization; SCENE TEXT; RECOGNITION;
D O I
10.1109/ACCESS.2020.3020387
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
At present, text spotting in natural scenes has become one of the research hotspots. Among them, curvilinear text and long text are the main difficulties of text spotting in natural scenes. To better solve these two types of problems, we propose a novel end-to-end text spotting model. The model includes three parts: shared convolution module, text detector module and text recognizer module. For the problem of long text, we adopt the corner attention mechanism to extract the features of long text more effectively. For the problem of curve text, we feed the rectification feature map into the SA-BiLSTM decoder to recognize the curve text more effectively. More importantly, the joint optimization strategy realizes the mutual promotion function of the text detection task and the text recognition task. Experimental results on TotalText, ICDAR2015, ICDAR2013, CTW1500, COCO-Text and MLT datasets prove that our method achieves excellent performance and robustness in text spotting tasks based on end-to-end natural scenes.
引用
收藏
页码:159906 / 159914
页数:9
相关论文
共 50 条
  • [21] End-to-end Keywords Spotting Based on Connectionist Temporal Classification for Mandarin
    Bai, Ye
    Yi, Jiangyan
    Ni, Hao
    Wen, Zhengqi
    Liu, Bin
    Li, Ya
    Tao, Jianhua
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [22] An End-to-End Model Based on TDNN-BiGRU for Keyword Spotting
    Chai, Shuzhou
    Yang, Zhenye
    Lv, Changsheng
    Zhang, Wei-Qiang
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2019, : 402 - 406
  • [23] Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting
    Chen, Ying
    Qiao, Liang
    Cheng, Zhanzhan
    Pu, Shiliang
    Niu, Yi
    Li, Xi
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 356 - 373
  • [24] Arbitrary-Shaped Text Detection With Adaptive Text Region Representation
    Jiang, Xiufeng
    Xu, Shugong
    Zhang, Shunqing
    Cao, Shan
    IEEE ACCESS, 2020, 8 : 102106 - 102118
  • [25] Metadata-Aware End-to-End Keyword Spotting
    Liu, Hongyi
    Abhyankar, Apurva
    Mishchenko, Yuriy
    Senechal, Thibaud
    Fu, Gengshen
    Kulis, Brian
    Stein, Noah
    Shah, Anish
    Vitaladevuni, Shiv Naga Prasad
    INTERSPEECH 2020, 2020, : 2282 - 2286
  • [26] End-to-End Multi-Look Keyword Spotting
    Yu, Meng
    Ji, Xuan
    Wu, Bo
    Su, Dan
    Yu, Dong
    INTERSPEECH 2020, 2020, : 66 - 70
  • [27] Fuzzy Semantics for Arbitrary-Shaped Scene Text Detection
    Wang, Fangfang
    Xu, Xiaogang
    Chen, Yifeng
    Li, Xi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1 - 12
  • [28] An End-to-End Attack on Text CAPTCHAs
    Zi, Yang
    Gao, Haichang
    Cheng, Zhouhang
    Liu, Yi
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, 15 : 753 - 766
  • [29] END-TO-END CHINESE TEXT RECOGNITION
    Hu, Jie
    Guo, Tszhang
    Cao, Ji
    Zhang, Changshui
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 1407 - 1411
  • [30] End-to-End Scene Text Recognition
    Wang, Kai
    Babenko, Boris
    Belongie, Serge
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 1457 - 1464