Toward Arbitrary-Shaped Text Spotting Based on End-to-End

Cited by: 3
Authors
Wei, Guangcun [1, 2]
Rong, Wansheng [1]
Liang, Yongquan [1]
Xiao, Xinguang [1]
Liu, Xiang [1]
Affiliations
[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Intelligent Equipment, Tai An 271019, Shandong, Peoples R China
Source
IEEE ACCESS | 2020 / Volume 8 / Issue 08
Keywords
Text recognition; Feature extraction; Task analysis; Detectors; Optimization; Convolution; Optical character recognition software; Natural scene text spotting; SA-BiLSTM; end-to-end; joint optimization; SCENE TEXT; RECOGNITION;
DOI
10.1109/ACCESS.2020.3020387
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Text spotting in natural scenes has become an active research topic, with curved text and long text as its main difficulties. To address both, we propose a novel end-to-end text spotting model consisting of three parts: a shared convolution module, a text detector module, and a text recognizer module. For long text, we adopt a corner attention mechanism to extract features more effectively. For curved text, we feed the rectified feature map into an SA-BiLSTM decoder to improve recognition. More importantly, a joint optimization strategy lets the text detection and text recognition tasks reinforce each other. Experimental results on the TotalText, ICDAR2015, ICDAR2013, CTW1500, COCO-Text, and MLT datasets show that our method achieves excellent performance and robustness on end-to-end text spotting in natural scenes.
Pages: 159906-159914
Number of pages: 9