Toward Arbitrary-Shaped Text Spotting Based on End-to-End

被引:3
|
作者
Wei, Guangcun [1 ,2 ]
Rong, Wansheng [1 ]
Liang, Yongquan [1 ]
Xiao, Xinguang [1 ]
Liu, Xiang [1 ]
机构
[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Intelligent Equipment, Tai An 271019, Shandong, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷 / 08期
关键词
Text recognition; Feature extraction; Task analysis; Detectors; Optimization; Convolution; Optical character recognition software; Natural scene text spotting; SA-BiLSTM; end-to-end; joint optimization; SCENE TEXT; RECOGNITION;
D O I
10.1109/ACCESS.2020.3020387
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
At present, text spotting in natural scenes has become one of the research hotspots. Among them, curvilinear text and long text are the main difficulties of text spotting in natural scenes. To better solve these two types of problems, we propose a novel end-to-end text spotting model. The model includes three parts: shared convolution module, text detector module and text recognizer module. For the problem of long text, we adopt the corner attention mechanism to extract the features of long text more effectively. For the problem of curve text, we feed the rectification feature map into the SA-BiLSTM decoder to recognize the curve text more effectively. More importantly, the joint optimization strategy realizes the mutual promotion function of the text detection task and the text recognition task. Experimental results on TotalText, ICDAR2015, ICDAR2013, CTW1500, COCO-Text and MLT datasets prove that our method achieves excellent performance and robustness in text spotting tasks based on end-to-end natural scenes.
引用
收藏
页码:159906 / 159914
页数:9
相关论文
共 50 条
  • [1] Text Perceptron: Towards End-to-End Arbitrary-Shaped Text Spotting
    Qiao, Liang
    Tang, Sanli
    Cheng, Zhanzhan
    Xu, Yunlu
    Niu, Yi
    Pu, Shiliang
    Wu, Fei
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11899 - 11907
  • [2] TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting
    Feng, Wei
    He, Wenhao
    Yin, Fei
    Zhang, Xu-Yao
    Liu, Cheng-Lin
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9075 - 9084
  • [3] Boundary TextSpotter: Toward Arbitrary-Shaped Scene Text Spotting
    Lu, Pu
    Wang, Hao
    Zhu, Shenggao
    Wang, Jing
    Bai, Xiang
    Liu, Wenyu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6200 - 6212
  • [4] TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text
    Singh, Amanpreet
    Peng, Guan
    Toh, Mandy
    Huang, Jing
    Galuba, Wojciech
    Hassner, Tal
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8798 - 8808
  • [5] SText-DETR: End-to-End Arbitrary-Shaped Text Detection with Scalable Query in Transformer
    Liao, Pujin
    Wang, Zengfu
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 481 - 492
  • [6] Scene text spotting based on end-to-end
    Wei G.
    Rong W.
    Liang Y.
    Xiao X.
    Liu X.
    Journal of Intelligent and Fuzzy Systems, 2021, 40 (05): : 8871 - 8881
  • [7] All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting
    Wang, Hao
    Lu, Pu
    Zhang, Hui
    Yang, Mingkun
    Bai, Xiang
    Xu, Yongchao
    He, Mengchao
    Wang, Yongpan
    Liu, Wenyu
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12160 - 12167
  • [8] POINTER NETWORKS FOR ARBITRARY-SHAPED TEXT SPOTTING
    Zhang, Yi
    Yang, Wei
    Xu, Zhenbo
    Li, Yingjie
    Chen, Zhi
    Huang, Liusheng
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2375 - 2379
  • [9] Towards Unconstrained End-to-End Text Spotting
    Qin, Siyang
    Bissacco, Alessandro
    Raptis, Michalis
    Fujii, Yasuhisa
    Xiao, Ying
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4703 - 4713
  • [10] End-to-End Video Text Spotting with Transformer
    Wu, Weijia
    Cai, Yuanqiang
    Shen, Chunhua
    Zhang, Debing
    Fu, Ying
    Zhou, Hong
    Luo, Ping
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (09) : 4019 - 4035