Toward Arbitrary-Shaped Text Spotting Based on End-to-End

被引：3

作者：

Wei, Guangcun ^{[1
,2
]}

Rong, Wansheng ^{[1
]}

Liang, Yongquan ^{[1
]}

Xiao, Xinguang ^{[1
]}

Liu, Xiang ^{[1
]}

机构：

[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China

[2] Shandong Univ Sci & Technol, Coll Intelligent Equipment, Tai An 271019, Shandong, Peoples R China

来源：

IEEE ACCESS | 2020年 / 8卷 / 08期

关键词：

Text recognition; Feature extraction; Task analysis; Detectors; Optimization; Convolution; Optical character recognition software; Natural scene text spotting; SA-BiLSTM; end-to-end; joint optimization; SCENE TEXT; RECOGNITION;

D O I：

10.1109/ACCESS.2020.3020387

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

At present, text spotting in natural scenes has become one of the research hotspots. Among them, curvilinear text and long text are the main difficulties of text spotting in natural scenes. To better solve these two types of problems, we propose a novel end-to-end text spotting model. The model includes three parts: shared convolution module, text detector module and text recognizer module. For the problem of long text, we adopt the corner attention mechanism to extract the features of long text more effectively. For the problem of curve text, we feed the rectification feature map into the SA-BiLSTM decoder to recognize the curve text more effectively. More importantly, the joint optimization strategy realizes the mutual promotion function of the text detection task and the text recognition task. Experimental results on TotalText, ICDAR2015, ICDAR2013, CTW1500, COCO-Text and MLT datasets prove that our method achieves excellent performance and robustness in text spotting tasks based on end-to-end natural scenes.

引用

页码：159906 / 159914

页数：9

共 50 条

[1] Text Perceptron: Towards End-to-End Arbitrary-Shaped Text Spotting
Qiao, Liang
Tang, Sanli
Cheng, Zhanzhan
Xu, Yunlu
Niu, Yi
Pu, Shiliang
Wu, Fei
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11899 - 11907
[2] TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting
Feng, Wei
He, Wenhao
Yin, Fei
Zhang, Xu-Yao
Liu, Cheng-Lin
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9075 - 9084
[3] Boundary TextSpotter: Toward Arbitrary-Shaped Scene Text Spotting
Lu, Pu
Wang, Hao
Zhu, Shenggao
Wang, Jing
Bai, Xiang
Liu, Wenyu
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6200 - 6212
[4] TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text
Singh, Amanpreet
Peng, Guan
Toh, Mandy
Huang, Jing
Galuba, Wojciech
Hassner, Tal
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8798 - 8808
[5] SText-DETR: End-to-End Arbitrary-Shaped Text Detection with Scalable Query in Transformer
Liao, Pujin
Wang, Zengfu
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 481 - 492
[6] Scene text spotting based on end-to-end
Wei G.
Rong W.
Liang Y.
Xiao X.
Liu X.
Journal of Intelligent and Fuzzy Systems, 2021, 40 (05): : 8871 - 8881
[7] All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting
Wang, Hao
Lu, Pu
Zhang, Hui
Yang, Mingkun
Bai, Xiang
Xu, Yongchao
He, Mengchao
Wang, Yongpan
Liu, Wenyu
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12160 - 12167
[8] POINTER NETWORKS FOR ARBITRARY-SHAPED TEXT SPOTTING
Zhang, Yi
Yang, Wei
Xu, Zhenbo
Li, Yingjie
Chen, Zhi
Huang, Liusheng
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2375 - 2379
[9] Towards Unconstrained End-to-End Text Spotting
Qin, Siyang
Bissacco, Alessandro
Raptis, Michalis
Fujii, Yasuhisa
Xiao, Ying
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4703 - 4713
[10] End-to-End Video Text Spotting with Transformer
Wu, Weijia
Cai, Yuanqiang
Shen, Chunhua
Zhang, Debing
Fu, Ying
Zhou, Hong
Luo, Ping
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (09) : 4019 - 4035

← 1 2 3 4 5 →