Toward Arbitrary-Shaped Text Spotting Based on End-to-End

被引：3

作者：

Wei, Guangcun ^{[1
,2
]}

Rong, Wansheng ^{[1
]}

Liang, Yongquan ^{[1
]}

Xiao, Xinguang ^{[1
]}

Liu, Xiang ^{[1
]}

机构：

[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China

[2] Shandong Univ Sci & Technol, Coll Intelligent Equipment, Tai An 271019, Shandong, Peoples R China

来源：

IEEE ACCESS | 2020年 / 8卷 / 08期

关键词：

Text recognition; Feature extraction; Task analysis; Detectors; Optimization; Convolution; Optical character recognition software; Natural scene text spotting; SA-BiLSTM; end-to-end; joint optimization; SCENE TEXT; RECOGNITION;

D O I：

10.1109/ACCESS.2020.3020387

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

At present, text spotting in natural scenes has become one of the research hotspots. Among them, curvilinear text and long text are the main difficulties of text spotting in natural scenes. To better solve these two types of problems, we propose a novel end-to-end text spotting model. The model includes three parts: shared convolution module, text detector module and text recognizer module. For the problem of long text, we adopt the corner attention mechanism to extract the features of long text more effectively. For the problem of curve text, we feed the rectification feature map into the SA-BiLSTM decoder to recognize the curve text more effectively. More importantly, the joint optimization strategy realizes the mutual promotion function of the text detection task and the text recognition task. Experimental results on TotalText, ICDAR2015, ICDAR2013, CTW1500, COCO-Text and MLT datasets prove that our method achieves excellent performance and robustness in text spotting tasks based on end-to-end natural scenes.

引用

页码：159906 / 159914

页数：9

共 50 条

[31] An end-to-end text spotter with text relation networks
Jianguo Jiang
Baole Wei
Min Yu
Gang Li
Boquan Li
Chao Liu
Min Li
Weiqing Huang
Cybersecurity, 4
[32] Fourier Contour Embedding for Arbitrary-Shaped Text Detection
Zhu, Yiqin
Chen, Jianyong
Liang, Lingyu
Kuang, Zhanghui
Jin, Lianwen
Zhang, Wayne
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3122 - 3130
[33] Wavelet descriptor network for arbitrary-shaped text detection
Zhang, Zixu
Tong, Minglei
JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
[34] An end-to-end text spotter with text relation networks
Jiang, Jianguo
Wei, Baole
Yu, Min
Li, Gang
Li, Boquan
Liu, Chao
Li, Min
Huang, Weiqing
CYBERSECURITY, 2021, 4 (01)
[35] Transformer-based end-to-end scene text recognition
Zhu, Xinghao
Zhang, Zhi
PROCEEDINGS OF THE 2021 IEEE 16TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2021), 2021, : 1691 - 1695
[36] An End-to-End Scene Text Recognition for Bilingual Text
Albalawi, Bayan M.
Jamal, Amani T.
Al Khuzayem, Lama A.
Alsaedi, Olaa A.
BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (09)
[37] Attention-based End-to-End Models for Small-Footprint Keyword Spotting
Shan, Changhao
Zhang, Junbo
Wang, Yujun
Xie, Lei
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2037 - 2041
[38] Arbitrary-shaped scene text detection with keypoint-based shape representation
Shuxin Qin
Lin Chen
International Journal on Document Analysis and Recognition (IJDAR), 2022, 25 : 115 - 127
[39] CM-Net: Concentric Mask Based Arbitrary-Shaped Text Detection
Yang, Chuang
Chen, Mulin
Xiong, Zhitong
Yuan, Yuan
Wang, Qi
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 2864 - 2877
[40] Arbitrary-shaped scene text detection with keypoint-based shape representation
Qin, Shuxin
Chen, Lin
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2022, 25 (02) : 115 - 127

← 1 2 3 4 5 →