Region-Aware Arbitrary-Shaped Text Detection With Progressive Fusion

Cited by: 4

Authors:

Wang, Qitong [1,2]

Fu, Bin [3]

Li, Ming [3]

He, Junjun [3]

Peng, Xi [2]

Qiao, Yu [3,4]

Affiliations:

[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China

[2] Univ Delaware, Dept Comp & Informat Sci, Newark, DE 19716 USA

[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, Guangdong Hong Kong Macao Joint Lab Human Machine, Shenzhen 518055, Peoples R China

[4] Shanghai AI Lab, Shanghai 200031, Peoples R China

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2023 / Vol. 25

Keywords:

Scene text detection; scene understanding; deep learning;

DOI:

10.1109/TMM.2022.3181448

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Segmentation-based text detectors can flexibly capture arbitrary-shaped text regions. Because text geometry varies widely, effective and robust representations are needed to identify text regions of various shapes and scales. In this paper, we focus on designing effective multi-scale contextual features for locating text instances. Specifically, we develop a Region Context Module (RCM) to summarize the semantic response and adaptively extract text-region-aware information in a limited local area. To construct complementary multi-scale contextual representations, multiple RCM branches with different scales are employed and integrated via a Progressive Fusion Module (PFM). The proposed RCM and PFM serve as plug-and-play modules that can be incorporated into existing scene text detection platforms to further boost detection performance. Extensive experiments show that our methods achieve state-of-the-art performance on the Total-Text, SCUT-CTW1500 and MSRA-TD500 datasets. The code and models will be made publicly available at https://github.com/wqtwjt1996/RP-Text.


Pages: 4718-4729

Page count: 12

50 items in total

[31] Arbitrary-Shaped Scene Text Recognition with Deformable Ensemble Attention
Xu, Shuo
Zhuang, Zeming
Li, Mingjun
Su, Feng
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), (237-253):
[32] Text Perceptron: Towards End-to-End Arbitrary-Shaped Text Spotting
Qiao, Liang
Tang, Sanli
Cheng, Zhanzhan
Xu, Yunlu
Niu, Yi
Pu, Shiliang
Wu, Fei
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11899 - 11907
[33] ReLaText: Exploiting visual relationships for arbitrary-shaped scene text detection with graph convolutional networks
Ma, Chixiang
Sun, Lei
Zhong, Zhuoyao
Huo, Qiang
PATTERN RECOGNITION, 2021, 111
[34] BIP-NET: BIDIRECTIONAL PERSPECTIVE STRATEGY BASED ARBITRARY-SHAPED TEXT DETECTION NETWORK
Yang, Chuang
Chen, Mulin
Yuan, Yuan
Wang, Qi
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022 : 2255 - 2259
[35] SegLink++: Detecting Dense and Arbitrary-shaped Scene Text by Instance-aware Component Grouping
Tang, Jun
Yang, Zhibo
Wang, Yongpan
Zheng, Qi
Xu, Yongchao
Bai, Xiang
PATTERN RECOGNITION, 2019, 96
[36] All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting
Wang, Hao
Lu, Pu
Zhang, Hui
Yang, Mingkun
Bai, Xiang
Xu, Yongchao
He, Mengchao
Wang, Yongpan
Liu, Wenyu
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12160 - 12167
[37] Progressive mesh-based coding of arbitrary-shaped video objects
Jordan, CL
Ebrahimi, T
Kunt, M
VISUAL COMMUNICATIONS AND IMAGE PROCESSING '99, PARTS 1-2, 1998, 3653 : 1190 - 1201
[38] Toward Arbitrary-Shaped Text Spotting Based on End-to-End
Wei, Guangcun
Rong, Wansheng
Liang, Yongquan
Xiao, Xinguang
Liu, Xiang
IEEE ACCESS, 2020, 8 (08) : 159906 - 159914
[39] Supervised Attention Network for Arbitrary-Shaped Text Detection in Edge-Fainted Noisy Scene Images
Soni, Aishwarya
Dutta, Tanima
Nigam, Nitika
Verma, Deepali
Gupta, Hari Prabhat
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (03) : 1179 - 1188
[40] An efficient and universal polygon prediction method based on derivable analytic geometry for arbitrary-shaped text detection
Zhang, Xiangnan
Tian, Chunna
Gao, Xinbo
VISUAL COMPUTER, 2024, 40 (06) : 4273 - 4285
