Region-Aware Arbitrary-Shaped Text Detection With Progressive Fusion

被引:4
|
作者
Wang, Qitong [1 ,2 ]
Fu, Bin [3 ]
Li, Ming [3 ]
He, Junjun [3 ]
Peng, Xi [2 ]
Qiao, Yu [3 ,4 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[2] Univ Delaware, Dept Comp & Informat Sci, Newark, DE 19716 USA
[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, Guangdong Hong Kong Macao Joint Lab Human Machine, Shenzhen 518055, Peoples R China
[4] Shanghai AI Lab, Shanghai 200031, Peoples R China
关键词
Scene text detection; scene understanding; deep learning;
D O I
10.1109/TMM.2022.3181448
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Segmentation-based text detectors are flexible to capture arbitrary-shaped text regions. Due to large geometry variance, it is necessary to construct effective and robust representations to identify text regions with various shapes and scales. In this paper, we focus on designing effective multi-scale contextual features for locating text instances. Specially, we develop a Region Context Module (RCM) to summarize the semantic response and adaptively extract text-region-aware information in a limited local area. To construct complementary multi-scale contextual representations, multiple RCM branches with different scales are employed and integrated via Progressive Fusion Module (PFM). Our proposed RCM and PFM serve as the plug-and-play modules which can be incorporated into existing scene text detection platforms to further boost detection performance. Extensive experiments show that our methods achieve state-of-the-art performances on Total-Text, SCUT-CTW1500 and MSRA-TD500 datasets. The code with models will become publicly available at https://github.com/wqtwjt1996/RP-Text.
引用
收藏
页码:4718 / 4729
页数:12
相关论文
共 50 条
  • [31] Arbitrary-Shaped Scene Text Recognition with Deformable Ensemble Attention
    Xu, Shuo
    Zhuang, Zeming
    Li, Mingjun
    Su, Feng
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), (237-253):
  • [32] Text Perceptron: Towards End-to-End Arbitrary-Shaped Text Spotting
    Qiao, Liang
    Tang, Sanli
    Cheng, Zhanzhan
    Xu, Yunlu
    Niu, Yi
    Pu, Shiliang
    Wu, Fei
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11899 - 11907
  • [33] ReLaText: Exploiting visual relationships for arbitrary-shaped scene text detection with graph convolutional networks
    Ma, Chixiang
    Sun, Lei
    Zhong, Zhuoyao
    Huo, Qiang
    PATTERN RECOGNITION, 2021, 111
  • [34] BIP-NET: BIDIRECTIONAL PERSPECTIVE STRATEGY BASED ARBITRARY-SHAPED TEXT DETECTION NETWORK
    Yang, Chuang
    Chen, Mulin
    Yuan, Yuan
    Wang, Qi
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2255 - 2259
  • [35] SegLink plus plus : Detecting Dense and Arbitrary-shaped Scene Text by Instance-aware Component Grouping
    Tang, Jun
    Yang, Zhibo
    Wang, Yongpan
    Zheng, Qi
    Xu, Yongchao
    Bai, Xiang
    PATTERN RECOGNITION, 2019, 96
  • [36] All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting
    Wang, Hao
    Lu, Pu
    Zhang, Hui
    Yang, Mingkun
    Bai, Xiang
    Xu, Yongchao
    He, Mengchao
    Wang, Yongpan
    Liu, Wenyu
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12160 - 12167
  • [37] Progressive mesh-based coding of arbitrary-shaped video objects
    Jordan, CL
    Ebrahimi, T
    Kunt, M
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING '99, PARTS 1-2, 1998, 3653 : 1190 - 1201
  • [38] Toward Arbitrary-Shaped Text Spotting Based on End-to-End
    Wei, Guangcun
    Rong, Wansheng
    Liang, Yongquan
    Xiao, Xinguang
    Liu, Xiang
    IEEE ACCESS, 2020, 8 (08): : 159906 - 159914
  • [39] Supervised Attention Network for Arbitrary-Shaped Text Detection in Edge-Fainted Noisy Scene Images
    Soni, Aishwarya
    Dutta, Tanima
    Nigam, Nitika
    Verma, Deepali
    Gupta, Hari Prabhat
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (03) : 1179 - 1188
  • [40] An efficient and universal polygon prediction method based on derivable analytic geometry for arbitrary-shaped text detection
    Zhang, Xiangnan
    Tian, Chunna
    Gao, Xinbo
    VISUAL COMPUTER, 2024, 40 (06): : 4273 - 4285