Region-Aware Arbitrary-Shaped Text Detection With Progressive Fusion

被引:4
|
作者
Wang, Qitong [1 ,2 ]
Fu, Bin [3 ]
Li, Ming [3 ]
He, Junjun [3 ]
Peng, Xi [2 ]
Qiao, Yu [3 ,4 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[2] Univ Delaware, Dept Comp & Informat Sci, Newark, DE 19716 USA
[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, Guangdong Hong Kong Macao Joint Lab Human Machine, Shenzhen 518055, Peoples R China
[4] Shanghai AI Lab, Shanghai 200031, Peoples R China
关键词
Scene text detection; scene understanding; deep learning;
D O I
10.1109/TMM.2022.3181448
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Segmentation-based text detectors are flexible to capture arbitrary-shaped text regions. Due to large geometry variance, it is necessary to construct effective and robust representations to identify text regions with various shapes and scales. In this paper, we focus on designing effective multi-scale contextual features for locating text instances. Specially, we develop a Region Context Module (RCM) to summarize the semantic response and adaptively extract text-region-aware information in a limited local area. To construct complementary multi-scale contextual representations, multiple RCM branches with different scales are employed and integrated via Progressive Fusion Module (PFM). Our proposed RCM and PFM serve as the plug-and-play modules which can be incorporated into existing scene text detection platforms to further boost detection performance. Extensive experiments show that our methods achieve state-of-the-art performances on Total-Text, SCUT-CTW1500 and MSRA-TD500 datasets. The code with models will become publicly available at https://github.com/wqtwjt1996/RP-Text.
引用
收藏
页码:4718 / 4729
页数:12
相关论文
共 50 条
  • [21] TextDCT: Arbitrary-Shaped Text Detection via Discrete Cosine Transform Mask
    Su, Yuchen
    Shao, Zhiwen
    Zhou, Yong
    Meng, Fanrong
    Zhu, Hancheng
    Liu, Bing
    Yao, Rui
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 5030 - 5042
  • [22] CT-Net: Arbitrary-Shaped Text Detection via Contour Transformer
    Shao, Zhiwen
    Su, Yuchen
    Zhou, Yong
    Meng, Fanrong
    Zhu, Hancheng
    Liu, Bing
    Yao, Rui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1815 - 1826
  • [23] Arbitrary-shaped scene text detection with keypoint-based shape representation
    Qin, Shuxin
    Chen, Lin
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2022, 25 (02) : 115 - 127
  • [24] Arbitrary-shaped text detection with adaptive convolution and path enhancement pyramid network
    Qi Cheng
    Guodong Wang
    Qian Dong
    Bin Wei
    Multimedia Tools and Applications, 2020, 79 : 29225 - 29242
  • [25] All You Need Is a Second Look: Towards Arbitrary-Shaped Text Detection
    Cao, Meng
    Zhang, Can
    Yang, Dongming
    Zou, Yuexian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (02) : 758 - 767
  • [26] Kernel-mask knowledge distillation for efficient and accurate arbitrary-shaped text detection
    Honghui Chen
    Yuhang Qiu
    Mengxi Jiang
    Jianhui Lin
    Pingping Chen
    Complex & Intelligent Systems, 2024, 10 : 75 - 86
  • [27] Kernel-mask knowledge distillation for efficient and accurate arbitrary-shaped text detection
    Chen, Honghui
    Qiu, Yuhang
    Jiang, Mengxi
    Lin, Jianhui
    Chen, Pingping
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (01) : 75 - 86
  • [28] TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection
    Wang, Fangfang
    Chen, Yifeng
    Wu, Fei
    Li, Xi
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 111 - 119
  • [29] JMNET: Arbitrary-shaped scene text detection using multi-space perception
    Lin, Zhijian
    Chen, Ying
    Chen, Pingping
    Chen, Honghui
    Chen, Feng
    Ling, Nam
    NEUROCOMPUTING, 2022, 513 : 261 - 272
  • [30] Boundary TextSpotter: Toward Arbitrary-Shaped Scene Text Spotting
    Lu, Pu
    Wang, Hao
    Zhu, Shenggao
    Wang, Jing
    Bai, Xiang
    Liu, Wenyu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6200 - 6212