Region-Aware Arbitrary-Shaped Text Detection With Progressive Fusion

被引:4
|
作者
Wang, Qitong [1 ,2 ]
Fu, Bin [3 ]
Li, Ming [3 ]
He, Junjun [3 ]
Peng, Xi [2 ]
Qiao, Yu [3 ,4 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[2] Univ Delaware, Dept Comp & Informat Sci, Newark, DE 19716 USA
[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, Guangdong Hong Kong Macao Joint Lab Human Machine, Shenzhen 518055, Peoples R China
[4] Shanghai AI Lab, Shanghai 200031, Peoples R China
关键词
Scene text detection; scene understanding; deep learning;
D O I
10.1109/TMM.2022.3181448
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Segmentation-based text detectors are flexible to capture arbitrary-shaped text regions. Due to large geometry variance, it is necessary to construct effective and robust representations to identify text regions with various shapes and scales. In this paper, we focus on designing effective multi-scale contextual features for locating text instances. Specially, we develop a Region Context Module (RCM) to summarize the semantic response and adaptively extract text-region-aware information in a limited local area. To construct complementary multi-scale contextual representations, multiple RCM branches with different scales are employed and integrated via Progressive Fusion Module (PFM). Our proposed RCM and PFM serve as the plug-and-play modules which can be incorporated into existing scene text detection platforms to further boost detection performance. Extensive experiments show that our methods achieve state-of-the-art performances on Total-Text, SCUT-CTW1500 and MSRA-TD500 datasets. The code with models will become publicly available at https://github.com/wqtwjt1996/RP-Text.
引用
收藏
页码:4718 / 4729
页数:12
相关论文
共 50 条
  • [1] Arbitrary-Shaped Text Detection With Adaptive Text Region Representation
    Jiang, Xiufeng
    Xu, Shugong
    Zhang, Shunqing
    Cao, Shan
    IEEE ACCESS, 2020, 8 : 102106 - 102118
  • [2] Bidirectional Regression for Arbitrary-Shaped Text Detection
    Sheng, Tao
    Lian, Zhouhui
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 : 187 - 201
  • [3] Fuzzy Semantics for Arbitrary-Shaped Scene Text Detection
    Wang, Fangfang
    Xu, Xiaogang
    Chen, Yifeng
    Li, Xi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1 - 12
  • [4] Fourier Contour Embedding for Arbitrary-Shaped Text Detection
    Zhu, Yiqin
    Chen, Jianyong
    Liang, Lingyu
    Kuang, Zhanghui
    Jin, Lianwen
    Zhang, Wayne
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3122 - 3130
  • [5] Wavelet descriptor network for arbitrary-shaped text detection
    Zhang, Zixu
    Tong, Minglei
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
  • [6] Arbitrary-shaped scene text detection by predicting distance map
    Xinyu Wang
    Yaohua Yi
    Jibing Peng
    Kaili Wang
    Applied Intelligence, 2022, 52 : 14374 - 14386
  • [7] Arbitrary-shaped scene text detection by predicting distance map
    Wang, Xinyu
    Yi, Yaohua
    Peng, Jibing
    Wang, Kaili
    APPLIED INTELLIGENCE, 2022, 52 (12) : 14374 - 14386
  • [8] Learning Pixel Affinity Pyramid for Arbitrary-Shaped Text Detection
    Fu, Zilong
    Xie, Hongtao
    Fang, Shancheng
    Wang, Yuxin
    Xing, Mengting
    Zhang, Yongdong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
  • [9] Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection
    Han, Xu
    Gao, Junyu
    Yang, Chuang
    Yuan, Yuan
    Wang, Qi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 287 - 299
  • [10] Margin Guidance Network for Arbitrary-shaped Scene Text Detection
    Li, Xin
    Wu, Xingjiao
    Ma, Tianlong
    Zhou, Zhao
    Chen, Luhui
    He, Liang
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 1111 - 1117