Region-Aware Arbitrary-Shaped Text Detection With Progressive Fusion

被引:4
|
作者
Wang, Qitong [1 ,2 ]
Fu, Bin [3 ]
Li, Ming [3 ]
He, Junjun [3 ]
Peng, Xi [2 ]
Qiao, Yu [3 ,4 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[2] Univ Delaware, Dept Comp & Informat Sci, Newark, DE 19716 USA
[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, Guangdong Hong Kong Macao Joint Lab Human Machine, Shenzhen 518055, Peoples R China
[4] Shanghai AI Lab, Shanghai 200031, Peoples R China
关键词
Scene text detection; scene understanding; deep learning;
D O I
10.1109/TMM.2022.3181448
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Segmentation-based text detectors are flexible to capture arbitrary-shaped text regions. Due to large geometry variance, it is necessary to construct effective and robust representations to identify text regions with various shapes and scales. In this paper, we focus on designing effective multi-scale contextual features for locating text instances. Specially, we develop a Region Context Module (RCM) to summarize the semantic response and adaptively extract text-region-aware information in a limited local area. To construct complementary multi-scale contextual representations, multiple RCM branches with different scales are employed and integrated via Progressive Fusion Module (PFM). Our proposed RCM and PFM serve as the plug-and-play modules which can be incorporated into existing scene text detection platforms to further boost detection performance. Extensive experiments show that our methods achieve state-of-the-art performances on Total-Text, SCUT-CTW1500 and MSRA-TD500 datasets. The code with models will become publicly available at https://github.com/wqtwjt1996/RP-Text.
引用
收藏
页码:4718 / 4729
页数:12
相关论文
共 50 条
  • [41] Cross-Level Attention Based Adaptive Feature Alignment Network for Arbitrary-Shaped Text Detection
    Zhang, Haiyan
    Li, Sumei
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2243 - 2248
  • [42] SText-DETR: End-to-End Arbitrary-Shaped Text Detection with Scalable Query in Transformer
    Liao, Pujin
    Wang, Zengfu
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 481 - 492
  • [43] R-CCF: region-aware continual contrastive fusion for weakly supervised object detection
    Zhang, Yongqiang
    Tian, Rui
    Zhang, Yin
    Zhang, Zian
    Bai, Yancheng
    Ding, Mingli
    Zuo, Wangmeng
    APPLIED INTELLIGENCE, 2024, 54 (06) : 4689 - 4712
  • [44] R-CCF: region-aware continual contrastive fusion for weakly supervised object detection
    Yongqiang Zhang
    Rui Tian
    Yin Zhang
    Zian Zhang
    Yancheng Bai
    Mingli Ding
    Wangmeng Zuo
    Applied Intelligence, 2024, 54 : 4689 - 4712
  • [45] Region-aware RGB and near-infrared image fusion
    Ying, Jiacheng
    Tong, Can
    Sheng, Zehua
    Yao, Bowen
    Cao, Si-Yuan
    Yu, Heng
    Shen, Hui-Liang
    PATTERN RECOGNITION, 2023, 142
  • [46] Adaptive region-aware feature enhancement for object detection
    Fan, Zhongjie
    Liu, Qiong
    PATTERN RECOGNITION, 2022, 124
  • [47] Detect Arbitrary-Shaped Text via Adaptive Thresholding and Localization Quality Estimation
    Cheng, Peirui
    Zhao, Yuzhong
    Wang, Weiqiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7480 - 7490
  • [48] Which and Where to Focus: A Simple yet Accurate Framework for Arbitrary-Shaped Nearby Text Detection in Scene Images
    Guo, Youhui
    Zhou, Yu
    Qin, Xugong
    Wang, Weiping
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 271 - 283
  • [49] Feature Aggregation and Region-Aware Learning for Detection of Splicing Forgery
    Xu, Yanzhi
    Zheng, Jiangbin
    Ren, Jinchang
    Fang, Aiqing
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 696 - 700
  • [50] Reading Arbitrary-Shaped Scene Text from Images Through Spline Regression and Rectification
    Chen, Long
    Su, Feng
    Shi, Jiahao
    Qian, Ye
    COMPUTER VISION - ACCV 2022, PT V, 2023, 13845 : 107 - 123