ACP-Net: Asymmetric Center Positioning Network for Real-Time Text Detection

被引:0
|
作者
Zhu, Boyuan [1 ]
Liu, Fagui [1 ,2 ]
Chen, Xi [1 ]
Tang, Quan [2 ]
Chen, C. L. Philip [1 ,3 ]
机构
[1] South China Univ Technol, 342 Outer Ring East Rd, Guangzhou 510006, Peoples R China
[2] Peng Cheng Lab, 2 Xingke 1st St, Shenzhen 518055, Nanshan, Peoples R China
[3] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, 95 Zhongguancun East Rd, Beijing 100190, Peoples R China
关键词
Text Detection; Real-Time; Asymmetric Center Positioning Network;
D O I
10.1016/j.knosys.2024.112603
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene text detection is crucial across numerous application fields. However, despite the emphasis on real-time performance in scene text detection, most existing detection models utilize the Feature Pyramid Network (FPN) for feature extraction, often disregarding its inherent limitations. Integrating high-resolution multi-channel features into FPN requires substantial computational resources. While FPN treats local and global features equally and is stable in various applications, its suitability for text-specific features is questionable. To this end, we propose the Asymmetric Center Positioning Network (ACP-Net) to replace FPN, achieving accuracy and real-time text detection in complex scenarios. ACP-Net features an asymmetric feature structure with independent branches for global and local information, along with an adaptive weighted fusion module to capture long-range dependencies effectively. In addition, a text center positioning module enhances text feature understanding by learning feature centers. Comprehensive evaluations across various terminals confirmed ACP-Net's superior accuracy and speed.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] MSER-based Real-Time Text Detection and Tracking
    Gomez, Lluis
    Karatzas, Dimosthenis
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3110 - 3115
  • [22] Lightweight Asymmetric Dilation Network for Real-Time Semantic Segmentation
    Hu, Xuegang
    Gong, Yu
    IEEE ACCESS, 2021, 9 : 55630 - 55643
  • [23] Real-time Scene Text Detection Based on Stroke Model
    Liu, Yi
    Zhang, Dongming
    Zhang, Yongdong
    Lin, Shouxun
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3116 - 3120
  • [24] REAL-TIME POSITIONING COMING REAL SOON
    不详
    CIVIL ENGINEERING, 1993, 63 (12): : 18 - 19
  • [25] Real-time Clock Jump Detection and Repair for Precise Point Positioning
    Guo, Fei
    Zhang, Xiaohong
    PROCEEDINGS OF THE 25TH INTERNATIONAL TECHNICAL MEETING OF THE SATELLITE DIVISION OF THE INSTITUTE OF NAVIGATION (ION GNSS 2012), 2012, : 3077 - 3088
  • [26] Real-time detection and processing of noise correlation in kinematic navigation and positioning
    Gan, Yu
    Sui, Lifen
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2011, 36 (08): : 909 - 913
  • [27] A real-time and effective text detection method for multi-scale and fuzzy text
    Tong, Guoxiang
    Dong, Ming
    Song, Yan
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2023, 20 (01)
  • [28] A real-time and effective text detection method for multi-scale and fuzzy text
    Guoxiang Tong
    Ming Dong
    Yan Song
    Journal of Real-Time Image Processing, 2023, 20
  • [29] Center Focusing Network for Real-Time LiDAR Panoptic Segmentation
    Li, Xiaoyan
    Zhang, Gang
    Wang, Boyue
    Hu, Yongli
    Yin, Baocai
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13425 - 13434
  • [30] CGAN-NET: CLASS-GUIDED ASYMMETRIC NON-LOCAL NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION
    Chen, Hanlin
    Hu, Qingyong
    Yang, Jungang
    Wu, Jing
    Guo, Yulan
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2325 - 2329