ADNet: Rethinking the Shrunk Polygon-Based Approach in Scene Text Detection

被引:5
|
作者
Qu, Yadong [1 ]
Xie, Hongtao [1 ]
Fang, Shancheng [1 ]
Wang, Yuxin [1 ]
Zhang, Yongdong [1 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230026, Peoples R China
关键词
Kernel; Shape; Costs; Convolution; Adaptive systems; Text recognition; Synthetic aperture sonar; Scene text detection; shrunk polygon; aspect ratio; adaptive dilation factor;
D O I
10.1109/TMM.2022.3216729
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To localize text regions and separate close instances, the shrunk polygon is widely used in recent scene text detection methods. However, there exist two problems: 1) Existing methods fail to consider the aspect ratio sensitive problem when reconstructing the text instance from shrunk polygon. 2) Texts with extreme aspect ratios will lead to the fracture of shrunk polygons. To handle these two problems, in this paper, we propose a novel Adaptive Dilation Network (ADNet) to focus on the reconstruction process from shrunk polygon, which aims to provide a tight and complete text representation. Firstly, instead of using a fixed dilation factor, ADNet uses an aspect ratio-wise dilation factor to reconstruct the text region from shrunk polygon for each text instance. Such an instance-wise dilation factor considers the scale correlation between the original and shrunk polygon, and thus can guide an adaptive text region reconstruction for texts with large aspect ratio variance. Secondly, to deal with the fracture of detection results, a new Efficient Spatial Relationship Module (ESRM) is devised to capture long-range dependencies with low computation cost. ESRM uses a novel Weighted Pooling to reduce the resolution of feature maps without much information loss. Compared with the existing methods, ADNet further explores the potential of shrunk polygon-based approaches and obtains excellent detection results at an impressive speed. Extensive experiments on several datasets (Total-Text, CTW1500, MSRA-TD500 and ICDAR2015) verify the superiority of our method.
引用
收藏
页码:6983 / 6996
页数:14
相关论文
共 50 条
  • [31] Text detection and localization in natural scene images based on text awareness score
    Soni, Rituraj
    Kumar, Bijendra
    Chand, Satish
    APPLIED INTELLIGENCE, 2019, 49 (04) : 1376 - 1405
  • [32] Natural scene text detection based on multiscale connectionist text proposal network
    Huang, Min
    Lan, Chaohao
    Huang, Wei
    Tao, Yang
    JOURNAL OF ENGINEERING-JOE, 2020, 2020 (13): : 326 - 329
  • [33] Text detection and localization in natural scene images based on text awareness score
    Rituraj Soni
    Bijendra Kumar
    Satish Chand
    Applied Intelligence, 2019, 49 : 1376 - 1405
  • [34] A Novel Wrapper Approach for Feature Selection in Object-Based Image Classification Using Polygon-Based Cross-Validation
    Ma, Lei
    Li, Manchun
    Gao, Yu
    Chen, Tan
    Ma, Xiaoxue
    Qu, Lean
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2017, 14 (03) : 409 - 413
  • [35] FTPN: Scene Text Detection With Feature Pyramid Based Text Proposal Network
    Liu, Fagui
    Chen, Cheng
    Gu, Dian
    Zheng, Jingzhong
    IEEE ACCESS, 2019, 7 : 44219 - 44228
  • [36] PETR: Rethinking the Capability of Transformer-Based Language Model in Scene Text Recognition
    Wang, Yuxin
    Xie, Hongtao
    Fang, Shancheng
    Xing, Mengting
    Wang, Jing
    Zhu, Shenggao
    Zhang, Yongdong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 5585 - 5598
  • [37] SCENE TEXT DETECTION BASED ON SKELETON-CUT DETECTOR
    He, Xiang
    Song, Yonghong
    Zhang, Yuanlin
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3375 - 3379
  • [38] Double supervision for scene text detection and recognition based on BMINet
    Wan, Hanyang
    Liu, Ruoyun
    Yu, Li
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2025, 130
  • [39] A Quantum-Based Attention Mechanism in Scene Text Detection
    Wu, Hao
    Zhou, Jun
    Zhang, Qiong
    Lei, Yang
    Yu, Kun
    An, Wenbo
    Zhang, Juntao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VIII, 2024, 14432 : 3 - 14
  • [40] SCENE TEXT DETECTION WITH EXTREMAL REGION BASED CASCADED FILTERING
    Li, Gen
    Liu, Jie
    Zhang, Shuwu
    Zheng, Yang
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 2896 - 2900