Arbitrary-shaped scene text detection with keypoint-based shape representation

被引:0
|
作者
Shuxin Qin
Lin Chen
机构
[1] Purple Mountain Laboratories,
[2] Institute of Automation,undefined
[3] Chinese Academy of Sciences,undefined
关键词
Arbitrary-shaped text detection; Keypoint regression; Anchor-free method; Feature fusing;
D O I
暂无
中图分类号
学科分类号
摘要
Recently scene text detection has become a hot research topic. Arbitrary-shaped text detection is more challenging due to the irregular geometry of the texts such as long curved shapes. Most existing works attempt to solve the problem by using bottom-up methods, followed by heuristic post-processing, or top-down methods with boundary regression. Through analysis and comparison, we present an efficient framework to detect arbitrary-shaped text by fusing bottom-up and top-down methods. Specifically, we use a segmentation method as the bottom-up detector to regress the text areas. We employ an anchor-free method as the top-down detector to represent and distinguish each text based on the results of bottom-up detector. To detect text with arbitrary shapes, we propose a keypoint-based shape representation method, which treats a text as several keypoints linked together. Then, keypoints are regressed by the top-down detector. With the keypoint-based shape representation, the detected text can be easily rectified by Thin Plate Spline (TPS) transformation, and the framework can be directly extended to support end-to-end text spotting. Extensive experiments on several public benchmarks, including both regular-shaped and arbitrary-shaped scene texts in natural images, demonstrate that our method has achieved state-of-the-art performance .
引用
收藏
页码:115 / 127
页数:12
相关论文
共 50 条
  • [1] Arbitrary-shaped scene text detection with keypoint-based shape representation
    Qin, Shuxin
    Chen, Lin
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2022, 25 (02) : 115 - 127
  • [2] Fuzzy Semantics for Arbitrary-Shaped Scene Text Detection
    Wang, Fangfang
    Xu, Xiaogang
    Chen, Yifeng
    Li, Xi
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1 - 12
  • [3] Arbitrary-Shaped Text Detection With Adaptive Text Region Representation
    Jiang, Xiufeng
    Xu, Shugong
    Zhang, Shunqing
    Cao, Shan
    [J]. IEEE ACCESS, 2020, 8 : 102106 - 102118
  • [4] Arbitrary-shaped scene text detection by predicting distance map
    Xinyu Wang
    Yaohua Yi
    Jibing Peng
    Kaili Wang
    [J]. Applied Intelligence, 2022, 52 : 14374 - 14386
  • [5] Arbitrary-shaped scene text detection by predicting distance map
    Wang, Xinyu
    Yi, Yaohua
    Peng, Jibing
    Wang, Kaili
    [J]. APPLIED INTELLIGENCE, 2022, 52 (12) : 14374 - 14386
  • [6] Margin Guidance Network for Arbitrary-shaped Scene Text Detection
    Li, Xin
    Wu, Xingjiao
    Ma, Tianlong
    Zhou, Zhao
    Chen, Luhui
    He, Liang
    [J]. 2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 1111 - 1117
  • [7] TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection
    Wang, Fangfang
    Chen, Yifeng
    Wu, Fei
    Li, Xi
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 111 - 119
  • [8] ESRNet: an exploring sample relationships network for arbitrary-shaped scene text detection
    Fan, Huageng
    Lu, Tongwei
    [J]. APPLIED INTELLIGENCE, 2024, 54 (22) : 11995 - 12008
  • [9] Arbitrary Shape Scene Text Detection with Adaptive Text Region Representation
    Wang, Xiaobing
    Jiang, Yingying
    Luo, Zhenbo
    Liu, Cheng-Lin
    Choi, Hyunsoo
    Kim, Sungjin
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6442 - 6451
  • [10] Bidirectional Regression for Arbitrary-Shaped Text Detection
    Sheng, Tao
    Lian, Zhouhui
    [J]. DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 : 187 - 201