Fuzzy Semantics for Arbitrary-Shaped Scene Text Detection

被引:8
|
作者
Wang, Fangfang [1 ,2 ]
Xu, Xiaogang [2 ,3 ]
Chen, Yifeng [1 ]
Li, Xi [1 ,4 ,5 ,6 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[2] Zhejiang Lab, Hangzhou 310027, Peoples R China
[3] Zhejiang Gongshang Univ, Sch Comp & Informat Engn, Hangzhou 310027, Peoples R China
[4] Zhejiang Univ, Shanghai Inst Adv Study, Shanghai 201203, Peoples R China
[5] Shanghai AI Lab, Shanghai 201203, Peoples R China
[6] Zhejiang Singapore Innovat & AI Joint Res Lab, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Arbitrary-shaped text detection; fuzzy semantics; segmentation-based framework; single-shot network;
D O I
10.1109/TIP.2022.3201467
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To robustly detect arbitrary-shaped scene texts, bottom-up methods are widely explored for their flexibility. Due to the highly homogeneous texture and cluttered distribution of scene texts, it is nontrivial for segmentation-based methods to discover the separatrixes between adjacent instances. To effectively separate nearby texts, many methods adopt the seed expansion strategy that segments shrunken text regions as seed areas, and then iteratively expands the seed areas into intact text regions. In seek of a more straightforward way that does not rely on seed area segmentation and avoid possible error accumulation brought by iterative processing, we propose a redundancy removal strategy. In this work, we directly explore two types of fuzzy semantics-text and separatrix-that do not possess specific boundaries, and separate cluttered instances by excluding the separatrix pixels from text regions. To deal with the fuzzy semantic boundaries, we also conduct reliability analysis in both optimization and inference stage to suppress false positive pixels at ambiguous locations. Experiments on benchmark datasets demonstrate the effectiveness of our method.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [1] Arbitrary-shaped scene text detection by predicting distance map
    Xinyu Wang
    Yaohua Yi
    Jibing Peng
    Kaili Wang
    [J]. Applied Intelligence, 2022, 52 : 14374 - 14386
  • [2] Arbitrary-shaped scene text detection by predicting distance map
    Wang, Xinyu
    Yi, Yaohua
    Peng, Jibing
    Wang, Kaili
    [J]. APPLIED INTELLIGENCE, 2022, 52 (12) : 14374 - 14386
  • [3] Margin Guidance Network for Arbitrary-shaped Scene Text Detection
    Li, Xin
    Wu, Xingjiao
    Ma, Tianlong
    Zhou, Zhao
    Chen, Luhui
    He, Liang
    [J]. 2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 1111 - 1117
  • [4] ESRNet: an exploring sample relationships network for arbitrary-shaped scene text detection
    Fan, Huageng
    Lu, Tongwei
    [J]. APPLIED INTELLIGENCE, 2024, 54 (22) : 11995 - 12008
  • [5] Arbitrary-shaped scene text detection with keypoint-based shape representation
    Shuxin Qin
    Lin Chen
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2022, 25 : 115 - 127
  • [6] Arbitrary-shaped scene text detection with keypoint-based shape representation
    Qin, Shuxin
    Chen, Lin
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2022, 25 (02) : 115 - 127
  • [7] Bidirectional Regression for Arbitrary-Shaped Text Detection
    Sheng, Tao
    Lian, Zhouhui
    [J]. DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 : 187 - 201
  • [8] Boundary TextSpotter: Toward Arbitrary-Shaped Scene Text Spotting
    Lu, Pu
    Wang, Hao
    Zhu, Shenggao
    Wang, Jing
    Bai, Xiang
    Liu, Wenyu
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6200 - 6212
  • [9] Arbitrary-Shaped Scene Text Recognition with Deformable Ensemble Attention
    Xu, Shuo
    Zhuang, Zeming
    Li, Mingjun
    Su, Feng
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), (237-253):
  • [10] TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection
    Wang, Fangfang
    Chen, Yifeng
    Wu, Fei
    Li, Xi
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 111 - 119