Fuzzy Semantics for Arbitrary-Shaped Scene Text Detection

被引:8
|
作者
Wang, Fangfang [1 ,2 ]
Xu, Xiaogang [2 ,3 ]
Chen, Yifeng [1 ]
Li, Xi [1 ,4 ,5 ,6 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[2] Zhejiang Lab, Hangzhou 310027, Peoples R China
[3] Zhejiang Gongshang Univ, Sch Comp & Informat Engn, Hangzhou 310027, Peoples R China
[4] Zhejiang Univ, Shanghai Inst Adv Study, Shanghai 201203, Peoples R China
[5] Shanghai AI Lab, Shanghai 201203, Peoples R China
[6] Zhejiang Singapore Innovat & AI Joint Res Lab, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Arbitrary-shaped text detection; fuzzy semantics; segmentation-based framework; single-shot network;
D O I
10.1109/TIP.2022.3201467
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To robustly detect arbitrary-shaped scene texts, bottom-up methods are widely explored for their flexibility. Due to the highly homogeneous texture and cluttered distribution of scene texts, it is nontrivial for segmentation-based methods to discover the separatrixes between adjacent instances. To effectively separate nearby texts, many methods adopt the seed expansion strategy that segments shrunken text regions as seed areas, and then iteratively expands the seed areas into intact text regions. In seek of a more straightforward way that does not rely on seed area segmentation and avoid possible error accumulation brought by iterative processing, we propose a redundancy removal strategy. In this work, we directly explore two types of fuzzy semantics-text and separatrix-that do not possess specific boundaries, and separate cluttered instances by excluding the separatrix pixels from text regions. To deal with the fuzzy semantic boundaries, we also conduct reliability analysis in both optimization and inference stage to suppress false positive pixels at ambiguous locations. Experiments on benchmark datasets demonstrate the effectiveness of our method.
引用
下载
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [31] Arbitrary-shaped text detection with adaptive convolution and path enhancement pyramid network
    Cheng, Qi
    Wang, Guodong
    Dong, Qian
    Wei, Bin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (39-40) : 29225 - 29242
  • [32] TextDCT: Arbitrary-Shaped Text Detection via Discrete Cosine Transform Mask
    Su, Yuchen
    Shao, Zhiwen
    Zhou, Yong
    Meng, Fanrong
    Zhu, Hancheng
    Liu, Bing
    Yao, Rui
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 5030 - 5042
  • [33] CT-Net: Arbitrary-Shaped Text Detection via Contour Transformer
    Shao, Zhiwen
    Su, Yuchen
    Zhou, Yong
    Meng, Fanrong
    Zhu, Hancheng
    Liu, Bing
    Yao, Rui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1815 - 1826
  • [34] Arbitrary-shaped text detection with adaptive convolution and path enhancement pyramid network
    Qi Cheng
    Guodong Wang
    Qian Dong
    Bin Wei
    Multimedia Tools and Applications, 2020, 79 : 29225 - 29242
  • [35] All You Need Is a Second Look: Towards Arbitrary-Shaped Text Detection
    Cao, Meng
    Zhang, Can
    Yang, Dongming
    Zou, Yuexian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (02) : 758 - 767
  • [36] I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-Shaped Scene Text Detection
    Bo Du
    Jian Ye
    Jing Zhang
    Juhua Liu
    Dacheng Tao
    International Journal of Computer Vision, 2022, 130 : 1961 - 1977
  • [37] I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-Shaped Scene Text Detection
    Du, Bo
    Ye, Jian
    Zhang, Jing
    Liu, Juhua
    Tao, Dacheng
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (08) : 1961 - 1977
  • [38] TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text
    Singh, Amanpreet
    Peng, Guan
    Toh, Mandy
    Huang, Jing
    Galuba, Wojciech
    Hassner, Tal
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8798 - 8808
  • [39] Kernel-mask knowledge distillation for efficient and accurate arbitrary-shaped text detection
    Honghui Chen
    Yuhang Qiu
    Mengxi Jiang
    Jianhui Lin
    Pingping Chen
    Complex & Intelligent Systems, 2024, 10 : 75 - 86
  • [40] Kernel-mask knowledge distillation for efficient and accurate arbitrary-shaped text detection
    Chen, Honghui
    Qiu, Yuhang
    Jiang, Mengxi
    Lin, Jianhui
    Chen, Pingping
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (01) : 75 - 86