Fuzzy Semantics for Arbitrary-Shaped Scene Text Detection

被引:22
|
作者
Wang, Fangfang [1 ,2 ]
Xu, Xiaogang [2 ,3 ]
Chen, Yifeng [1 ]
Li, Xi [1 ,4 ,5 ,6 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[2] Zhejiang Lab, Hangzhou 310027, Peoples R China
[3] Zhejiang Gongshang Univ, Sch Comp & Informat Engn, Hangzhou 310027, Peoples R China
[4] Zhejiang Univ, Shanghai Inst Adv Study, Shanghai 201203, Peoples R China
[5] Shanghai AI Lab, Shanghai 201203, Peoples R China
[6] Zhejiang Singapore Innovat & AI Joint Res Lab, Hangzhou 310027, Peoples R China
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Arbitrary-shaped text detection; fuzzy semantics; segmentation-based framework; single-shot network;
D O I
10.1109/TIP.2022.3201467
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To robustly detect arbitrary-shaped scene texts, bottom-up methods are widely explored for their flexibility. Due to the highly homogeneous texture and cluttered distribution of scene texts, it is nontrivial for segmentation-based methods to discover the separatrixes between adjacent instances. To effectively separate nearby texts, many methods adopt the seed expansion strategy that segments shrunken text regions as seed areas, and then iteratively expands the seed areas into intact text regions. In seek of a more straightforward way that does not rely on seed area segmentation and avoid possible error accumulation brought by iterative processing, we propose a redundancy removal strategy. In this work, we directly explore two types of fuzzy semantics-text and separatrix-that do not possess specific boundaries, and separate cluttered instances by excluding the separatrix pixels from text regions. To deal with the fuzzy semantic boundaries, we also conduct reliability analysis in both optimization and inference stage to suppress false positive pixels at ambiguous locations. Experiments on benchmark datasets demonstrate the effectiveness of our method.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [21] Arbitrary-Shaped Text Detection with B-Spline Curve Network
    You, Yuwei
    Lei, Yuxin
    Zhang, Zixu
    Tong, Minglei
    SENSORS, 2023, 23 (05)
  • [22] Boundary-Aware Arbitrary-Shaped Scene Text Detector With Learnable Embedding Network
    Xing, Mengting
    Xie, Hongtao
    Tan, Qingfeng
    Fang, Shancheng
    Wang, Yuxin
    Zha, Zhengjun
    Zhang, Yongdong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 3129 - 3143
  • [23] Region-Aware Arbitrary-Shaped Text Detection With Progressive Fusion
    Wang, Qitong
    Fu, Bin
    Li, Ming
    He, Junjun
    Peng, Xi
    Qiao, Yu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4718 - 4729
  • [24] A New Arbitrary-shaped Text Detection Network by Reinforcing Edge Features
    Bai H.-X.
    Wang H.-R.
    Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (05): : 1019 - 1030
  • [25] Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network
    Wang, Wenhai
    Xie, Enze
    Song, Xiaoge
    Zang, Yuhang
    Wang, Wenjia
    Lu, Tong
    Yu, Gang
    Shen, Chunhua
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8439 - 8448
  • [26] Fast arbitrary shaped scene text detection via text discriminator
    Guizhou Institute of Technology, Guiyzhou, Guiyang, China
    不详
    J. Phys. Conf. Ser., 1742, 1
  • [27] POINTER NETWORKS FOR ARBITRARY-SHAPED TEXT SPOTTING
    Zhang, Yi
    Yang, Wei
    Xu, Zhenbo
    Li, Yingjie
    Chen, Zhi
    Huang, Liusheng
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2375 - 2379
  • [28] Mask is All You Need: Rethinking Mask R-CNN for Dense and Arbitrary-Shaped Scene Text Detection
    Qin, Xugong
    Zhou, Yu
    Guo, Youhui
    Wu, Dayan
    Tian, Zhihong
    Jiang, Ning
    Wang, Hongbin
    Wang, Weiping
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 414 - 423
  • [29] Learning and Fusing Multi-Scale Representations for Accurate Arbitrary-Shaped Scene Text Recognition
    Li, Mingjun
    Xu, Shuo
    Su, Feng
    PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 353 - 361
  • [30] Arbitrary-shaped text detection with adaptive convolution and path enhancement pyramid network
    Cheng, Qi
    Wang, Guodong
    Dong, Qian
    Wei, Bin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (39-40) : 29225 - 29242