Stroke-Based Scene Text Erasing Using Synthetic Data for Training

Cited by: 15
Authors
Tang, Zhengmi [1]
Miyazaki, Tomo [1]
Sugaya, Yoshihiro [1]
Omachi, Shinichiro [1]
Affiliations
[1] Tohoku Univ, Grad Sch Engn, Sendai, Miyagi 9808579, Japan
Funding
Japan Society for the Promotion of Science
Keywords
Image segmentation; Task analysis; Convolution; Training; Pipelines; Engines; Detectors; Scene text erasing; synthetic text; background inpainting; LOCALIZATION;
DOI
10.1109/TIP.2021.3125260
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Scene text erasing, which replaces text regions with reasonable content in natural images, has drawn significant attention in the computer vision community in recent years. There are two potential subtasks in scene text erasing: text detection and image inpainting. Both subtasks require considerable data to achieve better performance; however, the lack of a large-scale real-world scene-text removal dataset does not allow existing methods to realize their potential. To compensate for the lack of pairwise real-world data, we made considerable use of synthetic text after additional enhancement and subsequently trained our model only on the dataset generated by the improved synthetic text engine. Our proposed network contains a stroke mask prediction module and background inpainting module that can extract the text stroke as a relatively small hole from the cropped text image to maintain more background content for better inpainting results. This model can partially erase text instances in a scene image with a bounding box or work with an existing scene-text detector for automatic scene text erasing. The experimental results from the qualitative and quantitative evaluation on the SCUT-Syn, ICDAR2013, and SCUT-EnsText datasets demonstrate that our method significantly outperforms existing state-of-the-art methods even when they are trained on real-world data.
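The two-stage flow described in the abstract (crop a text region by its bounding box, predict a stroke mask, inpaint only the stroke pixels, paste the crop back) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function names are hypothetical, and the thresholding and mean-color fill are simple stand-ins for the learned stroke mask prediction and background inpainting modules, not the authors' network.

```python
import numpy as np


def predict_stroke_mask(crop, threshold=0.5):
    """Hypothetical stand-in for the stroke mask prediction module.

    Marks pixels whose gray level deviates strongly from the crop's median
    as text strokes. The paper uses a learned network instead.
    """
    gray = crop.mean(axis=2)
    deviation = np.abs(gray - np.median(gray))
    return deviation > threshold * max(deviation.max(), 1e-8)


def inpaint_background(crop, mask):
    """Hypothetical stand-in for the background inpainting module.

    Fills stroke pixels with the mean color of the non-stroke pixels, so
    every background pixel outside the stroke mask is kept unchanged.
    """
    out = crop.copy()
    if mask.any() and (~mask).any():
        out[mask] = crop[~mask].mean(axis=0)
    return out


def erase_text(image, box):
    """Erase text inside box = (x0, y0, x1, y1) and paste the crop back."""
    x0, y0, x1, y1 = box
    crop = image[y0:y1, x0:x1].astype(np.float64)
    mask = predict_stroke_mask(crop)
    result = image.astype(np.float64).copy()
    result[y0:y1, x0:x1] = inpaint_background(crop, mask)
    return result, mask


# Toy example: a dark background with one bright horizontal "stroke".
image = np.full((8, 8, 3), 0.2)
image[3:5, 2:6] = 1.0
erased, mask = erase_text(image, (0, 0, 8, 8))
```

Because the hole passed to the inpainting step is only the stroke mask rather than the whole bounding box, the surrounding background pixels pass through untouched, which is the motivation the abstract gives for stroke-level masking.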
Pages: 9306-9320
Page count: 15