Stroke-Based Scene Text Erasing Using Synthetic Data for Training

被引:15
|
作者
Tang, Zhengmi [1 ]
Miyazaki, Tomo [1 ]
Sugaya, Yoshihiro [1 ]
Omachi, Shinichiro [1 ]
机构
[1] Tohoku Univ, Grad Sch Engn, Sendai, Miyagi 9808579, Japan
基金
日本学术振兴会;
关键词
Image segmentation; Task analysis; Convolution; Training; Pipelines; Engines; Detectors; Scene text erasing; synthetic text; background inpainting; LOCALIZATION;
D O I
10.1109/TIP.2021.3125260
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene text erasing, which replaces text regions with reasonable content in natural images, has drawn significant attention in the computer vision community in recent years. There are two potential subtasks in scene text erasing: text detection and image inpainting. Both subtasks require considerable data to achieve better performance; however, the lack of a large-scale real-world scene-text removal dataset does not allow existing methods to realize their potential. To compensate for the lack of pairwise real-world data, we made considerable use of synthetic text after additional enhancement and subsequently trained our model only on the dataset generated by the improved synthetic text engine. Our proposed network contains a stroke mask prediction module and background inpainting module that can extract the text stroke as a relatively small hole from the cropped text image to maintain more background content for better inpainting results. This model can partially erase text instances in a scene image with a bounding box or work with an existing scene-text detector for automatic scene text erasing. The experimental results from the qualitative and quantitative evaluation on the SCUT-Syn, ICDAR2013, and SCUT-EnsText datasets demonstrate that our method significantly outperforms existing state-of-the-art methods even when they are trained on real-world data.
引用
收藏
页码:9306 / 9320
页数:15
相关论文
共 50 条
  • [21] STROKE-BASED HANDWRITTEN CHINESE CHARACTER-RECOGNITION USING NEURAL NETWORKS
    LIAO, HY
    HUANG, JS
    HUANG, ST
    PATTERN RECOGNITION LETTERS, 1993, 14 (10) : 833 - 840
  • [22] Handwritten digit recognition using neural networks and dynamic zoning with stroke-based descriptors
    Alvarez-Leon, David
    Fernandez-Diaz, Ramon-Angel
    Sanchez-Gonzalez, Lidia
    Alija-Perez, Jose-Manuel
    LOGIC JOURNAL OF THE IGPL, 2017, 25 (06) : 979 - 990
  • [23] Scene Text Detection Using Superpixel-Based Stroke Feature Transform and Deep Learning Based Region Classification
    Tang, Youbao
    Wu, Xiangqian
    IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (09) : 2276 - 2288
  • [24] On-line cursive kanji character recognition using stroke-based affine transformation
    Wakahara, T
    Odaka, K
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (12) : 1381 - 1385
  • [25] Online Handwritten Arabic Scripts Recognition Using Stroke-Based Class Labeling Scheme
    Zitouni, Rabiaa
    Bezine, Hala
    Arous, Najet
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) : 187 - 198
  • [26] Scene-specific crowd counting using synthetic training images
    Delussu, Rita
    Putzu, Lorenzo
    Fumera, Giorgio
    PATTERN RECOGNITION, 2022, 124
  • [27] Scene-specific crowd counting using synthetic training images
    Delussu, Rita
    Putzu, Lorenzo
    Fumera, Giorgio
    Pattern Recognition, 2022, 124
  • [28] A Thumb Stroke-Based Virtual Keyboard for Sight-Free Text Entry on Touch-Screen Mobile Phones
    Lai, Jianwei
    Zhang, Dongsong
    Wang, Sen
    Kilic, Isil Yakut
    Zhou, Lina
    PROCEEDINGS OF THE 51ST ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS), 2018, : 293 - 302
  • [29] Extraction of Arbitrary Text in Natural Scene Image based on Stroke Width Transform
    Jameson, Jinjuli
    Abdullah, Siti Norul Huda Sheikh
    2014 14TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA 2014), 2014,
  • [30] Synthetic Data Generation using Imitation Training
    Kishore, Aman
    Choe, Tae Eun
    Kwon, Junghyun
    Park, Minwoo
    Hao, Pengfei
    Mittel, Akshita
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3071 - 3079