Stroke-Based Scene Text Erasing Using Synthetic Data for Training

Cited by: 15
Authors
Tang, Zhengmi [1]
Miyazaki, Tomo [1]
Sugaya, Yoshihiro [1]
Omachi, Shinichiro [1]
Affiliations
[1] Tohoku Univ, Grad Sch Engn, Sendai, Miyagi 9808579, Japan
Funding
Japan Society for the Promotion of Science (JSPS);
Keywords
Image segmentation; Task analysis; Convolution; Training; Pipelines; Engines; Detectors; Scene text erasing; synthetic text; background inpainting; LOCALIZATION;
DOI
10.1109/TIP.2021.3125260
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Scene text erasing, which replaces text regions with reasonable content in natural images, has drawn significant attention in the computer vision community in recent years. There are two potential subtasks in scene text erasing: text detection and image inpainting. Both subtasks require considerable data to achieve better performance; however, the lack of a large-scale real-world scene-text removal dataset does not allow existing methods to realize their potential. To compensate for the lack of pairwise real-world data, we made considerable use of synthetic text after additional enhancement and subsequently trained our model only on the dataset generated by the improved synthetic text engine. Our proposed network contains a stroke mask prediction module and background inpainting module that can extract the text stroke as a relatively small hole from the cropped text image to maintain more background content for better inpainting results. This model can partially erase text instances in a scene image with a bounding box or work with an existing scene-text detector for automatic scene text erasing. The experimental results from the qualitative and quantitative evaluation on the SCUT-Syn, ICDAR2013, and SCUT-EnsText datasets demonstrate that our method significantly outperforms existing state-of-the-art methods even when they are trained on real-world data.
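The two-stage flow described in the abstract (crop a text region by its bounding box, extract the stroke pixels as a small hole, inpaint only that hole, and paste the result back) can be sketched roughly as follows. This is a minimal illustration, not the authors' network: `predict_stroke_mask`, `inpaint_background`, and `erase_text` are hypothetical names, the stroke predictor is replaced by a simple darkness threshold, and the inpainting module is replaced by a mean-color fill.

```python
import numpy as np

def predict_stroke_mask(crop, threshold=0.5):
    """Hypothetical stand-in for the stroke mask prediction module.

    Assumption: strokes are darker than the background, so a plain
    brightness threshold is used instead of a learned model.
    """
    gray = crop.mean(axis=2) / 255.0
    return gray < threshold

def inpaint_background(crop, mask):
    """Hypothetical stand-in for the background inpainting module.

    Fills only the stroke pixels with the mean color of the non-stroke
    pixels, leaving the rest of the crop unchanged.
    """
    out = crop.copy()
    if mask.any() and (~mask).any():
        out[mask] = crop[~mask].mean(axis=0).astype(crop.dtype)
    return out

def erase_text(image, box):
    """Erase text inside box = (x0, y0, x1, y1) and return (result, mask)."""
    x0, y0, x1, y1 = box
    result = image.copy()
    crop = result[y0:y1, x0:x1]
    mask = predict_stroke_mask(crop)
    result[y0:y1, x0:x1] = inpaint_background(crop, mask)
    return result, mask

# Toy scene: light background with a dark bar standing in for a text stroke.
image = np.full((8, 12, 3), 200, dtype=np.uint8)
image[3:5, 4:8] = 10
erased, mask = erase_text(image, (2, 1, 10, 7))
```

Because only the masked stroke pixels are overwritten, every other pixel inside the bounding box keeps its original value, which is the motivation the abstract gives for treating the strokes as a relatively small hole.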
Pages: 9306 - 9320
Page count: 15
Related papers
50 in total
  • [41] Text Detection in Natural Images with Convolutional Neural Networks and Synthetic Training Data
    Grond, Marco
    Brink, Willie
    Herbst, Ben
    2016 PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA AND ROBOTICS AND MECHATRONICS INTERNATIONAL CONFERENCE (PRASA-ROBMECH), 2016,
  • [42] Scene Text Detection Based on Robust Stroke Width Transform and Deep Belief Network
    Xu, Hailiang
    Xue, Like
    Su, Feng
    COMPUTER VISION - ACCV 2014, PT II, 2015, 9004 : 195 - 209
  • [43] Text Detection in Traffic Informatory Signs Using Synthetic Data
    Chen, Fangge
    Kataoka, Hirokatsu
    Satoh, Yutaka
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 851 - 858
  • [44] Recognizing Multiplayer Behaviors Using Synthetic Training Data
    Feng, Andrew
    Gordon, Andrew S.
    2020 IEEE CONFERENCE ON GAMES (IEEE COG 2020), 2020, : 463 - 470
  • [45] Improving Person Detection Using Synthetic Training Data
    Yu, Jie
    Farin, Dirk
    Krueger, Christof
    Schiele, Bernt
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 3477 - 3480
  • [46] Training of a deep learning based digital subtraction angiography method using synthetic data
    Duan, Lizhen
    Eulig, Elias
    Knaup, Michael
    Adamus, Ralf
    Lell, Michael
    Kachelriess, Marc
    MEDICAL PHYSICS, 2024, 51 (07) : 4793 - 4810
  • [47] Pixelwise Object Class Segmentation based on Synthetic Data using an Optimized Training Strategy
    Dittrich, Frank
    Woern, Heinz
    Sharma, Vivek
    Yayilgan, Sule
    2014 FIRST INTERNATIONAL CONFERENCE ON NETWORKS & SOFT COMPUTING (ICNSC), 2014, : 388 - 394
  • [48] Detection of Surgical Instruments Based on Synthetic Training Data
    Wiese, Leon
    Hinz, Lennart
    Reithmeier, Eduard
    Korn, Philippe
    Neuhaus, Michael
    COMPUTERS, 2025, 14 (02)
  • [49] Estimation and correction of geometric distortion of stroke-based symbology for avionics display system using curve fitting
    Saini, Surender Singh
    Pattnaik, S. S.
    Sardana, H. K.
    INTERNATIONAL JOURNAL OF IMAGE AND DATA FUSION, 2014, 5 (04) : 334 - 347
  • [50] Scene Text Extraction Based on Symmetrical Edge-point Pair Detection of Character Stroke
    Liu, Teng
    Zhou, Liang
    2017 EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2017, : 27 - 32