Stroke-Based Scene Text Erasing Using Synthetic Data for Training

被引:15
|
作者
Tang, Zhengmi [1 ]
Miyazaki, Tomo [1 ]
Sugaya, Yoshihiro [1 ]
Omachi, Shinichiro [1 ]
机构
[1] Tohoku Univ, Grad Sch Engn, Sendai, Miyagi 9808579, Japan
基金
日本学术振兴会;
关键词
Image segmentation; Task analysis; Convolution; Training; Pipelines; Engines; Detectors; Scene text erasing; synthetic text; background inpainting; LOCALIZATION;
D O I
10.1109/TIP.2021.3125260
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene text erasing, which replaces text regions with reasonable content in natural images, has drawn significant attention in the computer vision community in recent years. There are two potential subtasks in scene text erasing: text detection and image inpainting. Both subtasks require considerable data to achieve better performance; however, the lack of a large-scale real-world scene-text removal dataset does not allow existing methods to realize their potential. To compensate for the lack of pairwise real-world data, we made considerable use of synthetic text after additional enhancement and subsequently trained our model only on the dataset generated by the improved synthetic text engine. Our proposed network contains a stroke mask prediction module and background inpainting module that can extract the text stroke as a relatively small hole from the cropped text image to maintain more background content for better inpainting results. This model can partially erase text instances in a scene image with a bounding box or work with an existing scene-text detector for automatic scene text erasing. The experimental results from the qualitative and quantitative evaluation on the SCUT-Syn, ICDAR2013, and SCUT-EnsText datasets demonstrate that our method significantly outperforms existing state-of-the-art methods even when they are trained on real-world data.
引用
收藏
页码:9306 / 9320
页数:15
相关论文
共 50 条
  • [31] Printed Ottoman text recognition using synthetic data and data augmentation
    Esma F. Bilgin Tasdemir
    International Journal on Document Analysis and Recognition (IJDAR), 2023, 26 : 273 - 287
  • [32] Printed Ottoman text recognition using synthetic data and data augmentation
    Tasdemir, Esma F. Bilgin F.
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2023, 26 (03) : 273 - 287
  • [33] A crowdsource based framework for Bengali scene text data collection and detection
    Hossain, Md. Yearat
    Rahman, Tanzilur
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 112
  • [34] MapReduce Based Text Detection in Big Data Natural Scene Videos
    Ben Ayed, Abdelkarim
    Ben Halima, Mohamed
    Alimi, Adel M.
    INNS CONFERENCE ON BIG DATA 2015 PROGRAM, 2015, 53 : 216 - 223
  • [35] SCENE TEXT RECOGNITION USING SPARSE CODING BASED FEATURES
    Zhang, Dong
    Wang, Da-Han
    Wang, Hanzi
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 1066 - 1070
  • [36] Script Independent Scene Text Segmentation using Fast Stroke Width Transform and GrabCut
    Bosamiya, Jay H.
    Agrawal, Palash
    Roy, Partha Pratim
    Balasubramanian, R.
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 151 - 155
  • [37] Text Detection in Scene Images using Stroke Width and Nearest-Neighbor Constraints
    Srivastav, Apurva
    Kumar, Jayant
    2008 IEEE REGION 10 CONFERENCE: TENCON 2008, VOLS 1-4, 2008, : 678 - +
  • [38] Scene Text Extraction using Stroke Width Transform for Tourist Translator on Android Platform
    Chavre, Pooja
    Ghotkar, Archana
    2016 INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND DYNAMIC OPTIMIZATION TECHNIQUES (ICACDOT), 2016, : 301 - 306
  • [39] Using Synthetic Training Data for Deep Learning-Based GBM Segmentation
    Lindner, Lydia
    Narnhofer, Dominik
    Weber, Maximilian
    Gsaxner, Christina
    Kolodziej, Malgorzata
    Egger, Jan
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 6724 - 6729
  • [40] Adversarial Physics-based Augmentations for Robust Training Using Synthetic Data
    Clark, Emma
    Walters, Ellie
    Zelnio, Edmund
    ALGORITHMS FOR SYNTHETIC APERTURE RADAR IMAGERY XXXI, 2024, 13032