Self-Supervised Text Erasing with Controllable Image Synthesis

被引:2
|
作者
Jiang, Gangwei [1 ]
Wang, Shiyao [2 ]
Ge, Tiezheng [2 ]
Jiang, Yuning [2 ]
Wei, Ying [3 ]
Lian, Defu [1 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
[3] City Univ Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
text erasing; self-supervised; reinforcement learning;
D O I
10.1145/3503161.3547905
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Recent efforts on text erasing have shown promising results. However, existing methods require rich yet costly label annotations to obtain robust models, which limits their use for practical applications. To this end, we study an unsupervised scenario by proposing a novel Self-supervised Text Erasing (STE) framework that jointly learns to synthesize training images with erasure ground-truth and accurately erase texts in the real world. We first design a styleaware image synthesis function to generate synthetic images with diverse styled texts based on two synthetic mechanisms. To bridge the text style gap between the synthetic and real-world data, a policy network is constructed to control the synthetic mechanisms by picking style parameters with the guidance of two specifically designed rewards. The synthetic training images with ground-truth are then fed to train a coarse-to-fine erasing network. To produce better erasing outputs, a triplet erasure loss is designed to enforce the refinement stage to recover background textures. Moreover, we provide a new dataset (called PosterErase), which contains 60K high-resolution posters and is more challenging for the erasing task. The proposed method has been extensively evaluated with both PosterErase and the widely-used SCUT-Enstext dataset. Notably, on PosterErase, our method achieves 5.07 in terms of FID, with a relative improvement of 20.9% over existing supervised baselines.
引用
收藏
页码:1973 / 1983
页数:11
相关论文
共 50 条
  • [1] Text-to-image synthesis with self-supervised learning
    Tan, Yong Xuan
    Lee, Chin Poo
    Neo, Mai
    Lim, Kian Ming
    [J]. PATTERN RECOGNITION LETTERS, 2022, 157 : 119 - 126
  • [2] Self-Supervised Sketch-to-Image Synthesis
    Liu, Bingchen
    Zhu, Yizhe
    Song, Kunpeng
    Elgammal, Ahmed
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2073 - 2081
  • [3] Text-to-image synthesis with self-supervised bi-stage generative adversarial network
    Tan, Yong Xuan
    Lee, Chin Poo
    Neo, Mai
    Lim, Kian Ming
    Lim, Jit Yan
    [J]. PATTERN RECOGNITION LETTERS, 2023, 169 : 43 - 49
  • [4] Self-supervised Regularization for Text Classification
    Zhou, Meng
    Li, Zechen
    Xie, Pengtao
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2021, 9 : 641 - 656
  • [5] Self-supervised regularization for text classification
    Zhou, Meng
    Li, Zechen
    Xie, Pengtao
    [J]. Transactions of the Association for Computational Linguistics, 2021, 9 : 1147 - 1162
  • [6] SELF-SUPERVISED LEARNING FOR SENTIMENT ANALYSIS VIA IMAGE-TEXT MATCHING
    Zhu, Haidong
    Zheng, Zhaoheng
    Soleymani, Mohammad
    Nevatia, Ram
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1710 - 1714
  • [7] Self-Supervised Ultrasound to MRI Fetal Brain Image Synthesis
    Jiao, Jianbo
    Namburete, Ana I. L.
    Papageorghiou, Aris T.
    Noble, J. Alison
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (12) : 4413 - 4424
  • [8] Self-Supervised Object Detection via Generative Image Synthesis
    Mustikovela, Siva Karthik
    De Mello, Shalini
    Prakash, Aayush
    Iqbal, Umar
    Liu, Sifei
    Thu Nguyen-Phuoc
    Rother, Carsten
    Kautz, Jan
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8589 - 8598
  • [9] Self-supervised memory learning for scene text image super-resolution
    Guo, Kehua
    Zhu, Xiangyuan
    Schaefer, Gerald
    Ding, Rui
    Fang, Hui
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
  • [10] INJECTING TEXT IN SELF-SUPERVISED SPEECH PRETRAINING
    Chen, Zhehuai
    Zhang, Yu
    Rosenberg, Andrew
    Ramabhadran, Bhuvana
    Wang, Gary
    Moreno, Pedro
    [J]. 2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 251 - 258