Text Image Super-Resolution Guided by Text Structure and Embedding Priors

被引:0
|
作者
Huang, Cong [1 ]
Peng, Xiulian [2 ]
Liu, Dong [1 ]
Lu, Yan [2 ]
机构
[1] Univ Sci & Technol China, 96 JinZhai Rd, Hefei, Peoples R China
[2] Microsoft Res Asia, 5 Dan Ling St, Beijing, Peoples R China
关键词
Text image super-resolution; text-structure prior; text-embedding prior; NETWORK; RECOGNITION;
D O I
10.1145/3595924
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We aim to super-resolve text images from unrecognizable low-resolution inputs. Existing super-resolution methods mainly learn a direct mapping from low-resolution to high-resolution images by exploring low-level features, which usually generate blurry outputs and suffer from severe structure distortion for text parts, especially when the resolution is quite low. Both the visual quality and the readability will suffer. To tackle these issues, we propose a new text super-resolution paradigm by recovering with understanding. Specifically, we extract a text-embedding prior and a text-structure prior from the upsampled image by learning to understand the text. The two priors with rich structure information and text-embedding information are then used as auxiliary information to recover the clear text structure. In addition, we introduce a text-feature loss to guide the training for better text recognizability. Extensive evaluations on both screen and scene text image datasets show that our method largely outperforms the state-of-the-art in both visual quality and recognition accuracy.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] An example-based prior model for text image super-resolution
    Park, J
    Kwon, Y
    Kim, JH
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 374 - 378
  • [22] Bayesian super-resolution of text image sequences from low resolution observations.
    Cortijo, FJ
    Villena, S
    Molina, R
    Katsaggelos, A
    [J]. SEVENTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOL 1, PROCEEDINGS, 2003, : 421 - 424
  • [23] Image Super-Resolution Via Sparse Embedding
    Zhu, Qidan
    Sun, Lei
    Cai, Chengtao
    [J]. 2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 5673 - 5676
  • [24] Image Super-Resolution With Sparse Neighbor Embedding
    Gao, Xinbo
    Zhang, Kaibing
    Tao, Dacheng
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2012, 21 (07) : 3194 - 3205
  • [25] Scene Text Image Super-Resolution via Parallelly Contextual Attention Network
    Zhao, Cairong
    Feng, Shuyang
    Zhao, Brian Nlong
    Ding, Zhijun
    Wu, Jun
    Shen, Fuming
    Shen, Heng Tao
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2908 - 2917
  • [26] Self-supervised memory learning for scene text image super-resolution
    Guo, Kehua
    Zhu, Xiangyuan
    Schaefer, Gerald
    Ding, Rui
    Fang, Hui
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
  • [27] A Text Image Super-Resolution Generation Network without Pre-training
    Zhang, Qingyong
    Ye, Ziliu
    Leng Zhiwen
    Yue, Qi
    [J]. 2020 35TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2020, : 515 - 519
  • [28] Improvement of Text Image Super-Resolution Benefiting Multi-task Learning
    Honda, Kosuke
    Fujita, Hamido
    Kurematsu, Masaki
    [J]. ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE: THEORY AND PRACTICES IN ARTIFICIAL INTELLIGENCE, 2022, 13343 : 275 - 286
  • [29] TLWSR: Weakly supervised real-world scene text image super-resolution using text label
    Shi, Qin
    Zhu, Yu
    Fang, Chuantao
    Yang, Dawei
    [J]. IET IMAGE PROCESSING, 2023, 17 (09) : 2780 - 2790
  • [30] Bayesian super-resolution of text in video with a text-specific bimodal prior
    Donaldson, K
    Myers, GK
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 1188 - 1195