Text Image Super-Resolution Guided by Text Structure and Embedding Priors

Cited by: 0
Authors
Huang, Cong [1]
Peng, Xiulian [2]
Liu, Dong [1]
Lu, Yan [2]
Affiliations
[1] Univ Sci & Technol China, 96 JinZhai Rd, Hefei, Peoples R China
[2] Microsoft Res Asia, 5 Dan Ling St, Beijing, Peoples R China
Keywords
Text image super-resolution; text-structure prior; text-embedding prior; NETWORK; RECOGNITION;
DOI
10.1145/3595924
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
We aim to super-resolve text images from unrecognizable low-resolution inputs. Existing super-resolution methods mainly learn a direct mapping from low-resolution to high-resolution images by exploring low-level features. They usually generate blurry outputs and suffer from severe structural distortion in text regions, especially when the resolution is very low, which degrades both visual quality and readability. To tackle these issues, we propose a new text super-resolution paradigm: recovering with understanding. Specifically, we extract a text-embedding prior and a text-structure prior from the upsampled image by learning to understand the text. The two priors, which carry rich structure and text-embedding information, are then used as auxiliary information to recover a clear text structure. In addition, we introduce a text-feature loss to guide training toward better text recognizability. Extensive evaluations on both screen and scene text image datasets show that our method substantially outperforms the state of the art in both visual quality and recognition accuracy.
Pages: 18