Text Image Super-Resolution Guided by Text Structure and Embedding Priors

被引：0

作者：

Huang, Cong ^{[1
]}

Peng, Xiulian ^{[2
]}

Liu, Dong ^{[1
]}

Lu, Yan ^{[2
]}

机构：

[1] Univ Sci & Technol China, 96 JinZhai Rd, Hefei, Peoples R China

[2] Microsoft Res Asia, 5 Dan Ling St, Beijing, Peoples R China

来源：

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS | 2023年 / 19卷 / 06期

关键词：

Text image super-resolution; text-structure prior; text-embedding prior; NETWORK; RECOGNITION;

D O I：

10.1145/3595924

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We aim to super-resolve text images from unrecognizable low-resolution inputs. Existing super-resolution methods mainly learn a direct mapping from low-resolution to high-resolution images by exploring low-level features, which usually generate blurry outputs and suffer from severe structure distortion for text parts, especially when the resolution is quite low. Both the visual quality and the readability will suffer. To tackle these issues, we propose a new text super-resolution paradigm by recovering with understanding. Specifically, we extract a text-embedding prior and a text-structure prior from the upsampled image by learning to understand the text. The two priors with rich structure information and text-embedding information are then used as auxiliary information to recover the clear text structure. In addition, we introduce a text-feature loss to guide the training for better text recognizability. Extensive evaluations on both screen and scene text image datasets show that our method largely outperforms the state-of-the-art in both visual quality and recognition accuracy.

引用

页数：18

共 50 条

[1] Text Prior Guided Scene Text Image Super-Resolution
Ma, Jianqi
Guo, Shi
Zhang, Lei
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1341 - 1353
[2] Perceiving Multiple Representations for scene text image super-resolution guided by text recognizer
Shi, Qin
Zhu, Yu
Liu, Yatong
Ye, Jiongyao
Yang, Dawei
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 124
[3] Advancing scene text image super-resolution via edge enhancement priors
Li, Hongjun
Li, Shangfeng
[J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 8241 - 8250
[4] Text Image Super-resolution by Image Matting and Text Label Supervision
Lin, Kai
Liu, Yubao
Li, Thomas H.
Liu, Shan
Li, Ge
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1722 - 1727
[5] Super-resolution enhancement of text image sequences
Capel, D
Zisserman, A
[J]. 15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 600 - 605
[6] Learning Generative Structure Prior for Blind Text Image Super-resolution
Li, Xiaoming
Zuo, Wangmeng
Loy, Chen Change
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10103 - 10113
[7] Text Gestalt: Stroke-Aware Scene Text Image Super-resolution
Chen, Jingye
Yu, Haiyang
Ma, Jianqi
Li, Bin
Xue, Xiangyang
[J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 285 - 293
[8] Scene Text Telescope: Text-Focused Scene Image Super-Resolution
Chen, Jingye
Li, Bin
Xue, Xiangyang
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12021 - 12030
[9] Rethinking Super-Resolution as Text-Guided Details Generation
Ma, Chenxi
Yan, Bo
Lin, Qing
Tan, Weimin
Chen, Siming
[J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3461 - 3469
[10] A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution
Ma, Jianqi
Liang, Zhetong
Zhang, Lei
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5901 - 5910

← 1 2 3 4 5 →