Scene text image super-resolution via textual reasoning and multiscale cross-convolution

被引：0

作者：

Yu, Lan ^{[1
]}

Li, Xiaojie ^{[1
]}

Yu, Qi ^{[1
]}

Li, Guangju ^{[1
]}

Jin, Dehu ^{[1
]}

Qi, Meng ^{[1
]}

机构：

[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan, Peoples R China

来源：

APPLIED INTELLIGENCE | 2024年 / 54卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Scene text image super-resolution; Textual reasoning; Multiscale cross-convolution; Progressively hierarchical exploration; NEURAL-NETWORK;

D O I：

10.1007/s10489-023-05251-7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Scene text image super-resolution aims to upgrade the visual quality of low-resolution images and contributes to the accuracy of the subsequent scene text recognition task. However, advanced super-resolution methods with more attention to text-oriented information still have challenges in extremely blurred images. To address this problem, we propose a novel network based on textual reasoning and multiscale cross-convolution (TRMCC), in which a text structure preservation module is designed to explore the correlation of horizontal features among layers to enhance the structural similarity between the reconstructions and the corresponding high-resolution (HR) images and the multiscale cross-convolution block explores structural information hierarchically in layers with various perceptual fields in a progressive manner. In addition, based on human behavior in the presence of blurred images with linguistic rules, the text semantic reasoning module incorporated a self-attention mechanism and language-based textual reasoning to improve the accuracy of textual prior information. Comprehensive experiments conducted on the real-scene text image dataset TextZoom demonstrated the superiority of our model compared with existing state-of-the-art models, especially on structural similarity and information integrity.

引用

页码：1997 / 2008

页数：12

共 50 条

[1] Scene text image super-resolution via textual reasoning and multiscale cross-convolution
Lan Yu
Xiaojie Li
Qi Yu
Guangju Li
Dehu Jin
Meng Qi
[J]. Applied Intelligence, 2024, 54 : 1997 - 2008
[2] Text Prior Guided Scene Text Image Super-Resolution
Ma, Jianqi
Guo, Shi
Zhang, Lei
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1341 - 1353
[3] Scene Text Telescope: Text-Focused Scene Image Super-Resolution
Chen, Jingye
Li, Bin
Xue, Xiangyang
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12021 - 12030
[4] Scene Text Image Super-Resolution via Parallelly Contextual Attention Network
Zhao, Cairong
Feng, Shuyang
Zhao, Brian Nlong
Ding, Zhijun
Wu, Jun
Shen, Fuming
Shen, Heng Tao
[J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2908 - 2917
[5] Advancing scene text image super-resolution via edge enhancement priors
Li, Hongjun
Li, Shangfeng
[J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 8241 - 8250
[6] Batch-transformer for scene text image super-resolution
Sun, Yaqi
Xie, Xiaolan
Li, Zhi
Yang, Kai
[J]. VISUAL COMPUTER, 2024, 40 (10): : 7399 - 7409
[7] Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement
Guo, Hang
Dai, Tao
Meng, Guanghao
Xia, Shu-Tao
[J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 782 - 790
[8] TextSRNet: Scene Text Super-Resolution Based on Contour Prior and Atrous Convolution
Ma, Jizhao
Jin, Lianwen
Zhang, Jiaxin
Jiang, Jiajia
Xue, Yang
He, Mengchao
[J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3252 - 3258
[9] Text Gestalt: Stroke-Aware Scene Text Image Super-resolution
Chen, Jingye
Yu, Haiyang
Ma, Jianqi
Li, Bin
Xue, Xiangyang
[J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 285 - 293
[10] Text-Enhanced Scene Image Super-Resolution via Stroke Mask and Orthogonal Attention
Shu, Rui
Zhao, Cairong
Feng, Shuyang
Zhu, Liang
Miao, Duoqian
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 6317 - 6330

← 1 2 3 4 5 →