Scene text image super-resolution via textual reasoning and multiscale cross-convolution

被引:0
|
作者
Yu, Lan [1 ]
Li, Xiaojie [1 ]
Yu, Qi [1 ]
Li, Guangju [1 ]
Jin, Dehu [1 ]
Qi, Meng [1 ]
机构
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan, Peoples R China
基金
中国国家自然科学基金;
关键词
Scene text image super-resolution; Textual reasoning; Multiscale cross-convolution; Progressively hierarchical exploration; NEURAL-NETWORK;
D O I
10.1007/s10489-023-05251-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene text image super-resolution aims to upgrade the visual quality of low-resolution images and contributes to the accuracy of the subsequent scene text recognition task. However, advanced super-resolution methods with more attention to text-oriented information still have challenges in extremely blurred images. To address this problem, we propose a novel network based on textual reasoning and multiscale cross-convolution (TRMCC), in which a text structure preservation module is designed to explore the correlation of horizontal features among layers to enhance the structural similarity between the reconstructions and the corresponding high-resolution (HR) images and the multiscale cross-convolution block explores structural information hierarchically in layers with various perceptual fields in a progressive manner. In addition, based on human behavior in the presence of blurred images with linguistic rules, the text semantic reasoning module incorporated a self-attention mechanism and language-based textual reasoning to improve the accuracy of textual prior information. Comprehensive experiments conducted on the real-scene text image dataset TextZoom demonstrated the superiority of our model compared with existing state-of-the-art models, especially on structural similarity and information integrity.
引用
收藏
页码:1997 / 2008
页数:12
相关论文
共 50 条
  • [1] Scene text image super-resolution via textual reasoning and multiscale cross-convolution
    Lan Yu
    Xiaojie Li
    Qi Yu
    Guangju Li
    Dehu Jin
    Meng Qi
    [J]. Applied Intelligence, 2024, 54 : 1997 - 2008
  • [2] Text Prior Guided Scene Text Image Super-Resolution
    Ma, Jianqi
    Guo, Shi
    Zhang, Lei
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1341 - 1353
  • [3] Scene Text Telescope: Text-Focused Scene Image Super-Resolution
    Chen, Jingye
    Li, Bin
    Xue, Xiangyang
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12021 - 12030
  • [4] Scene Text Image Super-Resolution via Parallelly Contextual Attention Network
    Zhao, Cairong
    Feng, Shuyang
    Zhao, Brian Nlong
    Ding, Zhijun
    Wu, Jun
    Shen, Fuming
    Shen, Heng Tao
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2908 - 2917
  • [5] Advancing scene text image super-resolution via edge enhancement priors
    Li, Hongjun
    Li, Shangfeng
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 8241 - 8250
  • [6] Batch-transformer for scene text image super-resolution
    Sun, Yaqi
    Xie, Xiaolan
    Li, Zhi
    Yang, Kai
    [J]. VISUAL COMPUTER, 2024, 40 (10): : 7399 - 7409
  • [7] Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement
    Guo, Hang
    Dai, Tao
    Meng, Guanghao
    Xia, Shu-Tao
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 782 - 790
  • [8] TextSRNet: Scene Text Super-Resolution Based on Contour Prior and Atrous Convolution
    Ma, Jizhao
    Jin, Lianwen
    Zhang, Jiaxin
    Jiang, Jiajia
    Xue, Yang
    He, Mengchao
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3252 - 3258
  • [9] Text Gestalt: Stroke-Aware Scene Text Image Super-resolution
    Chen, Jingye
    Yu, Haiyang
    Ma, Jianqi
    Li, Bin
    Xue, Xiangyang
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 285 - 293
  • [10] Text-Enhanced Scene Image Super-Resolution via Stroke Mask and Orthogonal Attention
    Shu, Rui
    Zhao, Cairong
    Feng, Shuyang
    Zhu, Liang
    Miao, Duoqian
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 6317 - 6330