Soft-edge-guided significant coordinate attention network for scene text image super-resolution

被引:1
|
作者
Xi, Chenchen [1 ]
Zhang, Kaibing [1 ,2 ,3 ]
He, Xin [1 ]
Hu, Yanting [4 ]
Chen, Jinguang [2 ,3 ]
机构
[1] Xian Polytech Univ, Sch Elect & Informat, Xian 710048, Peoples R China
[2] Xian Polytech Univ, Sch Comp Sci, Shaanxi Key Lab Clothing Intelligence, Xian 710048, Peoples R China
[3] Xian Polytech Univ, Sch Elect & Informat, Xian 710048, Peoples R China
[4] Xinjiang Med Univ, Sch Med Engn & Technol, Urumqi 830054, Peoples R China
来源
VISUAL COMPUTER | 2024年 / 40卷 / 08期
基金
中国国家自然科学基金;
关键词
Scene text image super-resolution; Scene text recognition; Soft edge; Significant coordinate attention; NEURAL-NETWORK; CLASSIFICATION;
D O I
10.1007/s00371-023-03111-6
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Scene text image super-resolution (STISR) aims to enhance the resolution and visual quality of low-resolution scene text images, thereby improving the performance of some text-related downstream vision tasks. However, many existing STISR methods treat scene text images as general images while ignoring text-specific properties such as the particular structure of text images. Although some methods elaborated on introducing a certain edge detection operator to obtain the hard edges for improving the quality of super-resolved images, the extracted hard edges are binary and prone to generate aliasing edges. In view of the above considerations, we propose a novel soft-edge-guided significant coordinate attention network for STISR. Specifically, we apply soft edges to assist text image super-resolution, which is the probabilistic edges that can reflect a complete edge description on text images. In addition, some proposed approaches exploit both channel and spatial attention for effective image enhancement, but they all ignore the location information hiding in text images. To explore the key position-dependent features embedded in scene text images, we elaborately incorporate the coordinate attention into the process of STISR, which can capture long-term dependencies in one spatial direction while retaining precise position information in another one. Furthermore, we propose a new attention mechanism, called significant coordinate attention, to enable the network to focus more on the significant text region. The extensive experimental results demonstrate that our newly proposed method performs favorably against state-of-the-art methods in terms of both quantitative and qualitative assessments. The code will be available at https://github.com/kbzhang0505/SegSCoAN.
引用
收藏
页码:5393 / 5406
页数:14
相关论文
共 50 条
  • [1] Text Prior Guided Scene Text Image Super-Resolution
    Ma, Jianqi
    Guo, Shi
    Zhang, Lei
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1341 - 1353
  • [2] A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution
    Ma, Jianqi
    Liang, Zhetong
    Zhang, Lei
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5901 - 5910
  • [3] Scene Text Image Super-Resolution via Parallelly Contextual Attention Network
    Zhao, Cairong
    Feng, Shuyang
    Zhao, Brian Nlong
    Ding, Zhijun
    Wu, Jun
    Shen, Fuming
    Shen, Heng Tao
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2908 - 2917
  • [4] Edge Attention Network for Image Deblurring and Super-Resolution
    Han, Jong-Wook
    Choi, Jun-Ho
    Kim, Jun-Hyuk
    Lee, Jong-Seok
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 2401 - 2406
  • [5] Deep coordinate attention network for single image super-resolution
    Xie, Chao
    Zhu, Hongyu
    Fei, Yeqi
    [J]. IET IMAGE PROCESSING, 2022, 16 (01) : 273 - 284
  • [6] Perceiving Multiple Representations for scene text image super-resolution guided by text recognizer
    Shi, Qin
    Zhu, Yu
    Liu, Yatong
    Ye, Jiongyao
    Yang, Dawei
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 124
  • [7] Lightweight Attention-Guided Network for Image Super-Resolution
    Ding, Zixuan
    Juan, Zhang
    Xiang, Li
    Wang, Xinyu
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (14)
  • [8] Advancing scene text image super-resolution via edge enhancement priors
    Li, Hongjun
    Li, Shangfeng
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 8241 - 8250
  • [9] Scene Text Telescope: Text-Focused Scene Image Super-Resolution
    Chen, Jingye
    Li, Bin
    Xue, Xiangyang
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12021 - 12030
  • [10] Edge-Enhanced with Feedback Attention Network for Image Super-Resolution
    Fu, Chunmei
    Yin, Yong
    [J]. SENSORS, 2021, 21 (06) : 1 - 16