Soft-edge-guided significant coordinate attention network for scene text image super-resolution

Cited by: 1
Authors
Xi, Chenchen [1]
Zhang, Kaibing [1, 2, 3]
He, Xin [1]
Hu, Yanting [4]
Chen, Jinguang [2, 3]
Affiliations
[1] Xian Polytech Univ, Sch Elect & Informat, Xian 710048, Peoples R China
[2] Xian Polytech Univ, Sch Comp Sci, Shaanxi Key Lab Clothing Intelligence, Xian 710048, Peoples R China
[3] Xian Polytech Univ, Sch Elect & Informat, Xian 710048, Peoples R China
[4] Xinjiang Med Univ, Sch Med Engn & Technol, Urumqi 830054, Peoples R China
Source
VISUAL COMPUTER | 2024, Vol. 40, No. 08
Funding
National Natural Science Foundation of China
Keywords
Scene text image super-resolution; Scene text recognition; Soft edge; Significant coordinate attention; NEURAL-NETWORK; CLASSIFICATION;
DOI
10.1007/s00371-023-03111-6
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Scene text image super-resolution (STISR) aims to enhance the resolution and visual quality of low-resolution scene text images, thereby improving the performance of text-related downstream vision tasks. However, many existing STISR methods treat scene text images as general images and ignore text-specific properties such as the particular structure of text images. Although some methods introduce an edge detection operator to obtain hard edges for improving the quality of super-resolved images, the extracted hard edges are binary and prone to aliasing. In view of these considerations, we propose a novel soft-edge-guided significant coordinate attention network for STISR. Specifically, we use soft edges to assist text image super-resolution; these are probabilistic edges that give a more complete description of the edges in text images. In addition, some existing approaches exploit both channel and spatial attention for effective image enhancement, but they ignore the location information hidden in text images. To explore the key position-dependent features embedded in scene text images, we incorporate coordinate attention into the STISR process, which can capture long-range dependencies along one spatial direction while retaining precise position information along the other. Furthermore, we propose a new attention mechanism, called significant coordinate attention, that enables the network to focus more on significant text regions. Extensive experimental results demonstrate that the proposed method performs favorably against state-of-the-art methods in both quantitative and qualitative assessments. The code will be available at https://github.com/kbzhang0505/SegSCoAN.
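The directional idea behind coordinate attention, as the abstract describes it, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `directional_pool` and `coordinate_gate` are hypothetical, the input is a single-channel toy feature map, and the learned convolutions of a real coordinate attention block are omitted. Each row and each column is averaged separately, so the pooled descriptor along one axis still records the position along the other.

```python
import math


def sigmoid(v):
    """Logistic function used to turn a pooled value into a gate in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-v))


def directional_pool(feature):
    """Average a 2-D map (list of rows) along each spatial axis separately.

    Returns one descriptor per row (pooled over width) and one per column
    (pooled over height). Hypothetical helper, for illustration only.
    """
    height, width = len(feature), len(feature[0])
    row_desc = [sum(row) / width for row in feature]
    col_desc = [sum(feature[i][j] for i in range(height)) / height
                for j in range(width)]
    return row_desc, col_desc


def coordinate_gate(feature):
    """Reweight every position by its row gate and its column gate.

    Assumption: a real block would pass the descriptors through learned
    layers before the sigmoid; here the sigmoid is applied directly.
    """
    row_desc, col_desc = directional_pool(feature)
    row_gate = [sigmoid(v) for v in row_desc]
    col_gate = [sigmoid(v) for v in col_desc]
    return [[feature[i][j] * row_gate[i] * col_gate[j]
             for j in range(len(col_gate))]
            for i in range(len(row_gate))]


# Toy 3x4 map with a "text stroke" in the middle row, columns 1 and 2.
feature = [
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 2.0, 2.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
]
row_desc, col_desc = directional_pool(feature)
gated = coordinate_gate(feature)
```

In this toy input the row descriptor peaks at the middle row and the column descriptor peaks at the two middle columns, so the two gates together localize the stroke in both directions.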
Pages: 5393-5406
Page count: 14