Spatial-temporal feature refine network for single image super-resolution

被引:2
|
作者
Qin, Jiayi [1 ,2 ]
Chen, Lihui [1 ,2 ]
Liu, Kai [3 ]
Jeon, Gwanggil [4 ,5 ]
Yang, Xiaomin [1 ,2 ]
机构
[1] Sichuan Univ, Coll Elect & Informat Engn, Chengdu 610065, Sichuan, Peoples R China
[2] Sichuan Univ, Sch Aeronaut & Astronaut, Chengdu 610065, Sichuan, Peoples R China
[3] Sichuan Univ, Coll Elect Engn, Chengdu 610065, Sichuan, Peoples R China
[4] Incheon Natl Univ, Dept Embedded Syst Engn, Incheon 22012, South Korea
[5] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
关键词
Single image super-resolution; Lightweight convolution neural network; Spatial-temporal feature refine network; Multi-attention enhanced residual block;
D O I
10.1007/s10489-022-03741-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, numerous convolution neural networks (CNNs) for single image super-resolution (SISR) have shown powerful capability in image reconstruction. Especially, accurate and compact networks receive widespread attention due to their superiority in running speed on resources-limited devices. However, in most lightweight CNN-based methods, the limitation of informative features extraction results in mediocre reconstructed images. In this paper, we propose a spatial-temporal feature refine network (STFRN) to alleviate the above problem by extract features from diverse dimensions. Specifically, we first introduce a spatial-temporal stage learning (STSL) of our STFRN in two distinct views: 1) for temporal feature extraction, we enhance the relevance of various stages and introduce a persistent memory via multi-stage learning, thereby boosting its reconstruction capability; 2) for spatial feature extraction, we enlarge the receptive fields by densely deepening the network to capture more context features. In the time dimension, we only maintain the main stage, i.e., the last stage, for efficient training. In addition to STSL, we elaborately design an effective multi-attention enhanced residual block (MERB), which further refines informative features at the depth level. In detail, by combining multiple attention blocks, we achieve the complementary features from diverse views (i.e., in point-wise, channel-wise, and spatial dimensions). Compared with ordinary residual blocks, MERB realizes better results while consuming less computational resources. Like most literature, we also utilize residual learning and dense connection to promote performance. Extensive experiments show our STFRN levering on the STSL and MERBs is superior to other state-of-the-art methods in quality and quantity.
引用
收藏
页码:9668 / 9688
页数:21
相关论文
共 50 条
  • [31] Balanced Spatial Feature Distillation and Pyramid Attention Network for Lightweight Image Super-resolution
    Gendy, Garas
    Sabor, Nabil
    Hou, Jingchao
    He, Guanghui
    NEUROCOMPUTING, 2022, 509 (157-166) : 157 - 166
  • [32] Hyperspectral Image Super-Resolution Based on Spatial-Spectral Feature Extraction Network
    LI Yanshan
    CHEN Shifu
    LUO Wenhan
    ZHOU Li
    XIE Weixin
    Chinese Journal of Electronics, 2023, 32 (03) : 415 - 428
  • [33] Nonlocal-guided enhanced interaction spatial-temporal network for compressed video super-resolution
    Junxiong Cheng
    Shuhua Xiong
    Xiaohai He
    Chao Ren
    Tingrong Zhang
    Honggang Chen
    Applied Intelligence, 2023, 53 : 24407 - 24421
  • [34] Nonlocal-guided enhanced interaction spatial-temporal network for compressed video super-resolution
    Cheng, Junxiong
    Xiong, Shuhua
    He, Xiaohai
    Ren, Chao
    Zhang, Tingrong
    Chen, Honggang
    APPLIED INTELLIGENCE, 2023, 53 (20) : 24407 - 24421
  • [35] Momentum feature comparison network based on generative adversarial network for single image super-resolution*,**
    Wang, Cailing
    Shen, Qi
    Wang, Xingbo
    Jiang, Guoping
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 106
  • [36] Explore spatial-temporal relations: transient super-resolution with PMD sensors
    Han, Chaosheng
    Lin, Xing
    Lin, Jingyu
    Yan, Chenggang
    Dai, Qionghai
    OPTOELECTRONIC IMAGING AND MULTIMEDIA TECHNOLOGY III, 2014, 9273
  • [37] Video super-resolution based on spatial-temporal recurrent residual networks
    Yang, Wenhan
    Feng, Jiashi
    Xie, Guosen
    Liu, Jiaying
    Guo, Zongming
    Yan, Shuicheng
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 168 : 79 - 92
  • [38] Cross-resolution feature attention network for image super-resolution
    Liu, Anqi
    Li, Sumei
    Chang, Yongli
    VISUAL COMPUTER, 2023, 39 (09): : 3837 - 3849
  • [39] Cross-resolution feature attention network for image super-resolution
    Anqi Liu
    Sumei Li
    Yongli Chang
    The Visual Computer, 2023, 39 : 3837 - 3849
  • [40] Lightweight hierarchical residual feature fusion network for single-image super-resolution
    Qin, Jiayi
    Liu, Feiqiang
    Liu, Kai
    Jeon, Gwanggil
    Yang, Xiaomin
    NEUROCOMPUTING, 2022, 478 : 104 - 123