Spatial-temporal feature refine network for single image super-resolution

被引:2
|
作者
Qin, Jiayi [1 ,2 ]
Chen, Lihui [1 ,2 ]
Liu, Kai [3 ]
Jeon, Gwanggil [4 ,5 ]
Yang, Xiaomin [1 ,2 ]
机构
[1] Sichuan Univ, Coll Elect & Informat Engn, Chengdu 610065, Sichuan, Peoples R China
[2] Sichuan Univ, Sch Aeronaut & Astronaut, Chengdu 610065, Sichuan, Peoples R China
[3] Sichuan Univ, Coll Elect Engn, Chengdu 610065, Sichuan, Peoples R China
[4] Incheon Natl Univ, Dept Embedded Syst Engn, Incheon 22012, South Korea
[5] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
关键词
Single image super-resolution; Lightweight convolution neural network; Spatial-temporal feature refine network; Multi-attention enhanced residual block;
D O I
10.1007/s10489-022-03741-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, numerous convolution neural networks (CNNs) for single image super-resolution (SISR) have shown powerful capability in image reconstruction. Especially, accurate and compact networks receive widespread attention due to their superiority in running speed on resources-limited devices. However, in most lightweight CNN-based methods, the limitation of informative features extraction results in mediocre reconstructed images. In this paper, we propose a spatial-temporal feature refine network (STFRN) to alleviate the above problem by extract features from diverse dimensions. Specifically, we first introduce a spatial-temporal stage learning (STSL) of our STFRN in two distinct views: 1) for temporal feature extraction, we enhance the relevance of various stages and introduce a persistent memory via multi-stage learning, thereby boosting its reconstruction capability; 2) for spatial feature extraction, we enlarge the receptive fields by densely deepening the network to capture more context features. In the time dimension, we only maintain the main stage, i.e., the last stage, for efficient training. In addition to STSL, we elaborately design an effective multi-attention enhanced residual block (MERB), which further refines informative features at the depth level. In detail, by combining multiple attention blocks, we achieve the complementary features from diverse views (i.e., in point-wise, channel-wise, and spatial dimensions). Compared with ordinary residual blocks, MERB realizes better results while consuming less computational resources. Like most literature, we also utilize residual learning and dense connection to promote performance. Extensive experiments show our STFRN levering on the STSL and MERBs is superior to other state-of-the-art methods in quality and quantity.
引用
收藏
页码:9668 / 9688
页数:21
相关论文
共 50 条
  • [1] Spatial-temporal feature refine network for single image super-resolution
    Jiayi Qin
    Lihui Chen
    Kai Liu
    Gwanggil Jeon
    Xiaomin Yang
    [J]. Applied Intelligence, 2023, 53 : 9668 - 9688
  • [2] Learning a spatial-temporal symmetry network for video super-resolution
    Xiaohang Wang
    Mingliang Liu
    Pengying Wei
    [J]. Applied Intelligence, 2023, 53 : 3530 - 3544
  • [3] Learning a spatial-temporal symmetry network for video super-resolution
    Wang, Xiaohang
    Liu, Mingliang
    Wei, Pengying
    [J]. APPLIED INTELLIGENCE, 2023, 53 (03) : 3530 - 3544
  • [4] Channel-Wise and Spatial Feature Modulation Network for Single Image Super-Resolution
    Hu, Yanting
    Li, Jie
    Huang, Yuanfei
    Gao, Xinbo
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 3911 - 3927
  • [5] Lightweight Feature Fusion Network for Single Image Super-Resolution
    Yang, Wenming
    Wang, Wei
    Zhang, Xuechen
    Sun, Shuifa
    Liao, Qingmin
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (04) : 538 - 542
  • [6] Efficient Network Removing Feature Redundancy for Single Image Super-Resolution
    Zhou, Yun
    Liang, Tao
    Jiang, Zhuqing
    Men, Aidong
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2022,
  • [7] FSFN: feature separation and fusion network for single image super-resolution
    Kai Zhu
    Zhenxue Chen
    Q. M. Jonathan Wu
    Nannan Wang
    Jie Zhao
    Gan Zhang
    [J]. Multimedia Tools and Applications, 2021, 80 : 31599 - 31618
  • [8] FSFN: feature separation and fusion network for single image super-resolution
    Zhu, Kai
    Chen, Zhenxue
    Wu, Q. M. Jonathan
    Wang, Nannan
    Zhao, Jie
    Zhang, Gan
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (21-23) : 31599 - 31618
  • [9] Spatial-Temporal Super-Resolution Land Cover Mapping With a Local Spatial-Temporal Dependence Model
    Li, Xiaodong
    Ling, Feng
    Foody, Giles M.
    Ge, Yong
    Zhang, Yihang
    Wang, Lihui
    Shi, Lingfei
    Li, Xinyan
    Du, Yun
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (07): : 4951 - 4966
  • [10] Building an End-to-End Spatial-Temporal Convolutional Network for Video Super-Resolution
    Guo, Jun
    Chao, Hongyang
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4053 - 4060