Multi-Stage Feature Fusion Network for Video Super-Resolution

被引:38
|
作者
Song, Huihui [1 ,2 ]
Xu, Wenjie [1 ,2 ]
Liu, Dong [3 ]
Liu, Bo [4 ]
Liu, Qingshan [1 ,2 ]
Metaxas, Dimitris N. [5 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Jiangsu Key Lab Big Data Anal Technol B DAT, Nanjing 210044, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Jiangsu Collaborat Innovat Ctr Atmospher Environm, Nanjing 210044, Peoples R China
[3] Netflix Inc, Los Gatos, CA 95032 USA
[4] JD Finance Amer Corp, Mountain View, CA 94089 USA
[5] Rutgers State Univ, Dept Comp Sci, Piscataway, NJ 08854 USA
基金
中国国家自然科学基金;
关键词
Visualization; Convolution; Superresolution; Task analysis; Fuses; Feature extraction; Modulation; Video super-resolution; single image super-resolution; deep learning; deformable convolution; feature fusion; QUALITY ASSESSMENT;
D O I
10.1109/TIP.2021.3056868
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video super-resolution (VSR) is to restore a photo-realistic high-resolution (HR) frame from both its corresponding low-resolution (LR) frame (reference frame) and multiple neighboring frames (supporting frames). An important step in VSR is to fuse the feature of the reference frame with the features of the supporting frames. The major issue with existing VSR methods is that the fusion is conducted in a one-stage manner, and the fused feature may deviate greatly from the visual information in the original LR reference frame. In this paper, we propose an end-to-end Multi-Stage Feature Fusion Network that fuses the temporally aligned features of the supporting frames and the spatial feature of the original reference frame at different stages of a feed-forward neural network architecture. In our network, the Temporal Alignment Branch is designed as an inter-frame temporal alignment module used to mitigate the misalignment between the supporting frames and the reference frame. Specifically, we apply the multi-scale dilated deformable convolution as the basic operation to generate temporally aligned features of the supporting frames. Afterwards, the Modulative Feature Fusion Branch, the other branch of our network accepts the temporally aligned feature map as a conditional input and modulates the feature of the reference frame at different stages of the branch backbone. This enables the feature of the reference frame to be referenced at each stage of the feature fusion process, leading to an enhanced feature from LR to HR. Experimental results on several benchmark datasets demonstrate that our proposed method can achieve state-of-the-art performance on VSR task.
引用
收藏
页码:2923 / 2934
页数:12
相关论文
共 50 条
  • [21] Feature Fusion Attention Network for Image Super-resolution
    Zhou D.-W.
    Ma L.-Y.
    Tian J.-Y.
    Sun X.-X.
    [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (09): : 2233 - 2241
  • [22] OSFFNet: Omni-Stage Feature Fusion Network for Lightweight Image Super-Resolution
    Wang, Yang
    Zhang, Tao
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5660 - 5668
  • [23] Continuous Space-Time Video Super-Resolution with Multi-Stage Motion Information Reorganization
    Zhang, Yuantong
    Yang, Daiqin
    Chen, Zhenzhong
    Ding, Wenpeng
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (09)
  • [24] s-LMPNet: a super-lightweight multi-stage progressive network for image super-resolution
    Li, Meng
    Ma, Bo
    Liu, Ying
    Zhang, Yulin
    [J]. APPLIED INTELLIGENCE, 2023, 53 (11) : 13378 - 13397
  • [25] s-LMPNet: a super-lightweight multi-stage progressive network for image super-resolution
    Meng Li
    Bo Ma
    Ying Liu
    Yulin Zhang
    [J]. Applied Intelligence, 2023, 53 : 13378 - 13397
  • [26] Single image super-resolution with multi-level feature fusion recursive network
    Jin, Xin
    Xiong, Qiming
    Xiong, Chengyi
    Li, Zhibang
    Gao, Zhirong
    [J]. NEUROCOMPUTING, 2019, 370 : 166 - 173
  • [27] Multi-Residual Feature Fusion Network for lightweight Single Image Super-Resolution
    Qin, Jiayi
    He, Zheng
    Yan, Binyu
    Jeon, Gwanggil
    Yang, Xiaomin
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1511 - 1518
  • [28] A deep recursive multi-scale feature fusion network for image super-resolution?
    Liu, Feiqiang
    Yang, Xiaomin
    De Baets, Bernard
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 90
  • [29] Multi-scale feature fusion residual network for Single Image Super-Resolution
    Qin, Jinghui
    Huang, Yongjie
    Wen, Wushao
    [J]. NEUROCOMPUTING, 2020, 379 (379) : 334 - 342
  • [30] Compressed multi-scale feature fusion network for single image super-resolution
    Fan, Xinxia
    Yang, Yanhua
    Deng, Cheng
    Xu, Jie
    Gao, Xinbo
    [J]. SIGNAL PROCESSING, 2018, 146 : 50 - 60