Video Super-Resolution via a Spatio-Temporal Alignment Network

被引:27
|
作者
Wen, Weilei [1 ]
Ren, Wenqi [2 ,3 ]
Shi, Yinghuan [3 ]
Nie, Yunfeng [4 ]
Zhang, Jingang [5 ]
Cao, Xiaochun [2 ]
机构
[1] Nankai Univ, Coll Comp Sci, TKLNDST, Tianjin 300350, Peoples R China
[2] Sun Yat Sen Univ Shenzhen, Sch Cyber Sci & Technol, Shenzhen 518107, Peoples R China
[3] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210093, Peoples R China
[4] Vrije Univ Brussel, Brussels Photon, Dept Appl Phys & Photon, B-1050 Brussels, Belgium
[5] Univ Chinese Acad Sci, Intelligent Imaging Ctr, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Superresolution; Motion compensation; Estimation; Integrated optics; Optical imaging; Image reconstruction; Feature extraction; Video super-resolution; temporal consistency; spatio-temporal adaptive filters;
D O I
10.1109/TIP.2022.3146625
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep convolutional neural network based video super-resolution (SR) models have achieved significant progress in recent years. Existing deep video SR methods usually impose optical flow to wrap the neighboring frames for temporal alignment. However, accurate estimation of optical flow is quite difficult, which tends to produce artifacts in the super-resolved results. To address this problem, we propose a novel end-to-end deep convolutional network that dynamically generates the spatially adaptive filters for the alignment, which are constituted by the local spatio-temporal channels of each pixel. Our method avoids generating explicit motion compensation and utilizes spatio-temporal adaptive filters to achieve the operation of alignment, which effectively fuses the multi-frame information and improves the temporal consistency of the video. Capitalizing on the proposed adaptive filter, we develop a reconstruction network and take the aligned frames as input to restore the high-resolution frames. In addition, we employ residual modules embedded with channel attention as the basic unit to extract more informative features for video SR. Both quantitative and qualitative evaluation results on three public video datasets demonstrate that the proposed method performs favorably against state-of-the-art super-resolution methods in terms of clearness and texture details.
引用
收藏
页码:1761 / 1773
页数:13
相关论文
共 50 条
  • [1] Grouped Spatio-Temporal Alignment Network for Video Super-Resolution
    Lu, Mingxuan
    Zhang, Peng
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2193 - 2197
  • [2] Spatio-Temporal Fusion Network for Video Super-Resolution
    Li, Huabin
    Zhang, Pingjian
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [3] Residual Invertible Spatio-Temporal Network for Video Super-Resolution
    Zhu, Xiaobin
    Li, Zhuangzi
    Zhang, Xiao-Yu
    Li, Changsheng
    Liu, Yaqi
    Xue, Ziyu
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 5981 - 5988
  • [4] Video super-resolution based on a spatio-temporal matching network
    Zhu, Xiaobin
    Li, Zhuangzi
    Lou, Jungang
    Shen, Qing
    [J]. PATTERN RECOGNITION, 2021, 110
  • [5] Fast Spatio-Temporal Residual Network for Video Super-Resolution
    Li, Sheng
    He, Fengxiang
    Du, Bo
    Zhang, Lefei
    Xu, Yonghao
    Tao, Dacheng
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10514 - 10523
  • [6] Efficient Spatio-Temporal Network with Gated Fusion for Video Super-Resolution
    Li, Changyu
    Zhang, Dongyang
    Xie, Ning
    Shao, Jie
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 640 - 651
  • [7] Spatio-temporal Super-resolution Network: Enhance Visual Representations for Video Captioning
    Cao, Quanhui
    Tang, Pengjie
    Wang, Hanli
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 3125 - 3129
  • [8] DSTnet: Deformable Spatio-Temporal Convolutional Residual Network for Video Super-Resolution
    Khan, Anusha
    Sargano, Allah Bux
    Habib, Zulfiqar
    [J]. MATHEMATICS, 2021, 9 (22)
  • [9] A multi-stage spatio-temporal adaptive network for video super-resolution
    Zhang, Yuhang
    Chen, Zhenzhong
    Liu, Shan
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
  • [10] Super-Resolution Reconstruction for Spatio-Temporal Resolution Enhancement of Video Sequences
    Haseyama, Miki
    Izumi, Daisuke
    Takizawa, Makoto
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (09): : 2355 - 2358