Multi-Temporal Ultra Dense Memory Network for Video Super-Resolution

被引:0
|
作者
Yi, Peng [1 ]
Wang, Zhongyuan [1 ]
Jiang, Kui [1 ]
Shao, Zhenfeng [2 ]
Ma, Jiayi [3 ]
机构
[1] National Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University, Wuhan, China
[2] State Key Laboratory for Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, China
[3] Electronic Information School, Wuhan University, Wuhan, China
关键词
Convolution;
D O I
暂无
中图分类号
学科分类号
摘要
Video super-resolution (SR) aims to reconstruct the corresponding high-resolution (HR) frames from consecutive low-resolution (LR) frames. It is crucial for video SR to harness both inter-frame temporal correlations and intra-frame spatial correlations among frames. Previous video SR methods based on convolutional neural network (CNN) mostly adopt a single-channel structure and a single memory module, so they are unable to fully exploit inter-frame temporal correlations specific for video. To this end, this paper proposes a multi-temporal ultra-dense memory (MTUDM) network for video super-resolution. Particularly, we embed convolutional long-short-term memory (ConvLSTM) into ultra-dense residual block (UDRB) to construct an ultra-dense memory block (UDMB) for extracting and retaining spatio-temporal correlations. This design also reduces the layer depth by expanding the width, thus avoiding training difficulties, such as gradient exploding and vanishing under a large model. We further adopt multi-temporal information fusion (MTIF) strategy to merge the extracted temporal feature maps in consecutive frames, improving the accuracy without requiring much extra computational cost. The experimental results on extensive public datasets demonstrate that our method outperforms the state-of-the-art methods by a large margin. © 1991-2012 IEEE.
引用
收藏
页码:2503 / 2516
相关论文
共 50 条
  • [41] Robust Video Super-Resolution with Learned Temporal Dynamics
    Liu, Ding
    Wang, Zhaowen
    Fan, Yuchen
    Liu, Xianming
    Wang, Zhangyang
    Chang, Shiyu
    Huang, Thomas
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2526 - 2534
  • [42] Temporal Kernel Consistency for Blind Video Super-Resolution
    Xiang, Lichuan
    Lee, Royson
    Abdelfattah, Mohamed S.
    Lane, Nicholas D.
    Wen, Hongkai
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3470 - 3479
  • [43] ADAPTIVE INCREMENTAL VIDEO SUPER-RESOLUTION WITH TEMPORAL CONSISTENCY
    Su, Heng
    Wu, Ying
    Zhou, Jie
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 1149 - 1152
  • [44] Image Super-Resolution Based on Dense Convolutional Network
    Li, Jie
    Zhou, Yue
    PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 134 - 145
  • [45] Dense-Gated Network for Image Super-Resolution
    Shumin Fan
    Tianyu Song
    Pengpeng Li
    Jiyu Jin
    Guiyue Jin
    Zhongmin Zhu
    Neural Processing Letters, 2023, 55 : 11845 - 11861
  • [46] Pyramid Attention Dense Network for Image Super-Resolution
    Chen, Si-Bao
    Hu, Chao
    Luo, Bin
    Ding, Chris H. Q.
    Huang, Shi-Lei
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [47] Medical image super-resolution with laplacian dense network
    Rui Tang
    Lihui Chen
    Rongzhu Zhang
    Awais Ahmad
    Marcelo Keese Albertini
    Xiaomin Yang
    Multimedia Tools and Applications, 2022, 81 : 3131 - 3144
  • [48] Dense-Gated Network for Image Super-Resolution
    Fan, Shumin
    Song, Tianyu
    Li, Pengpeng
    Jin, Jiyu
    Jin, Guiyue
    Zhu, Zhongmin
    NEURAL PROCESSING LETTERS, 2023, 55 (09) : 11845 - 11861
  • [49] Taper Residual Dense Network for Audio Super-Resolution
    Yang, Junmei
    Lin, Haosen
    Peng, Yiming
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PART X, 2023, 14263 : 432 - 443
  • [50] Stratified attention dense network for image super-resolution
    Zhiwei Liu
    Xiaofeng Mao
    Ji Huang
    Menghan Gan
    Yueyuan Zhang
    Signal, Image and Video Processing, 2022, 16 : 715 - 722