CycMuNet plus : Cycle-Projected Mutual Learning for Spatial-Temporal Video Super-Resolution

被引:14
|
作者
Hu, Mengshun [1 ]
Jiang, Kui [2 ]
Wang, Zheng [1 ]
Bai, Xiang [3 ]
Hu, Ruimin [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Superresolution; Optical flow; Task analysis; Image reconstruction; Correlation; Iterative methods; Performance evaluation; Spatial-temporal video super-resolution; spatial video super-resolution; temporal video super-resolution; cycle-projected; mutual learning; up-projection unit; down-projection unit; SPACE-TIME SUPERRESOLUTION;
D O I
10.1109/TPAMI.2023.3293522
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spatial-Temporal Video Super-Resolution (ST-VSR) aims to generate high-quality videos with higher resolution (HR) and higher frame rate (HFR). Quite intuitively, pioneering two-stage based methods complete ST-VSR by directly combining two sub-tasks: Spatial Video Super-Resolution (S-VSR) and Temporal Video Super-Resolution (T-VSR) but ignore the reciprocal relations among them. 1) T-VSR to S-VSR: temporal correlations help accurate spatial detail representation; 2) S-VSR to T-VSR: abundant spatial information contributes to the refinement of temporal prediction. To this end, we propose a one-stage based Cycle-projected Mutual learning network (CycMuNet) for ST-VSR, which makes full use of spatial-temporal correlations via the mutual learning between S-VSR and T-VSR. Specifically, we propose to exploit the mutual information among them via iterative up- and down projections, where spatial and temporal features are fully fused and distilled, helping high-quality video reconstruction. In addition, we also show interesting extensions for efficient network design (CycMuNet+), such as parameter sharing and dense connection on projection units and feedback mechanism in CycMuNet. Besides extensive experiments on benchmark datasets, we also compare our proposed CycMuNet (+) with S-VSR and T-VSR tasks, demonstrating that our method significantly outperforms the state-of-the-art methods.
引用
收藏
页码:13376 / 13392
页数:17
相关论文
共 50 条
  • [1] Spatial-Temporal Space Hand-in-Hand: Spatial-Temporal Video Super-Resolution via Cycle-Projected Mutual Learning
    Hu, Mengshun
    Jiang, Kui
    Liao, Liang
    Xiao, Jing
    Jiang, Junjun
    Wang, Zheng
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3564 - 3573
  • [2] Learning a spatial-temporal symmetry network for video super-resolution
    Xiaohang Wang
    Mingliang Liu
    Pengying Wei
    [J]. Applied Intelligence, 2023, 53 : 3530 - 3544
  • [3] Learning a spatial-temporal symmetry network for video super-resolution
    Wang, Xiaohang
    Liu, Mingliang
    Wei, Pengying
    [J]. APPLIED INTELLIGENCE, 2023, 53 (03) : 3530 - 3544
  • [4] Conditional Neural Video Coding with Spatial-Temporal Super-Resolution
    Wang, Henan
    Pan, Xiaohan
    Feng, Runsen
    Guo, Zongyu
    Chen, Zhibo
    [J]. 2024 DATA COMPRESSION CONFERENCE, DCC, 2024, : 591 - 591
  • [5] CTVSR: Collaborative Spatial-Temporal Transformer for Video Super-Resolution
    Tang, Jun
    Lu, Chenyan
    Liu, Zhengxue
    Li, Jiale
    Dai, Hang
    Ding, Yong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 5018 - 5032
  • [6] Deformable Spatial-Temporal Attention for Lightweight Video Super-Resolution
    Xue, Tong
    Huang, Xinyi
    Li, Dengshi
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT X, 2024, 14434 : 482 - 493
  • [7] Video super-resolution based on spatial-temporal recurrent residual networks
    Yang, Wenhan
    Feng, Jiashi
    Xie, Guosen
    Liu, Jiaying
    Guo, Zongming
    Yan, Shuicheng
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 168 : 79 - 92
  • [8] Learning Spatial-Temporal Implicit Neural Representations for Event-Guided Video Super-Resolution
    Lu, Yunfan
    Wang, Zipeng
    Liu, Minjie
    Wang, Hongjian
    Wang, Lin
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1557 - 1567
  • [9] Video super-resolution via mixed spatial-temporal convolution and selective fusion
    Sun, Wei
    Gong, Dong
    Shi, Javen Qinfeng
    van den Hengel, Anton
    Zhang, Yanning
    [J]. PATTERN RECOGNITION, 2022, 126
  • [10] Fine-grained video super-resolution via spatial-temporal learning and image detail enhancement
    Yeh, Chia -Hung
    Yang, Hsin-Fu
    Lin, Yu -Yang
    Huang, Wan-Jen
    Tsai, Feng-Hsu
    Kang, Li - Wei
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 131