Nonlocal-guided enhanced interaction spatial-temporal network for compressed video super-resolution

被引:0
|
作者
Cheng, Junxiong [1 ]
Xiong, Shuhua [1 ]
He, Xiaohai [1 ]
Ren, Chao [1 ]
Zhang, Tingrong [1 ]
Chen, Honggang [1 ]
机构
[1] Sichuan Univ, Coll Elect & Informat Engn, Chengdu 610065, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Compression artifacts reduction; Video super-resolution; Nonlocal-guided enhanced interaction; Spatial-temporal network; EFFICIENCY; RESOLUTION;
D O I
10.1007/s10489-023-04798-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although deep-learning based video super-resolution (VSR) studies have achieved excellent progress in recent years, the majority of them do not take into account the impact of lossy compression. A large number of real-world videos are characterized by compression artifacts (e.g., blocking, ringing, and blurring) due to transmission bandwidth or storage capacity limitations, which makes the VSR task more challenging. To balance compression artifacts reduction and detail preservation, this paper proposes a nonlocal-guided enhanced interaction spatial-temporal network for compressed video super-resolution (EISTNet). EISTNet consists of the nonlocal-guided enhanced interaction feature extraction module (EIFEM) and the attention-based multi-channel feature self-calibration module (MCFSM). The pixel-shuffle-based nonlocal feature guidance module (PNFGM) is designed to explore the nonlocal similarity of video sequences and then it is used to guide the extraction and fusion of inter-frame spatial-temporal information in EIFEM. Considering that compression noise and video content are strongly correlated, MCFSM introduces features from the compression artifacts reduction stage for recalibration and adaptive fusion, which closely associates the two parts of the network. To reduce the computational memory pressure on nonlocal modules, we add pixel-shuffle operation to PNFGM, which also expands its receptive field. Experimental results demonstrate that our method achieves better performance compared to the existing methods.
引用
收藏
页码:24407 / 24421
页数:15
相关论文
共 50 条
  • [1] Nonlocal-guided enhanced interaction spatial-temporal network for compressed video super-resolution
    Junxiong Cheng
    Shuhua Xiong
    Xiaohai He
    Chao Ren
    Tingrong Zhang
    Honggang Chen
    [J]. Applied Intelligence, 2023, 53 : 24407 - 24421
  • [2] Learning a spatial-temporal symmetry network for video super-resolution
    Xiaohang Wang
    Mingliang Liu
    Pengying Wei
    [J]. Applied Intelligence, 2023, 53 : 3530 - 3544
  • [3] Learning a spatial-temporal symmetry network for video super-resolution
    Wang, Xiaohang
    Liu, Mingliang
    Wei, Pengying
    [J]. APPLIED INTELLIGENCE, 2023, 53 (03) : 3530 - 3544
  • [4] Conditional Neural Video Coding with Spatial-Temporal Super-Resolution
    Wang, Henan
    Pan, Xiaohan
    Feng, Runsen
    Guo, Zongyu
    Chen, Zhibo
    [J]. 2024 DATA COMPRESSION CONFERENCE, DCC, 2024, : 591 - 591
  • [5] CTVSR: Collaborative Spatial-Temporal Transformer for Video Super-Resolution
    Tang, Jun
    Lu, Chenyan
    Liu, Zhengxue
    Li, Jiale
    Dai, Hang
    Ding, Yong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 5018 - 5032
  • [6] Deformable Spatial-Temporal Attention for Lightweight Video Super-Resolution
    Xue, Tong
    Huang, Xinyi
    Li, Dengshi
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT X, 2024, 14434 : 482 - 493
  • [7] You Only Align Once: Bidirectional Interaction for Spatial-Temporal Video Super-Resolution
    Hu, Mengshun
    Jiang, Kui
    Nie, Zhixiang
    Wang, Zheng
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
  • [8] Enhanced Video Super-Resolution Network towards Compressed Data
    Li, Feng
    Wu, Yixuan
    Li, Anqi
    Bai, Huihui
    Cong, Runmin
    Zhao, Yao
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)
  • [9] Building an End-to-End Spatial-Temporal Convolutional Network for Video Super-Resolution
    Guo, Jun
    Chao, Hongyang
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4053 - 4060
  • [10] Learning Spatial-Temporal Implicit Neural Representations for Event-Guided Video Super-Resolution
    Lu, Yunfan
    Wang, Zipeng
    Liu, Minjie
    Wang, Hongjian
    Wang, Lin
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1557 - 1567