Multi-Level Alignments for Compressed Video Super-Resolution

被引:0
|
作者
Wei L. [1 ]
Ye M. [1 ]
Ji L. [1 ]
Gan Y. [2 ]
Li S. [3 ]
Li X. [4 ]
机构
[1] School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu
[2] College of Computer Science, Chongqing University, Chongqing
[3] School of Control Science and Engineering, Shandong University, Jinan
[4] School of Electrical Engineering and Computer Science, The University of Queensland, Brisbane, QLD
基金
中国国家自然科学基金;
关键词
Compressed video quality enhancement; Compressed video super-resolution; Transformer;
D O I
10.1109/TCE.2024.3411144
中图分类号
学科分类号
摘要
Due to the limited transmission bandwidth, to meet the application needs of consumer electronics products, there exists an approach to down-sample a video and then compress it to satisfy the limited bandwidth. The existing compressed video super-resolution methods pay more attention to the gain of low-frequency information in the video and process high-frequency information roughly. Besides, the geometric alignment information among temporal frames as well as the global information is also poorly extracted due to the limitation of the convolution operation. To address these limitations, we propose a Transformer based multi-level Alignments method to recover high-frequency and global information for compressed Video Super-Resolution (TAVSR). Specifically, a dual-branch alignment network is proposed. One branch is for recovering high-frequency information based on intra-frame which is compressed at original resolution; another branch is for low-frequency information in the continuous inter-frames at a lower resolution. For each branch, global and local alignments are performed respectively. To achieve global pixel movement alignment between the current frame and intra/inter-frame, Transformer based U-shape Network (TUNet) is proposed to estimate deformable convolution offsets, which performs much better than convolution in the geometric distance formulation from texture. By contrast, the local information is implicitly aligned using TUNet to keep the details. A multi-stage fusion module is further proposed to fuse aligned features to obtain the original resolution frame with enhanced quality. Extensive experiments show that the proposed method achieves the best rate-distortion (R-D) performance on JCT-VC test sequences compared with the most advanced methods. IEEE
引用
收藏
页码:1 / 1
相关论文
共 50 条
  • [1] Lightweight Video Super-Resolution for Compressed Video
    Kwon, Ilhwan
    Li, Jun
    Prasad, Mukesh
    ELECTRONICS, 2023, 12 (03)
  • [2] Compressed Domain Deep Video Super-Resolution
    Chen, Peilin
    Yang, Wenhan
    Wang, Meng
    Sun, Long
    Hu, Kangkang
    Wang, Shiqi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (30) : 7156 - 7169
  • [3] A NOVEL ALGORITHM OF SUPER-RESOLUTION RECONSTRUCTION FOR COMPRESSED VIDEO
    Xu Zhongqiang Zhu Xiuchang (Information Industry Ministry and Jiangsu Province Key Lab of Image Processing & Image Communication
    Journal of Electronics(China), 2007, (03) : 363 - 368
  • [4] Edge-Oriented Compressed Video Super-Resolution
    Wang, Zheng
    Quan, Guancheng
    He, Gang
    SENSORS, 2024, 24 (01)
  • [5] Super-resolution mosaicing from MPEG compressed video
    Kramer, P
    Hadar, O
    Benois-Pineau, J
    Domenger, JP
    2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 801 - 804
  • [6] Super-resolution mosaicing from MPEG compressed video
    Kramer, P.
    Hadar, O.
    Benois-Pineau, J.
    Domenger, J. -P.
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2007, 22 (10) : 845 - 865
  • [7] Multi-level Feature Fusion Mechanism for Single Image Super-Resolution
    Lyn, Jiawen
    2020 THE 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTICS AND CONTROL ENGINEERING (IRCE 2020), 2020, : 52 - 57
  • [8] Multi-level Feature Fusion Network for Single Image Super-Resolution
    Zhang, Xinxia
    Zhang, Xiaoqin
    Zhao, Li
    Jiang, Runhua
    Huang, Pengcheng
    Xu, Jiawei
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 3361 - 3368
  • [9] Super-resolution imaging with an achromatic multi-level diffractive microlens array
    Banerji, Sourangsu
    Meem, Monjurul
    Majumder, Apratim
    Sensale-Rodriguez, Berardi
    Menon, Rajesh
    OPTICS LETTERS, 2020, 45 (22) : 6158 - 6161
  • [10] MFFN: image super-resolution via multi-level features fusion network
    Chen, Yuantao
    Xia, Runlong
    Yang, Kai
    Zou, Ke
    VISUAL COMPUTER, 2024, 40 (02): : 489 - 504