Multi-Level Alignments for Compressed Video Super-Resolution

被引:0
|
作者
Wei L. [1 ]
Ye M. [1 ]
Ji L. [1 ]
Gan Y. [2 ]
Li S. [3 ]
Li X. [4 ]
机构
[1] School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu
[2] College of Computer Science, Chongqing University, Chongqing
[3] School of Control Science and Engineering, Shandong University, Jinan
[4] School of Electrical Engineering and Computer Science, The University of Queensland, Brisbane, QLD
基金
中国国家自然科学基金;
关键词
Compressed video quality enhancement; Compressed video super-resolution; Transformer;
D O I
10.1109/TCE.2024.3411144
中图分类号
学科分类号
摘要
Due to the limited transmission bandwidth, to meet the application needs of consumer electronics products, there exists an approach to down-sample a video and then compress it to satisfy the limited bandwidth. The existing compressed video super-resolution methods pay more attention to the gain of low-frequency information in the video and process high-frequency information roughly. Besides, the geometric alignment information among temporal frames as well as the global information is also poorly extracted due to the limitation of the convolution operation. To address these limitations, we propose a Transformer based multi-level Alignments method to recover high-frequency and global information for compressed Video Super-Resolution (TAVSR). Specifically, a dual-branch alignment network is proposed. One branch is for recovering high-frequency information based on intra-frame which is compressed at original resolution; another branch is for low-frequency information in the continuous inter-frames at a lower resolution. For each branch, global and local alignments are performed respectively. To achieve global pixel movement alignment between the current frame and intra/inter-frame, Transformer based U-shape Network (TUNet) is proposed to estimate deformable convolution offsets, which performs much better than convolution in the geometric distance formulation from texture. By contrast, the local information is implicitly aligned using TUNet to keep the details. A multi-stage fusion module is further proposed to fuse aligned features to obtain the original resolution frame with enhanced quality. Extensive experiments show that the proposed method achieves the best rate-distortion (R-D) performance on JCT-VC test sequences compared with the most advanced methods. IEEE
引用
收藏
页码:1 / 1
相关论文
共 50 条
  • [41] Multi Frame Super-Resolution for Versatile Video Coding
    Hori, Toshiya
    Yano, Remina
    Watanabe, Hiroshi
    Chujoh, Takeshi
    Ikai, Tomohiro
    Sasaki, Eiichi
    Suzuki, Takuya
    Ito, Norio
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2021, 2021, 11766
  • [42] Multi-level and Multi-scale Spatial and Spectral Fusion CNN for Hyperspectral Image Super-resolution
    Han, Xian-Hua
    Zheng, YinQiang
    Chen, Yen-Wei
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4330 - 4339
  • [43] Compressed Super-Resolution of Positive Sources
    Ferreira Da Costa, Maxime
    Chi, Yuejie
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 56 - 60
  • [44] An image super-resolution deep learning network based on multi-level feature extraction module
    Xin Yang
    Yifan Zhang
    Yingqing Guo
    Dake Zhou
    Multimedia Tools and Applications, 2021, 80 : 7063 - 7075
  • [45] Multi-level feature extraction and reconstruction for 3D MRI image super-resolution
    Li, Hongbi
    Jia, Yuanyuan
    Zhu, Huazheng
    Han, Baoru
    Du, Jinglong
    Liu, Yanbing
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 171
  • [46] MICU: Image super-resolution via multi-level information compensation and U-net
    Chen, Yuantao
    Xia, Runlong
    Yang, Kai
    Zou, Ke
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 245
  • [47] An image super-resolution deep learning network based on multi-level feature extraction module
    Yang, Xin
    Zhang, Yifan
    Guo, Yingqing
    Zhou, Dake
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (05) : 7063 - 7075
  • [48] Noisy face super-resolution method based on multi-level latent space information constraints
    Teng Z.
    Yu X.-S.
    Wu C.-D.
    Kongzhi yu Juece/Control and Decision, 2024, 39 (05): : 1469 - 1477
  • [49] A hybrid of transformer and CNN for efficient single image super-resolution via multi-level distillation
    Zhou, Zhenting
    Li, Guoping
    Wang, Guozhong
    DISPLAYS, 2023, 76
  • [50] Unsupervised multi-level spatio-spectral fusion transformer for hyperspectral image super-resolution
    Cao, Xuheng
    Lian, Yusheng
    Li, Jin
    Wang, Kaixuan
    Ma, Chao
    OPTICS AND LASER TECHNOLOGY, 2024, 176