Multi-Codec Video Quality Enhancement Model Based on Spatio-Temporal Deformable Fusion

被引:0
|
作者
Kreisler, Gilberto [1 ]
da Silveira Junior, Garibaldi [1 ]
Zatt, Bruno [1 ]
Palomino, Daniel [1 ]
Correa, Guilherme [1 ]
机构
[1] Fed Univ Pelotas UFPel, Grad Program Comp PPGC, Video Technol Res Grp ViTech, Pelotas, RS, Brazil
关键词
video quality enhancement (VQE); video coding; deep learning;
D O I
10.1109/LASCAS60203.2024.10506192
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The popularization of mobile phones and other multimedia portable devices paved the way for the increase in video consumption worldwide. However, it is impossible to transmit a non-compressed video due to the high bandwidth required. To achieve significant compression rates, video codecs usually employ methods that damage the visual quality perceived by the end user in non-negligible levels. Different architectures based on deep learning have been recently proposed for Video Quality Enhancement (VQE). Still, most of them are trained and validated using videos generated by a single codec under fixed configurations. With the increase of video coding formats and standards on the market, VQE methods that apply to different contexts are desired. This paper proposes a new VQE model based on the Spatio-Temporal Deformable Fusion (STDF) architecture, providing quality gains for videos compressed according to different formats and standards, such as HEVC, VVC, VP9, and AV1. The results demonstrate that by considering different video coding standards and formats to build the STDF model, a significant increase in VQE is achieved, with an average PSNR increment of up to 0.382 dB.
引用
收藏
页码:163 / 167
页数:5
相关论文
共 50 条
  • [1] STDF: Spatio-Temporal Deformable Fusion for Video Quality Enhancement on Embedded Platforms
    Deng, Jianing
    Dong, Shunjie
    Chen, Lvcheng
    Hu, Jingtong
    Zhuo, Cheng
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2024, 23 (02)
  • [2] Spatio-Temporal Deformable Convolution for Compressed Video Quality Enhancement
    Deng, Jianing
    Wang, Li
    Pu, Shiliang
    Zhuo, Cheng
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10696 - 10703
  • [3] Spatio-Temporal Information Fusion Network for Compressed Video Quality Enhancement
    Huang, Weiwei
    Jia, Kebin
    Liu, Pengyu
    Yu, Yuan
    2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 343 - 343
  • [4] Coarse-to-Fine Spatio-Temporal Information Fusion for Compressed Video Quality Enhancement
    Luo, Dengyan
    Ye, Mao
    Li, Shuai
    Li, Xue
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 543 - 547
  • [5] Multi-Swin Transformer Based Spatio-Temporal Information Exploration for Compressed Video Quality Enhancement
    Yu, Li
    Wu, Shiyu
    Gabbouj, Moncef
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1880 - 1884
  • [6] Multi-Frame Compressed Video Quality Enhancement by Spatio-Temporal Information Balance
    Wang, Zeyang
    Ye, Mao
    Li, Shuai
    Li, Xue
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 105 - 109
  • [7] Spatio-temporal video contrast enhancement
    Celik, Turgay
    IET IMAGE PROCESSING, 2013, 7 (06) : 543 - 555
  • [8] Spatio-Temporal Deformable Attention Network for Video Deblurring
    Zhang, Huicong
    Xie, Haozhe
    Yao, Hongxun
    COMPUTER VISION - ECCV 2022, PT XVI, 2022, 13676 : 581 - 596
  • [9] Spatio-Temporal Deformable Attention Network for Video Deblurring
    Zhang, Huicong
    Xie, Haozhe
    Yao, Hongxun
    arXiv, 2022,
  • [10] No-reference Video Quality Assessment Based on Spatio-temporal Perception Feature Fusion
    Yaya Tan
    Guangqian Kong
    Xun Duan
    Huiyun Long
    Yun Wu
    Neural Processing Letters, 2023, 55 : 1317 - 1335