Multi-Codec Video Quality Enhancement Model Based on Spatio-Temporal Deformable Fusion

被引:0
|
作者
Kreisler, Gilberto [1 ]
da Silveira Junior, Garibaldi [1 ]
Zatt, Bruno [1 ]
Palomino, Daniel [1 ]
Correa, Guilherme [1 ]
机构
[1] Fed Univ Pelotas UFPel, Grad Program Comp PPGC, Video Technol Res Grp ViTech, Pelotas, RS, Brazil
关键词
video quality enhancement (VQE); video coding; deep learning;
D O I
10.1109/LASCAS60203.2024.10506192
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The popularization of mobile phones and other multimedia portable devices paved the way for the increase in video consumption worldwide. However, it is impossible to transmit a non-compressed video due to the high bandwidth required. To achieve significant compression rates, video codecs usually employ methods that damage the visual quality perceived by the end user in non-negligible levels. Different architectures based on deep learning have been recently proposed for Video Quality Enhancement (VQE). Still, most of them are trained and validated using videos generated by a single codec under fixed configurations. With the increase of video coding formats and standards on the market, VQE methods that apply to different contexts are desired. This paper proposes a new VQE model based on the Spatio-Temporal Deformable Fusion (STDF) architecture, providing quality gains for videos compressed according to different formats and standards, such as HEVC, VVC, VP9, and AV1. The results demonstrate that by considering different video coding standards and formats to build the STDF model, a significant increase in VQE is achieved, with an average PSNR increment of up to 0.382 dB.
引用
收藏
页码:163 / 167
页数:5
相关论文
共 50 条
  • [21] DeepVideoMVS: Multi-View Stereo on Video with Recurrent Spatio-Temporal Fusion
    Duzceker, Arda
    Galliani, Silvano
    Vogel, Christoph
    Speciale, Pablo
    Dusmanu, Mihai
    Pollefeys, Marc
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15319 - 15328
  • [22] Video modeling by spatio-temporal resampling and Bayesian fusion
    Zheng, Yunfei
    Li, Xin
    2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 3201 - 3204
  • [23] Deep video quality assessment using constrained multi-task regression and Spatio-temporal feature fusion
    Wen, Mingyang
    Liu, Lixiong
    Sang, Qingbing
    Zhang, Yongmei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (18) : 28067 - 28086
  • [24] Fusion Side Information Based on Spatio-temporal Correlation in Distributed Multi-view Video Coding
    Huang, Qing
    Li, Bin
    Wang, Yumei
    Zhang, Lin
    Liu, Yu
    2011 6TH INTERNATIONAL ICST CONFERENCE ON COMMUNICATIONS AND NETWORKING IN CHINA (CHINACOM), 2011, : 76 - 81
  • [25] Deep video quality assessment using constrained multi-task regression and Spatio-temporal feature fusion
    Mingyang Wen
    Lixiong Liu
    Qingbing Sang
    Yongmei Zhang
    Multimedia Tools and Applications, 2023, 82 : 28067 - 28086
  • [26] Video Quality Assessment Metric Based on Spatio-Temporal Motion Information
    Kang, Kai
    Liu, Xingang
    Sun, Chao
    2013 IEEE 11TH INTERNATIONAL CONFERENCE ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING (DASC), 2013, : 47 - 51
  • [27] Novel Spatio-Temporal Structural Information Based Video Quality Metric
    Wang, Yue
    Jiang, Tingting
    Ma, Siwei
    Gao, Wen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2012, 22 (07) : 989 - 998
  • [28] Blind video quality assessment based on Spatio-Temporal Feature Resolver
    Bi, Xiaodong
    He, Xiaohai
    Xiong, Shuhua
    Zhao, Zeming
    Chen, Honggang
    Sheriff, Raymond Edward
    NEUROCOMPUTING, 2024, 574
  • [29] Spatio-temporal enhancement method based on dense connection structure for compressed video
    Li, Hongyao
    He, Xiaohai
    Bi, Xiaodong
    Xiong, Shuhua
    Chen, Honggang
    Journal of Electronic Imaging, 2024, 33 (04)
  • [30] Facial Expression Recognition Based on the Fusion of Spatio-temporal Features in Video Sequences
    Wang Xiaohua
    Xia Chen
    Hu Min
    Ren Fuji
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2018, 40 (03) : 626 - 632