Multi-Codec Video Quality Enhancement Model Based on Spatio-Temporal Deformable Fusion

被引:0
|
作者
Kreisler, Gilberto [1 ]
da Silveira Junior, Garibaldi [1 ]
Zatt, Bruno [1 ]
Palomino, Daniel [1 ]
Correa, Guilherme [1 ]
机构
[1] Fed Univ Pelotas UFPel, Grad Program Comp PPGC, Video Technol Res Grp ViTech, Pelotas, RS, Brazil
关键词
video quality enhancement (VQE); video coding; deep learning;
D O I
10.1109/LASCAS60203.2024.10506192
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The popularization of mobile phones and other multimedia portable devices paved the way for the increase in video consumption worldwide. However, it is impossible to transmit a non-compressed video due to the high bandwidth required. To achieve significant compression rates, video codecs usually employ methods that damage the visual quality perceived by the end user in non-negligible levels. Different architectures based on deep learning have been recently proposed for Video Quality Enhancement (VQE). Still, most of them are trained and validated using videos generated by a single codec under fixed configurations. With the increase of video coding formats and standards on the market, VQE methods that apply to different contexts are desired. This paper proposes a new VQE model based on the Spatio-Temporal Deformable Fusion (STDF) architecture, providing quality gains for videos compressed according to different formats and standards, such as HEVC, VVC, VP9, and AV1. The results demonstrate that by considering different video coding standards and formats to build the STDF model, a significant increase in VQE is achieved, with an average PSNR increment of up to 0.382 dB.
引用
收藏
页码:163 / 167
页数:5
相关论文
共 50 条
  • [41] PointSDA: Spatio-Temporal Deformable Attention Network for Point Cloud Video Modeling
    Sheng, Xiaoxiao
    Shen, Zhiqiang
    Xiao, Gang
    IEEE Robotics and Automation Letters, 2024, 9 (12) : 10946 - 10953
  • [42] Low-Complexity Video Quality Assessment Based on Spatio-Temporal Structure
    Lu, Yaqi
    Yu, Mei
    Jiang, Gangyi
    INFORMATION AND SOFTWARE TECHNOLOGIES, ICIST 2019, 2019, 1078 : 408 - 415
  • [43] No-Reference Quality Evaluation of Stereoscopic Video Based on Spatio-Temporal Texture
    Yang, Jiachen
    Zhao, Yang
    Jiang, Bin
    Lu, Wen
    Gao, Xinbo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (10) : 2635 - 2644
  • [44] Objective Quality Assessment for Video Retargeting Based on Spatio-Temporal Distortion Analysis
    Hsu, Chih-Chung
    Lin, Chia-Wen
    2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
  • [45] No Reference Video Quality Assessment Based on Spatio-Temporal Features and Attention Mechanism
    Zhu Ze
    Sang Qingbing
    Zhang Hao
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (18)
  • [46] BLIND VIDEO QUALITY ASSESSMENT BASED ON SPATIO-TEMPORAL INTERNAL GENERATIVE MECHANISM
    Zhu, Yun
    Wang, Yongfang
    Shuai, Yuan
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 305 - 309
  • [47] Spatio-temporal attention model for video content analysis
    Guironnet, M
    Guyader, N
    Pellerin, D
    Ladret, P
    2005 International Conference on Image Processing (ICIP), Vols 1-5, 2005, : 2989 - 2992
  • [48] Video Text Tracking With a Spatio-Temporal Complementary Model
    Gao, Yuzhe
    Li, Xing
    Zhang, Jiajian
    Zhou, Yu
    Jin, Dian
    Wang, Jing
    Zhu, Shenggao
    Bai, Xiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 9321 - 9331
  • [49] Traffic flow prediction model based on spatio-temporal graph convolution with multi-information fusion
    Meng, Chuang
    Wang, Hui
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (08): : 1541 - 1550
  • [50] Spatio-temporal fusion quality evaluation based on "Point"-"Line"-"Plane" aspects
    Lei C.
    Meng X.
    Shao F.
    National Remote Sensing Bulletin, 2021, 25 (03) : 791 - 802