Multi-Codec Video Quality Enhancement Model Based on Spatio-Temporal Deformable Fusion

Authors
Kreisler, Gilberto [1]
da Silveira Junior, Garibaldi [1]
Zatt, Bruno [1]
Palomino, Daniel [1]
Correa, Guilherme [1]
Affiliations
[1] Fed Univ Pelotas UFPel, Grad Program Comp PPGC, Video Technol Res Grp ViTech, Pelotas, RS, Brazil
Keywords
video quality enhancement (VQE); video coding; deep learning
DOI
10.1109/LASCAS60203.2024.10506192
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
The popularization of mobile phones and other portable multimedia devices has paved the way for an increase in video consumption worldwide. However, transmitting uncompressed video is impractical because of the high bandwidth required. To achieve significant compression rates, video codecs usually employ methods that degrade the visual quality perceived by the end user to a non-negligible degree. Different architectures based on deep learning have recently been proposed for Video Quality Enhancement (VQE). Still, most of them are trained and validated using videos generated by a single codec under fixed configurations. With the growing number of video coding formats and standards on the market, VQE methods that apply across different contexts are desirable. This paper proposes a new VQE model based on the Spatio-Temporal Deformable Fusion (STDF) architecture, providing quality gains for videos compressed according to different formats and standards, such as HEVC, VVC, VP9, and AV1. The results demonstrate that considering different video coding standards and formats when building the STDF model yields a significant increase in enhancement quality, with an average PSNR increment of up to 0.382 dB.
Pages: 163-167
Number of pages: 5
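
Note on the reported metric: the abstract states its result as an average PSNR increment in dB. The sketch below shows one common way such an increment can be computed, as the PSNR of the enhanced frame minus the PSNR of the compressed frame, both measured against the same uncompressed reference. It is an illustration under stated assumptions, not the authors' evaluation code: the function names `psnr` and `delta_psnr`, the 8-bit peak value of 255, and the toy pixel values are all hypothetical and do not come from the paper.

```python
import math


def psnr(reference, distorted, max_value=255.0):
    """PSNR in dB between two equally sized sequences of pixel values.

    Assumes 8-bit samples (peak value 255) unless max_value is overridden.
    """
    if len(reference) != len(distorted) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all samples.
    mse = sum((r - d) ** 2 for r, d in zip(reference, distorted)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals
    return 10.0 * math.log10(max_value ** 2 / mse)


def delta_psnr(reference, compressed, enhanced, max_value=255.0):
    """PSNR gain (dB) of the enhanced frame over the compressed frame."""
    return psnr(reference, enhanced, max_value) - psnr(reference, compressed, max_value)


# Toy data (hypothetical): a 4-pixel "frame" and two degraded versions.
reference = [100, 120, 140, 160]
compressed = [104, 116, 144, 156]  # every sample off by 4, so MSE = 16
enhanced = [102, 118, 142, 158]    # every sample off by 2, so MSE = 4

gain = delta_psnr(reference, compressed, enhanced)
print(f"Delta PSNR: {gain:.3f} dB")
```

In this toy case the enhancement halves the per-sample error, so the MSE drops by a factor of 4 and the gain equals 10·log10(4), about 6.02 dB. An average increment such as the one in the abstract would typically be obtained by averaging this per-frame difference over the test sequences, although the exact averaging procedure is not given in this record.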