The popularization of mobile phones and other portable multimedia devices has paved the way for the worldwide increase in video consumption. However, transmitting uncompressed video is impractical due to the high bandwidth it requires. To achieve significant compression rates, video codecs usually employ methods that degrade, to non-negligible levels, the visual quality perceived by the end user. Different deep-learning-based architectures have recently been proposed for Video Quality Enhancement (VQE). Still, most of them are trained and validated on videos generated by a single codec under fixed configurations. With the growing number of video coding formats and standards on the market, VQE methods that generalize across different contexts are desirable. This paper proposes a new VQE model based on the Spatio-Temporal Deformable Fusion (STDF) architecture that provides quality gains for videos compressed under different formats and standards, such as HEVC, VVC, VP9, and AV1. The results demonstrate that building the STDF model with videos from different coding standards and formats yields a significant increase in enhancement quality, with an average PSNR gain of up to 0.382 dB.