A New Approach to Video Coding Leveraging Hybrid Coding and Video Frame Interpolation

被引：0

作者：

Brascher, Andre Beims ^{[1
]}

da Silveira, Gabriela Furtado ^{[1
]}

Cancellier, Luiz Henrique ^{[1
]}

Seidel, Ismael ^{[1
]}

Grellert, Mateus ^{[2
]}

Guntzel, Jose Luis ^{[1
]}

机构：

[1] Fed Univ Santa Catarina UFSC, Embedded Comp Lab ECL, Florianopolis, Brazil

[2] Fed Univ Rio Grande do Sul UFRGS, Inst Informat, Porto Alegre, Brazil

来源：

2023 36TH SBC/SBMICRO/IEEE/ACM SYMPOSIUM ON INTEGRATED CIRCUITS AND SYSTEMS DESIGN, SBCCI | 2023年

关键词：

Convolutional Neural Network; Video Frame Interpolation; Frame Rate-Up Conversion; Video Coding;

D O I：

10.1109/SBCCI60457.2023.10261663

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this work, we propose the use of a video coding method, dubbed Decoupled Interpolated Video Coding (DIVC), which blends traditional hybrid video coding with novel approaches based on Neural Networks (NNs). The DIVC approach provides a base-level representation with a reduced bit rate by dropping frames in a regular manner which can be decoded by standard hybrid video coding. Meanwhile, we also regenerate the dropped frames from the reconstructed video from the base-level representation using NN-based Video Frame Interpolation (VFI) to recover the original number of frames per second (fps). We show that the DIVC approach can improve video coding efficiency considering the perceptual quality measured with the Structural Similarity (SSIM) metric. More specifically, such approach achieved a reduction of 38.96% (AI configuration) and 5% (RA configuration) in terms of SSIMdB-based Bjontegaard Delta Bit Rate (BD-Rate) when compared to traditional video coding. Meanwhile, considering the Peak Signal-to-Noise Ratio (PSNR), which may not correlate as well to human perception, the discussed approach resulted in PSNR BD-Rate reduction of 16.19% in AI configuration and increase of 19.33% in RA configuration. Employing VFI to improve video coding efficiency tends to be more suited to processing video sequences with high fps or slower motion.

引用

页码：161 / 166

页数：6

共 50 条

[31] Frame design for multiple description video coding
Wang, D
Canagarajah, N
Bull, D
[J]. 2005 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), VOLS 1-6, CONFERENCE PROCEEDINGS, 2005, : 2719 - 2722
[32] Deep Reference Frame Interpolation based Inter Prediction Enhancement for Versatile Video Coding
Jia, Jianghao
Liu, Zizheng
Xu, Xiaozhong
Liu, Shan
Chen, Zhenzhong
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
[33] GPU BASED MOTION-COMPENSATED FRAME INTERPOLATION ACCELERATION FOR FUTURE VIDEO CODING
Tang, Jianlun
Huang, Yan
Xie, Rong
Luo, Zhengyi
Song, Li
[J]. 2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 306 - 310
[34] Higher-Order Motion Models for Temporal Frame Interpolation with Applications to Video Coding
Rufenacht, Dominic
Mathew, Reji
Taubman, David
[J]. 2016 PICTURE CODING SYMPOSIUM (PCS), 2016,
[35] Distributed Video Coding with Frame Estimation at Decoder
Chiam, Kin Honn
Salleh, Mohd Fadzli Mohd
[J]. ADVANCED COMPUTER AND COMMUNICATION ENGINEERING TECHNOLOGY, 2015, 315 : 299 - 308
[36] Coding the displaced frame difference for video compression
Ratakonda, K
Yoon, SC
Ahuja, N
[J]. INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL I, 1997, : 353 - 356
[37] Analysis of fractal inter frame video coding using parallel approach
Milind V. Kulkarni
D. B. Kulkarni
[J]. Signal, Image and Video Processing, 2017, 11 : 629 - 634
[38] Analysis of fractal inter frame video coding using parallel approach
Kulkarni, Milind V.
Kulkarni, D. B.
[J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2017, 11 (04) : 629 - 634
[39] Intra Frame Prediction for Video Coding Using a Conditional Autoencoder Approach
Brand, Fabian
Seiler, Juergen
Kaup, Andre
[J]. 2019 PICTURE CODING SYMPOSIUM (PCS), 2019,
[40] VIDEO RETARGETING BASED FRAME-COMPATIBLE STEREO VIDEO CODING
Chen, Siao-Wei
Tsai, Ming-Feng
Chiang, Jui-Chiu
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 1854 - 1858

← 1 2 3 4 5 →