A New Approach to Video Coding Leveraging Hybrid Coding and Video Frame Interpolation

Authors
Brascher, Andre Beims [1]
da Silveira, Gabriela Furtado [1]
Cancellier, Luiz Henrique [1]
Seidel, Ismael [1]
Grellert, Mateus [2]
Guntzel, Jose Luis [1]
Affiliations
[1] Fed Univ Santa Catarina UFSC, Embedded Comp Lab ECL, Florianopolis, Brazil
[2] Fed Univ Rio Grande do Sul UFRGS, Inst Informat, Porto Alegre, Brazil
Keywords
Convolutional Neural Network; Video Frame Interpolation; Frame Rate-Up Conversion; Video Coding;
DOI
10.1109/SBCCI60457.2023.10261663
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
In this work, we propose a video coding method, dubbed Decoupled Interpolated Video Coding (DIVC), which blends traditional hybrid video coding with novel approaches based on Neural Networks (NNs). DIVC provides a base-level representation with a reduced bit rate by dropping frames at regular intervals; this representation can be decoded by a standard hybrid video decoder. The dropped frames are then regenerated from the reconstructed base-level video using NN-based Video Frame Interpolation (VFI), recovering the original number of frames per second (fps). We show that DIVC can improve video coding efficiency in terms of perceptual quality as measured by the Structural Similarity (SSIM) metric. More specifically, the approach achieved reductions of 38.96% (AI configuration) and 5% (RA configuration) in SSIMdB-based Bjontegaard Delta Bit Rate (BD-Rate) compared to traditional video coding. In terms of Peak Signal-to-Noise Ratio (PSNR), which may not correlate as well with human perception, the approach yielded a PSNR BD-Rate reduction of 16.19% in the AI configuration and an increase of 19.33% in the RA configuration. Employing VFI to improve video coding efficiency tends to be better suited to video sequences with high fps or slower motion.
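The frame-dropping and regeneration steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The names `drop_frames`, `blend_midpoint`, and `regenerate` are hypothetical, and so is the choice of keeping every second frame, since the abstract only says frames are dropped in a regular manner. The per-pixel average is a simple placeholder for the NN-based VFI model, and the hybrid encode/decode of the base-level frames is omitted entirely.

```python
from typing import List, Sequence

Frame = List[float]  # a frame flattened to a list of pixel intensities


def drop_frames(frames: Sequence[Frame], keep_every: int = 2) -> List[Frame]:
    """Regular temporal decimation: keep frames 0, keep_every, 2*keep_every, ...

    keep_every=2 is an assumption made for illustration only.
    """
    return [f for i, f in enumerate(frames) if i % keep_every == 0]


def blend_midpoint(prev: Frame, nxt: Frame) -> Frame:
    """Placeholder interpolator: per-pixel average of the two neighbours.

    Stands in for the NN-based VFI model, which is not specified here.
    """
    return [(a + b) / 2.0 for a, b in zip(prev, nxt)]


def regenerate(base: Sequence[Frame], interpolate=blend_midpoint) -> List[Frame]:
    """Re-insert one interpolated frame between each pair of base frames."""
    out: List[Frame] = []
    for prev, nxt in zip(base, base[1:]):
        out.append(list(prev))
        out.append(interpolate(prev, nxt))
    if base:
        out.append(list(base[-1]))  # the last base frame has no successor
    return out


if __name__ == "__main__":
    # Five tiny two-pixel frames whose brightness rises linearly over time.
    original = [[float(t), float(t)] for t in range(5)]
    base = drop_frames(original)   # frames 0, 2, 4 form the base level
    restored = regenerate(base)    # frames 1 and 3 are re-synthesised
    print(len(base), len(restored))
```

With linearly changing content the averaging placeholder recovers the dropped frames exactly; real video with fast or non-linear motion would not behave this way, which is consistent with the abstract's remark that the approach suits high-fps or slow-motion sequences.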
Pages: 161-166
Number of pages: 6