Learning-based End-to-End Video Compression Using Predictive Coding

Cited by: 1
Authors
de Oliveira, Matheus C. [1]
Martins, Luiz G. R. [1]
Jung, Henrique Costa [1]
Guerin Jr, Nilson Donizete [1]
da Silva, Renam Castro [2]
Peixoto, Eduardo [1]
Macchiavello, Bruno [1]
Hung, Edson M. [1]
Testoni, Vanessa [2]
Freitas, Pedro Garcia [2]
Affiliations
[1] Univ Brasilia, Brasilia, DF, Brazil
[2] Samsung R&D Brazil, Campinas, SP, Brazil
DOI
10.1109/SIBGRAPI54419.2021.00030
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Driven by the growing demand for video applications, deep learning techniques have become an alternative for implementing end-to-end encoders that achieve practical compression rates. Conventional video codecs exploit both spatial and temporal correlation. However, due to some restrictions (e.g., computational complexity), they are commonly limited to linear transformations and translational motion estimation. Autoencoder models open the way to predictive end-to-end video codecs without such limitations. This paper presents a fully learning-based video codec that exploits spatial and temporal correlations. The presented codec extends the idea of P-frame prediction presented in our previous work. The architecture adopted for I-frame coding is a variational autoencoder with non-parametric entropy modeling. Besides an entropy model parameterized by a hyperprior, the inter-frame encoder architecture has two other independent networks, responsible for motion estimation and residue prediction. Experimental results indicate that further improvements are still needed for our codec to outperform the all-intra coding setup of the traditional codecs High Efficiency Video Coding (HEVC) and Versatile Video Coding (VVC).
Pages: 160-167
Number of pages: 8