Learning-based End-to-End Video Compression Using Predictive Coding

被引：1

作者：

de Oliveira, Matheus C. ^{[1
]}

Martins, Luiz G. R. ^{[1
]}

Jung, Henrique Costa ^{[1
]}

Guerin Jr, Nilson Donizete ^{[1
]}

da Silva, Renam Castro ^{[2
]}

Peixoto, Eduardo ^{[1
]}

Macchiavello, Bruno ^{[1
]}

Hung, Edson M. ^{[1
]}

Testoni, Vanessa ^{[2
]}

Freitas, Pedro Garcia ^{[2
]}

机构：

[1] Univ Brasilia, Brasilia, DF, Brazil

[2] Samsung R&D Brazil, Campinas, SP, Brazil

来源：

2021 34TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2021) | 2021年

关键词：

D O I：

10.1109/SIBGRAPI54419.2021.00030

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Driven by the growing demand for video applications, deep learning techniques have become alternatives for implementing end-to-end encoders to achieve applicable compression rates. Conventional video codecs exploit both spatial and temporal correlation. However, due to some restrictions (e.g. computational complexity), they are commonly limited to linear transformations and translational motion estimation. Autoencoder models open up the way for exploiting predictive end-to-end video codecs without such limitations. This paper presents an entire learning-based video codec that exploits spatial and temporal correlations. The presented codec extends the idea of P-frame prediction presented in our previous work. The architecture adopted for I-frame coding is defined by a variational autoencoder with non-parametric entropy modeling. Besides an entropy model parameterized by a hyperprior, the inter-frame encoder architecture has two other independent networks, responsible for motion estimation and residue prediction. Experimental results indicate that some improvements still have to be incorporated into our codec to overcome the all-intra coding set up regarding the traditional algorithms High Efficiency Video Coding (HEVC) and Versatile Video Coding (VVC).

引用

页码：160 / 167

页数：8

共 50 条

[1] LEARNING-BASED END-TO-END VIDEO COMPRESSION WITH SPATIAL-TEMPORAL ADAPTATION
Zhang, Zhaobin
Li, Yue
Zhang, Kai
Zhang, Li
He, Yuwen
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2821 - 2825
[2] End-to-End Learning-Based Image Compression: A Review
Chen Jimin
Lin Zehao
[J]. LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (22)
[3] End-to-End Learning-Based Image Compression With a Decoupled Framework
Zhang, Zhaobin
Esenlik, Semih
Wu, Yaojun
Wang, Meng
Zhang, Kai
Zhang, Li
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3067 - 3081
[4] An End-to-End Learning Framework for Video Compression
Lu, Guo
Zhang, Xiaoyun
Ouyang, Wanli
Chen, Li
Gao, Zhiyong
Xu, Dong
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) : 3292 - 3308
[5] EICNet: An End-to-End Efficient Learning-Based Image Compression Network
Cheng, Ziyi
[J]. IEEE Access, 2024, 12 : 142668 - 142676
[6] End-to-End Learning of Video Compression Using Spatio-Temporal Autoencoders
Pessoa, Jorge
Aidos, Helena
Tomas, Pedro
Figueiredo, Mario A. T.
[J]. 2020 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2020, : 276 - 281
[7] An End-to-End Learning-based Cost Estimator
Sun, Ji
Li, Guoliang
[J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 13 (03): : 307 - 319
[8] End-to-end Distributed Video Coding
Zhou, Junwei
Lv, Ting
Yi, XiangBo
[J]. DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 496 - 496
[9] Ensemble Learning-Based Rate-Distortion Optimization for End-to-End Image Compression
Wang, Yefei
Liu, Dong
Ma, Siwei
Wu, Feng
Gao, Wen
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (03) : 1193 - 1207
[10] New Results in End-to-end Image and Video Compression by Deep Learning
Ozsoy, Gokberk
Yilmaz, Melih
Kirmemis, Ogun
Tekalp, A. Murat
[J]. 2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,

← 1 2 3 4 5 →