Learning for Video Compression With Recurrent Auto-Encoder and Recurrent Probability Model

被引:81
|
作者
Yang, Ren [1 ]
Mentzer, Fabian [1 ]
Van Gool, Luc [1 ]
Timofte, Radu [1 ]
机构
[1] Swiss Fed Inst Technol, Dept Informat Technol & Elect Engn, CH-8092 Zurich, Switzerland
关键词
Video compression; Image coding; Decoding; Entropy; Correlation; Motion compensation; Estimation; Deep learning; recurrent neural network; video compression; AUTOENCODER;
D O I
10.1109/JSTSP.2020.3043590
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The past few years have witnessed increasing interests in applying deep learning to video compression. However, the existing approaches compress a video frame with only a few number of reference frames, which limits their ability to fully exploit the temporal correlation among video frames. To overcome this shortcoming, this paper proposes a Recurrent Learned Video Compression (RLVC) approach with the Recurrent Auto-Encoder (RAE) and Recurrent Probability Model (RPM). Specifically, the RAE employs recurrent cells in both the encoder and decoder. As such, the temporal information in a large range of frames can be used for generating latent representations and reconstructing compressed outputs. Furthermore, the proposed RPM network recurrently estimates the Probability Mass Function (PMF) of the latent representation, conditioned on the distribution of previous latent representations. Due to the correlation among consecutive frames, the conditional cross entropy can be lower than the independent cross entropy, thus reducing the bit-rate. The experiments show that our approach achieves the state-of-the-art learned video compression performance in terms of both PSNR and MS-SSIM. Moreover, our approach outperforms the default Low-Delay P (LDP) setting of x265 on PSNR, and also has better performance on MS-SSIM than the SSIM-tuned x265 and the slowest setting of x265. The codes are available at https://github.com/RenYang-home/RLVC.git.
引用
收藏
页码:388 / 401
页数:14
相关论文
共 50 条
  • [41] A Fault Detection Method based on Convolutional Gated Recurrent Unit Auto-encoder for Tennessee Eastman Process
    Yu, Jianbo
    Liu, Xing
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 1234 - 1238
  • [42] Deep Auto-Encoder Neural Networks in Reinforcement Learning
    Lange, Sascha
    Riedmiller, Martin
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [43] An Auto-Encoder for Learning Conversation Representation Using LSTM
    Zhou, Xiaoqiang
    Hu, Baotian
    Chen, Qingcai
    Wang, Xiaolong
    NEURAL INFORMATION PROCESSING, PT I, 2015, 9489 : 310 - 317
  • [44] An Efficient Encryption and Compression of Sensed IoTMedical Images Using Auto-Encoder
    El-kafrawy, Passent
    Aboghazalah, Maie
    Ahmed, Abdelmoty M.
    Torkey, Hanaa
    El-Sayed, Ayman
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 134 (02): : 909 - 926
  • [45] IMAGE COMPRESSION WITH STOCHASTIC WINNER-TAKE-ALL AUTO-ENCODER
    Dumas, Thierry
    Roumy, Aline
    Guillemot, Christine
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 1512 - 1516
  • [46] A deep auto-encoder model for gene expression prediction
    Rui Xie
    Jia Wen
    Andrew Quitadamo
    Jianlin Cheng
    Xinghua Shi
    BMC Genomics, 18
  • [47] A deep auto-encoder model for gene expression prediction
    Xie, Rui
    Wen, Jia
    Quitadamo, Andrew
    Cheng, Jianlin
    Shi, Xinghua
    BMC GENOMICS, 2017, 18
  • [48] A Variational Auto-Encoder Model for Underwater Acoustic Channels
    Wei, Li
    Wang, Zhaohui
    WUWNET'21: THE 15TH ACM INTERNATIONAL CONFERENCE ON UNDERWATER NETWORKS & SYSTEMS, 2021,
  • [49] Compressed Sensing Verses Auto-Encoder: On the Perspective of Signal Compression and Restoration
    Jeong, Jin-Young
    Ozger, Mustafa
    Lee, Woong-Hee
    IEEE ACCESS, 2024, 12 : 41967 - 41979
  • [50] A Variational Auto-Encoder Model for Stochastic Point Processes
    Mehrasa, Nazanin
    Jyothi, Akash Abdu
    Durand, Thibaut
    He, Jiawei
    Sigal, Leonid
    Mori, Greg
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3160 - 3169