Deep Learning in Latent Space for Video Prediction and Compression

被引:45
|
作者
Liu, Bowen [1 ]
Chen, Yu [1 ]
Liu, Shiyu [1 ]
Kim, Hun-Seok [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
关键词
EVENT DETECTION;
D O I
10.1109/CVPR46437.2021.00076
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning-based video compression has achieved substantial progress during recent years. The most influential approaches adopt deep neural networks (DNNs) to remove spatial and temporal redundancies by finding the appropriate lower-dimensional representations of frames in the video. We propose a novel DNN based framework that predicts and compresses video sequences in the latent vector space. The proposed method first learns the efficient lower-dimensional latent space representation of each video frame and then performs inter-frame prediction in that latent domain. The proposed latent domain compression of individual frames is obtained by a deep autoencoder trained with a generative adversarial network (GAN). To exploit the temporal correlation within the video frame sequence, we employ a convolutional long short-term memory (ConvLSTM) network to predict the latent vector representation of the future frame. We demonstrate our method with two applications; video compression and abnormal event detection that share the identical latent frame prediction network. The proposed method exhibits superior or competitive performance compared to the state-of-the-art algorithms specifically designed for either video compression or anomaly detection.(1)
引用
收藏
页码:701 / 710
页数:10
相关论文
共 50 条
  • [11] Biased Extrapolation in Latent Space for Imbalanced Deep Learning
    Jeong, Suhyeon
    Lee, Seungkyu
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2021, 2021, 12966 : 337 - 346
  • [12] Latent space data assimilation by using deep learning
    Peyron, Mathis
    Fillion, Anthony
    Gurol, Selime
    Marchais, Victor
    Gratton, Serge
    Boudier, Pierre
    Goret, Gael
    QUARTERLY JOURNAL OF THE ROYAL METEOROLOGICAL SOCIETY, 2021, 147 (740) : 3759 - 3777
  • [13] Learning deep latent space for unsupervised violence detection
    Ehsan, Tahereh Zarrat
    Nahvi, Manoochehr
    Mohtavipour, Seyed Mehdi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (08) : 12493 - 12512
  • [14] Learning deep latent space for unsupervised violence detection
    Tahereh Zarrat Ehsan
    Manoochehr Nahvi
    Seyed Mehdi Mohtavipour
    Multimedia Tools and Applications, 2023, 82 : 12493 - 12512
  • [15] Deep-learning the Latent Space of Light Transport
    Hermosilla, P.
    Maisch, S.
    Ritschel, T.
    Ropinski, T.
    COMPUTER GRAPHICS FORUM, 2019, 38 (04) : 207 - 217
  • [16] Video Frame Prediction via Deep Learning
    Yilmaz, M. Akin
    Tekalp, A. Murat
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [17] A Review on Deep Learning Techniques for Video Prediction
    Oprea, Sergiu
    Martinez-Gonzalez, Pablo
    Garcia-Garcia, Alberto
    Castro-Vargas, John Alejandro
    Orts-Escolano, Sergio
    Garcia-Rodriguez, Jose
    Argyros, Antonis
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (06) : 2806 - 2826
  • [18] Relational Deep Learning: A Deep Latent Variable Model for Link Prediction
    Wang, Hao
    Shi, Xingjian
    Yeung, Dit-Yan
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2688 - 2694
  • [19] Deep learning based CETSA feature prediction cross multiple cell lines with latent space representation
    Zhao, Shenghao
    Yang, Xulei
    Zeng, Zeng
    Qian, Peisheng
    Zhao, Ziyuan
    Dai, Lingyun
    Prabhu, Nayana
    Nordlund, Paer
    Tam, Wai Leong
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [20] Towards image denoising in the latent space of learning-based compression
    Testolina, Michela
    Upenik, Evgeniy
    Ebrahimi, Touradj
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLIV, 2021, 11842