Video Frame Prediction via Deep Learning

Authors
Yilmaz, M. Akin [1]
Tekalp, A. Murat [1]
Affiliation
[1] Koc Univ, Elekt Elekt Muhendisligi Bolumu, Istanbul, Turkey
Keywords
frame prediction; deep learning; recurrent network architectures; stateful training; convolutional network architectures
DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Subject classification codes
0808; 0809
Abstract
This paper provides new results over our previous work presented in ICIP 2019 on the performance of learned frame prediction architectures and associated training methods. More specifically, we show that using an end-to-end residual connection in the fully convolutional neural network (FCNN) improves performance. To provide comparative results, we trained a residual FCNN, a convolutional RNN (CRNN), and a convolutional long short-term memory (CLSTM) network for next-frame prediction using the mean squared error loss. We performed both stateless and stateful training for the recurrent networks. Experimental results show that the residual FCNN architecture performs best in terms of peak signal-to-noise ratio (PSNR), at the expense of higher training and test (inference) computational complexity. The CRNN can be trained stably and efficiently using the stateful truncated backpropagation through time procedure, and requires an order of magnitude less inference runtime to achieve acceptable performance in near real-time.
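The abstract's two quantitative ingredients, the mean squared error loss and PSNR, are related by a standard formula, sketched below in NumPy. The function names, the 8-bit peak value of 255, and in particular the `residual_prediction` helper are illustrative assumptions: the abstract does not say how the end-to-end residual connection is wired, so the sketch simply assumes the network output is added to the most recent input frame.

```python
import numpy as np


def mse(pred, target):
    """Mean squared error between two frames of equal shape."""
    diff = pred.astype(np.float64) - target.astype(np.float64)
    return float(np.mean(diff ** 2))


def psnr(pred, target, peak=255.0):
    """PSNR in dB; `peak` is the maximum pixel value (assumed 8-bit here)."""
    err = mse(pred, target)
    if err == 0.0:
        return float("inf")  # identical frames
    return float(10.0 * np.log10(peak ** 2 / err))


def residual_prediction(last_frame, predicted_residual, peak=255.0):
    """Hypothetical end-to-end residual: output = last input frame + residual.

    This wiring is an assumption for illustration, not the paper's
    documented architecture.
    """
    return np.clip(last_frame + predicted_residual, 0.0, peak)
```

Under this assumption, a network that outputs an all-zero residual reproduces the previous frame, so the copy-last-frame baseline is the starting point that training improves on.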
Pages: 4