Video Frame Prediction via Deep Learning

被引:0
|
作者
Yilmaz, M. Akin [1 ]
Tekalp, A. Murat [1 ]
机构
[1] Koc Univ, Elekt Elekt Miihendisligi Bolumu, Istanbul, Turkey
关键词
frame prediction; deep learning; recurrent network architectures; stateful training; convolutional network architectures;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper provides new results over our previous work presented in ICIP 2019 on the performance of learned frame prediction architectures and associated training methods. More specifically, we show that using an end-to-end residual connection in the fully convolutional neural network (FCNN) provides improved performance. In order to provide comparative results, we trained a residual FCNN, a convolutional RNN (CRNN), and a convolutional long-short term memory (CLSTM) network for next frame prediction using the mean square loss. We performed both stateless and stateful training for recurrent networks. Experimental results show that the residual FCNN architecture performs the best in terms of peak signal to noise ratio (PSNR) at the expense of higher training and test (inference) computational complexity. The CRNN can be stably and efficiently trained using the stateful truncated backpropagation through time procedure, and requires an order of magnitude less inference runtime to achieve an acceptable performance in near real-time.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Deep Frame Prediction for Video Coding
    Choi, Hyomin
    Bajic, Ivan V.
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) : 1843 - 1855
  • [2] Deep Inter Prediction via Reference Frame Interpolation for Blurry Video Coding
    Zhu, Zezhi
    Zhao, Lili
    Lin, Xuhu
    Guo, Xuezhou
    Chen, Jianwen
    [J]. 2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [3] Optimizing Video Prediction via Video Frame Interpolation
    Wu, Yue
    Wen, Qiang
    Chen, Qifeng
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17793 - 17802
  • [4] A LIGHTWEIGHT MODEL FOR DEEP FRAME PREDICTION IN VIDEO CODING
    Choi, Hyomin
    Bajic, Ivan, V
    [J]. 2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 1122 - 1126
  • [5] Filtration network: A frame sampling strategy via deep reinforcement learning for video captioning
    Qian, Tiancheng
    Mei, Xue
    Xu, Pengxiang
    Ge, Kangqi
    Qiu, Zhelei
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (06) : 11085 - 11097
  • [6] Unsupervised Learning of Visual Representations via Rotation and Future Frame Prediction for Video Retrieval
    Kumar, Vidit
    Tripathi, Vikas
    Pant, Bhaskar
    [J]. ADVANCES IN COMPUTING AND DATA SCIENCES, PT I, 2021, 1440 : 701 - 710
  • [7] DEEP REINFORCEMENT LEARNING FOR VIDEO PREDICTION
    Ho, Yung-Han
    Cho, Chuan-Yuan
    Peng, Wen-Hsiao
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 604 - 608
  • [8] Video Compression via Inter-frame Chroma Prediction
    Huang, Rulin
    Li, Shaohui
    Dai, Wenrui
    Luo, Jixiang
    Li, Chenglin
    Zou, Junni
    Xiong, Hongkai
    [J]. DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 457 - 457
  • [9] A Review on Deep Learning Techniques for Video Prediction
    Oprea, Sergiu
    Martinez-Gonzalez, Pablo
    Garcia-Garcia, Alberto
    Castro-Vargas, John Alejandro
    Orts-Escolano, Sergio
    Garcia-Rodriguez, Jose
    Argyros, Antonis
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (06) : 2806 - 2826
  • [10] Bi-prediction Enhancement with Deep Frame Prediction Network for Versatile Video Coding
    Tao, Hao
    Qian, Jian
    Yu, Li
    Wang, Hongkui
    [J]. 2021 DATA COMPRESSION CONFERENCE (DCC 2021), 2021, : 374 - 374