Video Frame Prediction via Deep Learning

Cited by: 0

Authors:

Yilmaz, M. Akin [1]

Tekalp, A. Murat [1]

Affiliations:

[1] Koc Univ, Elekt Elekt Muhendisligi Bolumu, Istanbul, Turkey

Source:

2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU) | 2020

Keywords:

frame prediction; deep learning; recurrent network architectures; stateful training; convolutional network architectures;

DOI:

Not available

Chinese Library Classification (CLC) codes:

TM [Electrical engineering]; TN [Electronics and communication technology];

Subject classification codes:

0808 ; 0809 ;

Abstract:

This paper extends our previous work, presented at ICIP 2019, with new results on the performance of learned frame prediction architectures and their associated training methods. More specifically, we show that using an end-to-end residual connection in the fully convolutional neural network (FCNN) improves performance. To provide comparative results, we trained a residual FCNN, a convolutional RNN (CRNN), and a convolutional long short-term memory (CLSTM) network for next-frame prediction using the mean squared error loss. We performed both stateless and stateful training for the recurrent networks. Experimental results show that the residual FCNN architecture performs best in terms of peak signal-to-noise ratio (PSNR), at the expense of higher training and test (inference) computational complexity. The CRNN can be trained stably and efficiently using the stateful truncated backpropagation through time procedure, and requires an order of magnitude less inference runtime to achieve acceptable performance in near real time.
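As a rough illustration of two ideas named in the abstract, the sketch below computes PSNR from the mean squared error and forms a residual next-frame prediction. It is not the authors' implementation. It assumes that the end-to-end residual connection adds a learned correction to the most recent input frame, which the abstract does not spell out. The names `residual_predict`, `correction_fn`, and `constant_motion` are hypothetical stand-ins for the trained network, and frames are plain lists of rows rather than tensors.

```python
import math


def mse(pred, target):
    """Mean squared error between two frames given as lists of rows."""
    diffs = [(p - t) ** 2
             for pred_row, target_row in zip(pred, target)
             for p, t in zip(pred_row, target_row)]
    return sum(diffs) / len(diffs)


def psnr(pred, target, peak=255.0):
    """PSNR in dB; `peak` is the maximum pixel value (8-bit frames assumed)."""
    err = mse(pred, target)
    if err == 0:
        return math.inf  # identical frames
    return 10.0 * math.log10(peak ** 2 / err)


def residual_predict(past_frames, correction_fn, peak=255.0):
    """Predict the next frame as the last frame plus a correction.

    `correction_fn` is a hypothetical placeholder for the network body;
    the sum is clipped to the valid pixel range [0, peak].
    """
    last = past_frames[-1]
    corr = correction_fn(past_frames)
    return [[min(peak, max(0.0, l + c)) for l, c in zip(last_row, corr_row)]
            for last_row, corr_row in zip(last, corr)]


def constant_motion(frames):
    """Toy correction (assumption): repeat the last frame-to-frame change."""
    prev, last = frames[-2], frames[-1]
    return [[l - p for l, p in zip(last_row, prev_row)]
            for last_row, prev_row in zip(last, prev)]


# Two 2x2 past frames whose pixels each increase by 2 per frame.
past = [[[10.0, 20.0], [30.0, 40.0]],
        [[12.0, 22.0], [32.0, 42.0]]]
target = [[14.0, 24.0], [34.0, 44.0]]
pred = residual_predict(past, constant_motion)
print(pred, psnr(pred, target))
```

In this formulation the network only has to model the change between consecutive frames, which is typically small for natural video. A practical version would operate on image tensors and replace `constant_motion` with the trained convolutional layers.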

Pages: 4
