Video Frame Prediction via Deep Learning

被引：0

作者：

Yilmaz, M. Akin ^{[1
]}

Tekalp, A. Murat ^{[1
]}

机构：

[1] Koc Univ, Elekt Elekt Miihendisligi Bolumu, Istanbul, Turkey

来源：

2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU) | 2020年

关键词：

frame prediction; deep learning; recurrent network architectures; stateful training; convolutional network architectures;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper provides new results over our previous work presented in ICIP 2019 on the performance of learned frame prediction architectures and associated training methods. More specifically, we show that using an end-to-end residual connection in the fully convolutional neural network (FCNN) provides improved performance. In order to provide comparative results, we trained a residual FCNN, a convolutional RNN (CRNN), and a convolutional long-short term memory (CLSTM) network for next frame prediction using the mean square loss. We performed both stateless and stateful training for recurrent networks. Experimental results show that the residual FCNN architecture performs the best in terms of peak signal to noise ratio (PSNR) at the expense of higher training and test (inference) computational complexity. The CRNN can be stably and efficiently trained using the stateful truncated backpropagation through time procedure, and requires an order of magnitude less inference runtime to achieve an acceptable performance in near real-time.

引用

页数：4

共 50 条

[21] Deep Learning based Prediction Model for Adaptive Video Streaming
Lekharu, Anirban
Moulii, K. Y.
Sur, Arijit
Sarkar, Arnab
[J]. 2020 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2020,
[22] TRANSFER LEARNING WITH DEEP NETWORKS FOR SALIENCY PREDICTION IN NATURAL VIDEO
Chaabouni, Souad
Benois-Pineau, Jenny
Ben Amari, Chokri
[J]. 2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 1604 - 1608
[23] DEEP LEARNING FOR MULTIMODAL-BASED VIDEO INTERESTINGNESS PREDICTION
Shen, Yuesong
Demarty, Claire-Helene
Duong, Ngoc Q. K.
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 1003 - 1008
[24] DeepVS: A Deep Learning Based Video Saliency Prediction Approach
Jiang, Lai
Xu, Mai
Liu, Tie
Qiao, Minglang
Wang, Zulin
[J]. COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 625 - 642
[25] Application of Deep Learning Techniques to Video QoE Prediction in Smartphones
Cardenas-Angelat, Carlos
Banos Polglase, Janie
Vaca-Rubio, Cristian J.
Carmen Aguayo-Torres, Mari
[J]. 2019 EUROPEAN CONFERENCE ON NETWORKS AND COMMUNICATIONS (EUCNC), 2019, : 252 - 256
[26] Deep frame interpolation for video compression
Begaint, Jean
Galpin, Franck
Guillotel, Philippe
Guillemot, Christine
[J]. 2019 DATA COMPRESSION CONFERENCE (DCC), 2019, : 556 - 556
[27] Deep Bayesian Video Frame Interpolation
Yu, Zhiyang
Zhang, Yu
Xiang, Xujie
Zou, Dongqing
Chen, Xijun
Ren, Jimmy S.
[J]. COMPUTER VISION - ECCV 2022, PT XV, 2022, 13675 : 144 - 160
[28] Deep Video Prediction Network-ased Inter-Frame Coding in HEVC
Lee, Jung-Kyung
Kim, Nayoung
Cho, Seunghyun
Kang, Je-Won
[J]. IEEE ACCESS, 2020, 8 : 95906 - 95917
[29] Deep Reference Frame Interpolation based Inter Prediction Enhancement for Versatile Video Coding
Jia, Jianghao
Liu, Zizheng
Xu, Xiaozhong
Liu, Shan
Chen, Zhenzhong
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
[30] Deep Video Frame Interpolation Using Cyclic Frame Generation
Liu, Yu-Lun
Liao, Yi-Tung
Lin, Yen-Yu
Chuang, Yung-Yu
[J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8794 - 8802

← 1 2 3 4 5 →