Novel CNN approach for video prediction based on FitVid

被引:0
|
作者
Watanabe, Taiju [1 ]
Takahiro, Shindo [1 ]
Watanabe, Hiroshi [1 ]
机构
[1] Waseda Univ, Sch Fundamental Sci & Engn, Shillman Hall,3-14-9 Okubo,Shinjuku, Tokyo 1690072, Japan
关键词
Video prediction; SimVP; FitVid; RNN; ViT;
D O I
10.1117/12.2665224
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video prediction is a task in computer vision that predicts future frames from the past few frames of video. In video prediction, a simple CNN-based approach called SimVP has marked remarkable performance without using RNN or vison transformer (ViT). In this paper, we propose a model structure to improve performance of video prediction based on FitVid. FitVid is a regression-based method of predicting future videos using not only video but also motion. We focus on video prediction only conditioned on videos. To this goal, we introduce network modules used in SimVP to FitVid. Experimental results show that the proposed structure shows better prediction accuracy compared to SimVP.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] FitVid: Responsive and Flexible Video Content Adaptation
    Kim, Jeongyeon
    Choi, Yubin
    Kahng, Minsuk
    Kim, Juho
    PROCEEDINGS OF THE 2022 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI' 22), 2022,
  • [2] A Video Shot Boundary Detection Approach based on CNN Feature
    Liang, Rui
    Zhu, Qingxin
    Wei, Honglei
    Liao, Shujiao
    2017 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2017, : 489 - 494
  • [3] CNN Based Optimal Intra Prediction Mode Estimation in Video Coding
    Yokoyama, Ryota
    Tahara, Masahiko
    Takeuchi, Masaru
    Sun, Heming
    Matsuo, Yasutaka
    Katto, Jiro
    2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2020, : 151 - 152
  • [4] CNN-Based Time Series Decomposition Model for Video Prediction
    Lee, Jinyoung
    Kim, Gyeyoung
    IEEE ACCESS, 2024, 12 : 131205 - 131216
  • [5] A novel approach based on a modified mask R-CNN for the weight prediction of live pigs
    Xie, Chuanqi
    Cang, Yuji
    Lou, Xizhong
    Xiao, Hua
    Xu, Xing
    Li, Xiangjun
    Zhou, Weidong
    ARTIFICIAL INTELLIGENCE IN AGRICULTURE, 2024, 12 : 19 - 28
  • [6] Survival Prediction Based on Histopathology Imaging and Clinical Data: A Novel, Whole Slide CNN Approach
    Agarwal, Saloni
    Abaker, Mohamedelfatih Eltigani Osman
    Daescu, Ovidiu
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT V, 2021, 12905 : 762 - 771
  • [7] A novel approach for software defect prediction using CNN and GRU based on SMOTE Tomek method
    Khleel, Nasraldeen Alnor Adam
    Nehez, Karoly
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2023, 60 (03) : 673 - 707
  • [8] A novel approach for software defect prediction using CNN and GRU based on SMOTE Tomek method
    Nasraldeen Alnor Adam Khleel
    Károly Nehéz
    Journal of Intelligent Information Systems, 2023, 60 : 673 - 707
  • [9] Continuous video stream pixel sensor: A CNN-LSTM based deep learning approach for mode shape prediction
    Yang, Ruoyu
    Singh, Shubhendu Kumar
    Tavakkoli, Mostafa
    Amiri, Nikta
    Karami, M. Amin
    Rai, Rahul
    STRUCTURAL CONTROL & HEALTH MONITORING, 2022, 29 (03):
  • [10] Comparing CNN and RNN for Prediction of Judgement in Video Interview Based on Facial Gestures
    Singhal, Nishank
    Singhal, Neetika
    Srishti
    2018 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2018, : 175 - 180