Video spatio-temporal generative adversarial network for local action generation

被引:0
|
作者
Liu, Xuejun [1 ]
Guo, Jiacheng [1 ]
Cui, Zhongji [1 ]
Liu, Ling [2 ]
Yan, Yong [1 ]
Sha, Yun [1 ]
机构
[1] Beijing Inst Petrochem Technol, Coll Informat Engn, Beijing, Peoples R China
[2] Beijing Inst Graph Commun, Coll Art & Design, Beijing, Peoples R China
关键词
video generation; deep learning; generative adversarial networks; two-stage model;
D O I
10.1117/1.JEI.32.5.053003
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Generating action videos in future scenes based on static images can make computer vision systems to be better applied for video understanding and intelligent decision-making. However, current models pay more attention to the motion trend of the generated objects, and the processing effect on local details is not ideal. The local features of the generated video will have the problem of blurred frames and incoherent motion. This paper proposes a two-stage model, video spatio-temporal generative adversarial network (VSTGAN), which consists of two GAN networks, such as temporal network and spatial network (S-net). The model fully combines the advantages of CNNs, recurrent neural networks (RNNs), and GANs to decompose the complex spatiotemporal generation problem into temporal and spatial dimensions. Therefore, VSTGAN can focus on local features from the above dimensions respectively. In the temporal dimension, we propose an RNN unit, the convolutional attention unit (ConvAU), which uses the convolutional attention module to dynamically generate weights to update the hidden state. Thus, T-net uses the ConvAU to generate local dynamics. In the spatial dimension, S-net uses CNNs and attention modules to perform resolution reconstruction of the generated local dynamics for video generation. We build two small-sample datasets and validate our approach on these two new datasets and the KTH public dataset. The results show that our approach can effectively generate local details in future action videos and that the model performance on small-sample datasets is competitive with the state-of-the-art in video generation.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] STemGAN: spatio-temporal generative adversarial network for video anomaly detection
    Rituraj Singh
    Krishanu Saini
    Anikeit Sethi
    Aruna Tiwari
    Sumeet Saurav
    Sanjay Singh
    [J]. Applied Intelligence, 2023, 53 : 28133 - 28152
  • [2] STemGAN: spatio-temporal generative adversarial network for video anomaly detection
    Singh, Rituraj
    Saini, Krishanu
    Sethi, Anikeit
    Tiwari, Aruna
    Saurav, Sumeet
    Singh, Sanjay
    [J]. APPLIED INTELLIGENCE, 2023, 53 (23) : 28133 - 28152
  • [3] Spatio-temporal generative adversarial network for gait anonymization
    Tieu, Ngoc-Dung T.
    Nguyen, Huy H.
    Hoang-Quoc Nguyen-Son
    Yamagishi, Junichi
    Echizen, Isao
    [J]. JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2019, 46 : 307 - 319
  • [4] Spatio-Temporal Generative Adversarial Networks
    Qin, Chao
    Gao, Xiaoguang
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2020, 29 (04) : 623 - 631
  • [5] Spatio-Temporal Generative Adversarial Networks
    QIN Chao
    GAO Xiaoguang
    [J]. Chinese Journal of Electronics, 2020, 29 (04) : 623 - 631
  • [6] Spatio-Temporal Learning for Video Deblurring based on Two-Stream Generative Adversarial Network
    Song, Liyao
    Wang, Quan
    Lie, Haiwei
    Fan, Jiancun
    Hu, Bingliang
    [J]. NEURAL PROCESSING LETTERS, 2021, 53 (04) : 2701 - 2714
  • [7] Spatio-Temporal Learning for Video Deblurring based on Two-Stream Generative Adversarial Network
    Liyao Song
    Quan Wang
    Haiwei Li
    Jiancun Fan
    Bingliang Hu
    [J]. Neural Processing Letters, 2021, 53 : 2701 - 2714
  • [8] Distributed spatio-temporal generative adversarial networks
    QIN Chao
    GAO Xiaoguang
    [J]. Journal of Systems Engineering and Electronics, 2020, 31 (03) : 578 - 592
  • [9] STGAN: Spatio-Temporal Generative Adversarial Network for Traffic Data Imputation
    Yuan, Ye
    Zhang, Yong
    Wang, Boyue
    Peng, Yuan
    Hu, Yongli
    Yin, Baocai
    [J]. IEEE TRANSACTIONS ON BIG DATA, 2023, 9 (01) : 200 - 211
  • [10] Distributed spatio-temporal generative adversarial networks
    Qin Chao
    Gao Xiaoguang
    [J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2020, 31 (03) : 578 - 592