Motion-Aware Feature Enhancement Network for Video Prediction

被引:18
|
作者
Lin, Xue [1 ]
Zou, Qi [1 ]
Xu, Xixia [1 ]
Huang, Yaping [1 ]
Tian, Yi [1 ]
机构
[1] Beijing Jiaotong Univ, Beijing Key Lab Traff Data Anal & Min, Beijing 100044, Peoples R China
关键词
Predictive models; Encoding; Multiprotocol label switching; Stochastic processes; Dynamics; Feature extraction; Task analysis; Video prediction; unsupervised learning; attention mechanism; perceptual loss;
D O I
10.1109/TCSVT.2020.2987141
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Video prediction is challenging, due to the pixel-level precision requirement and the difficulty in capturing scene dynamics. Most approaches tackle the problems by pixel-level reconstruction objectives and two decomposed branches, which still suffer from blurry generations or dramatic degradations in long-term prediction. In this paper, we propose a Motion-Aware Feature Enhancement (MAFE) network for video prediction to produce realistic future frames and achieve relatively long-term predictions. First, a Channel-wise and Spatial Attention (CSA) module is designed to extract motion-aware features, which enhances the contribution of important motion details during encoding, and subsequently improves the discriminability of attention map for the frame refinement. Second, a Motion Perceptual Loss (MPL) is proposed to guide the learning of temporal cues, which benefits to robust long-term video prediction. Extensive experiments on three human activity video datasets: KTH, Human3.6M, and PennAction demonstrate the effectiveness of the proposed video prediction model compared with the state-of-the-art approaches.
引用
收藏
页码:688 / 700
页数:13
相关论文
共 50 条
  • [1] Motion-Aware Deep Video Coding Network
    Khan, Rida
    Liu, Ying
    [J]. BIG DATA II: LEARNING, ANALYTICS, AND APPLICATIONS, 2020, 11395
  • [2] MAU: A Motion-Aware Unit for Video Prediction and Beyond
    Chang, Zheng
    Zhang, Xinfeng
    Wang, Shanshe
    Siwei
    Ye, Yan
    Xiang, Xinguang
    Gao, Wen
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [3] Fully Motion-Aware Network for Video Object Detection
    Wang, Shiyao
    Zhou, Yucong
    Yan, Junjie
    Deng, Zhidong
    [J]. COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 : 557 - 573
  • [4] Motion-Aware Video Frame Interpolation
    Han, Pengfei
    Zhang, Fuhua
    Zhao, Bin
    Li, Xuelong
    [J]. NEURAL NETWORKS, 2024, 178
  • [5] Motion-Aware Video Quality Assessment
    Arvanitidou, Marina Georgia
    Sikora, Thomas
    [J]. 2017 FIFTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2017, : 2042 - 2045
  • [6] MVFI-Net: Motion-Aware Video Frame Interpolation Network
    Lin, Xuhu
    Zhao, Lili
    Liu, Xi
    Chen, Jianwen
    [J]. COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 340 - 356
  • [7] Motion-Aware Memory Network for Fast Video Salient Object Detection
    Zhao, Xing
    Liang, Haoran
    Li, Peipei
    Sun, Guodao
    Zhao, Dongdong
    Liang, Ronghua
    He, Xiaofei
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 709 - 721
  • [8] Motion-Aware Temporal Coherence for Video Resizing
    Wang, Yu-Shuen
    Fu, Hongbo
    Sorkine, Olga
    Lee, Tong-Yee
    Seidel, Hans-Peter
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2009, 28 (05): : 1 - 10
  • [9] Motion-Aware Gradient Domain Video Composition
    Chen, Tao
    Zhu, Jun-Yan
    Shamir, Ariel
    Hu, Shi-Min
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (07) : 2532 - 2544
  • [10] Motion-Aware Rapid Video Saliency Detection
    Guo, Fang
    Wang, Wenguan
    Shen, Ziyi
    Shen, Jianbing
    Shao, Ling
    Tao, Dacheng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (12) : 4887 - 4898