A motion-aware ConvLSTM network for action recognition

Cited by: 36
Authors
Majd, Mahshid [1]
Safabakhsh, Reza [2, 3]
Affiliations
[1] Amirkabir Univ Technol, Artificial Intelligence & Robot, Tehran, Iran
[2] Amirkabir Univ Technol, Dept Comp Engn, Tehran, Iran
[3] Amirkabir Univ Technol, Comp Vis Lab, Tehran, Iran
Keywords
Human action recognition; Deep learning; Convolutional networks; LSTM; ConvLSTM; GOING DEEPER;
DOI
10.1007/s10489-018-1395-8
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Human action recognition is an emerging goal of computer vision with several applications such as video surveillance and human-computer interaction. Despite many attempts to develop deep architectures to learn the spatio-temporal features of video, hand-crafted optical flow is still an important part of the recognition process. To engage the motion features deeply inside the learning process, we propose a spatio-temporal video recognition network where a motion-aware long short-term memory module is introduced to estimate the motion flow along with extracting spatio-temporal features. A specific optical flow estimator is subsumed which is based on kernelized cross correlation. The proposed network can be used without any extra learning process and there is no need to pre-compute and store the optical flow. Extensive experiments on two action recognition benchmarks verify the effectiveness of the proposed approach.
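The abstract mentions a cross-correlation-based flow estimator. As a rough illustration only, the sketch below estimates a single global translation between two frames with plain circular cross-correlation computed via FFT. It is not the kernelized estimator of the paper, and it is not placed inside a ConvLSTM; the function name `estimate_shift` and the synthetic test frames are assumptions made for this example.

```python
import numpy as np

def estimate_shift(prev, curr):
    """Estimate the integer (dy, dx) translation that maps prev onto curr.

    Uses plain circular cross-correlation (an assumption for illustration;
    the paper describes a kernelized variant that is not reproduced here).
    """
    # Remove the mean so the correlation peak reflects structure, not brightness.
    prev = prev - prev.mean()
    curr = curr - curr.mean()
    # Cross-correlation theorem: corr = IFFT(FFT(curr) * conj(FFT(prev))).
    corr = np.fft.ifft2(np.fft.fft2(curr) * np.conj(np.fft.fft2(prev))).real
    # The location of the peak gives the shift, modulo the frame size.
    peak = np.unravel_index(np.argmax(corr), corr.shape)
    # Convert wrapped indices into signed displacements.
    shift = [p if p <= n // 2 else p - n for p, n in zip(peak, corr.shape)]
    return tuple(int(s) for s in shift)

# Synthetic example: frame1 is frame0 moved 3 rows down and 5 columns left.
rng = np.random.default_rng(0)
frame0 = rng.random((32, 32))
frame1 = np.roll(frame0, shift=(3, -5), axis=(0, 1))
shift = estimate_shift(frame0, frame1)
```

A dense flow field would require applying such a correlation per local patch rather than once per frame; the global version is kept here only because it is short and easy to verify.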
Pages: 2515-2521
Number of pages: 7
Related papers
50 in total
  • [1] A motion-aware ConvLSTM network for action recognition
    Mahshid Majd
    Reza Safabakhsh
    [J]. Applied Intelligence, 2019, 49 : 2515 - 2521
  • [2] SiamMAST: Siamese motion-aware spatio-temporal network for video action recognition
    Lu, Xuemin
    Quan, Wei
    Reformat, Marek
    Zhao, Haiquan
    Chen, Jim X.
    [J]. VISUAL COMPUTER, 2024, 40 (05) : 3163 - 3181
  • [3] SiamMAST: Siamese motion-aware spatio-temporal network for video action recognition
    Xuemin Lu
    Wei Quan
    Marek Reformat
    Haiquan Zhao
    Jim X. Chen
    [J]. The Visual Computer, 2024, 40 : 3163 - 3181
  • [4] A Semantic and Motion-Aware Spatiotemporal Transformer Network for Action Detection
    Korban, Matthew
    Youngs, Peter
    Acton, Scott T.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 6055 - 6069
  • [5] Motion-Aware Deep Video Coding Network
    Khan, Rida
    Liu, Ying
    [J]. BIG DATA II: LEARNING, ANALYTICS, AND APPLICATIONS, 2020, 11395
  • [6] Fully Motion-Aware Network for Video Object Detection
    Wang, Shiyao
    Zhou, Yucong
    Yan, Junjie
    Deng, Zhidong
    [J]. COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 : 557 - 573
  • [7] Motion-Aware Feature Enhancement Network for Video Prediction
    Lin, Xue
    Zou, Qi
    Xu, Xixia
    Huang, Yaping
    Tian, Yi
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (02) : 688 - 700
  • [8] Motion-Aware Video Frame Interpolation
    Han, Pengfei
    Zhang, Fuhua
    Zhao, Bin
    Li, Xuelong
    [J]. NEURAL NETWORKS, 2024, 178
  • [9] Motion-Aware Video Quality Assessment
    Arvanitidou, Marina Georgia
    Sikora, Thomas
    [J]. 2017 FIFTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2017 : 2042 - 2045
  • [10] MVFI-Net: Motion-Aware Video Frame Interpolation Network
    Lin, Xuhu
    Zhao, Lili
    Liu, Xi
    Chen, Jianwen
    [J]. COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 340 - 356