A motion-aware ConvLSTM network for action recognition

Cited by: 35
Authors
Majd, Mahshid [1]
Safabakhsh, Reza [2, 3]
Affiliations
[1] Amirkabir Univ Technol, Artificial Intelligence & Robot, Tehran, Iran
[2] Amirkabir Univ Technol, Dept Comp Engn, Tehran, Iran
[3] Amirkabir Univ Technol, Comp Vis Lab, Tehran, Iran
Keywords
Human action recognition; Deep learning; Convolutional networks; LSTM; ConvLSTM; GOING DEEPER;
DOI
10.1007/s10489-018-1395-8
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Human action recognition is an emerging goal of computer vision with several applications such as video surveillance and human-computer interaction. Despite many attempts to develop deep architectures to learn the spatio-temporal features of video, hand-crafted optical flow is still an important part of the recognition process. To engage the motion features deeply inside the learning process, we propose a spatio-temporal video recognition network where a motion-aware long short-term memory module is introduced to estimate the motion flow along with extracting spatio-temporal features. A specific optical flow estimator is subsumed which is based on kernelized cross correlation. The proposed network can be used without any extra learning process and there is no need to pre-compute and store the optical flow. Extensive experiments on two action recognition benchmarks verify the effectiveness of the proposed approach.
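The abstract mentions an optical flow estimator based on kernelized cross correlation but gives no details. As a rough illustration of the underlying idea only, the sketch below uses plain (non-kernelized) circular cross-correlation to recover the displacement between two small patches. It is not the authors' estimator; the function names `circular_cross_correlation`, `estimate_shift`, `roll`, and `random_patch` are hypothetical, and the brute-force loops are chosen for readability rather than speed.

```python
import random


def circular_cross_correlation(prev, nxt):
    """Brute-force circular cross-correlation of two equally sized 2-D patches.

    corr[dy][dx] = sum over (y, x) of nxt[y + dy][x + dx] * prev[y][x],
    with indices wrapped around the patch borders.
    """
    h, w = len(prev), len(prev[0])
    corr = [[0.0] * w for _ in range(h)]
    for dy in range(h):
        for dx in range(w):
            total = 0.0
            for y in range(h):
                for x in range(w):
                    total += nxt[(y + dy) % h][(x + dx) % w] * prev[y][x]
            corr[dy][dx] = total
    return corr


def estimate_shift(prev, nxt):
    """Return the (dy, dx) shift at which the correlation response peaks."""
    corr = circular_cross_correlation(prev, nxt)
    h, w = len(corr), len(corr[0])
    dy, dx = max(
        ((y, x) for y in range(h) for x in range(w)),
        key=lambda p: corr[p[0]][p[1]],
    )
    # Map wrapped indices to signed displacements, e.g. 5 -> -3 when w == 8.
    if dy > h // 2:
        dy -= h
    if dx > w // 2:
        dx -= w
    return dy, dx


def roll(patch, sy, sx):
    """Circularly shift a patch by sy rows and sx columns (illustration helper)."""
    h, w = len(patch), len(patch[0])
    return [[patch[(y - sy) % h][(x - sx) % w] for x in range(w)] for y in range(h)]


def random_patch(h, w, seed=0):
    """Build a reproducible random patch (illustration helper)."""
    rng = random.Random(seed)
    return [[rng.random() for _ in range(w)] for _ in range(h)]
```

Calling `estimate_shift(patch, roll(patch, 2, -3))` on a textured patch should recover the applied displacement. A kernelized variant would replace the raw pixel products with a kernel function evaluated on the patches, and practical implementations typically compute the correlation in the Fourier domain.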
Pages: 2515-2521
Page count: 7
Related papers
50 in total
  • [21] Motion-Aware Decoding of Compressed-Sensed Video
    Liu, Ying
    Li, Ming
    Pados, Dimitris A.
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (03) : 438 - 444
  • [22] MAT: Motion-aware multi-object tracking
    Han, Shoudong
    Huang, Piao
    Wang, Hongwei
    Yu, En
    Liu, Donghaisheng
    Pan, Xiaofeng
    [J]. NEUROCOMPUTING, 2022, 476 : 75 - 86
  • [23] A motion-aware and temporal-enhanced Spatial-Temporal Graph Convolutional Network for skeleton-based human action segmentation
    Chai, Shurong
    Jain, Rahul Kumar
    Liu, Jiaqing
    Teng, Shiyu
    Tateyama, Tomoko
    Li, Yinhao
    Chen, Yen-Wei
    [J]. NEUROCOMPUTING, 2024, 580
  • [24] MAU: A Motion-Aware Unit for Video Prediction and Beyond
    Chang, Zheng
    Zhang, Xinfeng
    Wang, Shanshe
    Siwei
    Ye, Yan
    Xiang, Xinguang
    Gao, Wen
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [25] LEARNING MOTION-AWARE POLICIES FOR ROBUST VISUAL TRACKING
    Wang, Qianqian
    Zhuang, Liansheng
    Wang, Ning
    Zhou, Wengang
    Li, Houqiang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1786 - 1791
  • [26] MaCLR: Motion-Aware Contrastive Learning of Representations for Videos
    Xiao, Fanyi
    Tighe, Joseph
    Modolo, Davide
    [J]. COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 353 - 370
  • [27] Motion-Aware Correlation Filters for Online Visual Tracking
    Zhang, Yihong
    Yang, Yijin
    Zhou, Wuneng
    Shi, Lifeng
    Li, Demin
    [J]. SENSORS, 2018, 18 (11)
  • [28] Motion-Aware Robotic 3D Ultrasound
    Jiang, Zhongliang
    Wang, Hanyu
    Li, Zhenyu
    Grimm, Matthias
    Zhou, Mingchuan
    Eck, Ulrich
    Brecht, Sandra V.
    Lueth, Tim C.
    Wendler, Thomas
    Navab, Nassir
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 12494 - 12500
  • [29] Motion-Aware Dynamic Architecture for Efficient Frame Interpolation
    Choi, Myungsub
    Lee, Suyoung
    Kim, Heewon
    Lee, Kyoung Mu
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13819 - 13828
  • [30] MPSN: Motion-aware Pseudo-Siamese Network for indoor video head detection in buildings
    Sun, Kailai
    Ma, Xiaoteng
    Liu, Peng
    Zhao, Qianchuan
    [J]. BUILDING AND ENVIRONMENT, 2022, 222