Human Action Recognition by Learning Spatio-Temporal Features With Deep Neural Networks

被引:76
|
作者
Wang, Lei [1 ,2 ]
Xu, Yangyang [1 ]
Cheng, Jun [1 ,2 ]
Xia, Haiying [3 ]
Yin, Jianqin [4 ]
Wu, Jiaji [5 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Guangdong Prov Key Lab Robot & Intelligent Syst, Shenzhen 518055, Peoples R China
[2] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
[3] Guangxi Normal Univ, Guilin 541000, Peoples R China
[4] Beijing Univ Posts & Telecommun, Sch Automat, Beijing 100876, Peoples R China
[5] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
来源
IEEE ACCESS | 2018年 / 6卷
基金
中国国家自然科学基金;
关键词
Artificial intelligent; human action recognition; attention model; deep neural networks; robotic system;
D O I
10.1109/ACCESS.2018.2817253
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human action recognition is one of the fundamental challenges in robotics systems. In this paper, we propose one lightweight action recognition architecture based on deep neural networks just using RGB data. The proposed architecture consists of convolution neural network (CNN), long short-term memory (LSTM) units, and temporal-wise attention model. First, the CNN is used to extract spatial features to distinguish objects from the background with both local and semantic characteristics. Second, two kinds of LSTM networks are performed on the spatial feature maps of different CNN layers (pooling layer and fully-connected layer) to extract temporal motion features. Then, one temporal-wise attention model is designed after the LSTM to learn which parts in which frames are more important. Lastly, a joint optimization module is designed to explore intrinsic relations between two kinds of LSTM features. Experimental results demonstrate the efficiency of the proposed method.
引用
收藏
页码:17913 / 17922
页数:10
相关论文
共 50 条
  • [1] Human Action Recognition Based on Spatio-temporal Features
    Sawant, Nikhil
    Biswas, K. K.
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2009, 5909 : 357 - 362
  • [2] Spatio-temporal Semantic Features for Human Action Recognition
    Liu, Jia
    Wang, Xiaonian
    Li, Tianyu
    Yang, Jie
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2012, 6 (10): : 2632 - 2649
  • [3] Deep Neural Networks for Learning Spatio-Temporal Features From Tomography Sensors
    Costilla-Reyes, Omar
    Scully, Patricia
    Ozanyan, Krikor B.
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2018, 65 (01) : 645 - 653
  • [4] Depthwise Spatio-Temporal STFT Convolutiona Neural Networks for Human Action Recognition
    Kumawat, Sudhakar
    Verma, Manisha
    Nakashima, Yuta
    Raman, Shanmuganathan
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 4839 - 4851
  • [5] Online Spatio-Temporal Learning in Deep Neural Networks
    Bohnstingl, Thomas
    Wozniak, Stanislaw
    Pantazi, Angeliki
    Eleftheriou, Evangelos
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 8894 - 8908
  • [6] A Spatio-Temporal Deep Learning Approach For Human Action Recognition in Infrared Videos
    Shah, Anuj K.
    Ghosh, Ripul
    Akula, Aparna
    [J]. OPTICS AND PHOTONICS FOR INFORMATION PROCESSING XII, 2018, 10751
  • [7] Accelerated Learning of Discriminative Spatio-temporal Features for Action Recognition
    Varshney, Munender
    Rameshan, Renu
    [J]. 2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
  • [8] Learning Spatio-Temporal Features with 3D Residual Networks for Action Recognition
    Hara, Kensho
    Kataoka, Hirokatsu
    Satoh, Yutaka
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 3154 - 3160
  • [9] Learning Long-Term Temporal Features With Deep Neural Networks for Human Action Recognition
    Yu, Sheng
    Xie, Li
    Liu, Lin
    Xia, Daoxun
    [J]. IEEE ACCESS, 2020, 8 : 1840 - 1850
  • [10] Leveraging spatio-temporal features using graph neural networks for human activity recognition
    Raj, M. S. Subodh
    George, Sudhish N.
    Raja, Kiran
    [J]. PATTERN RECOGNITION, 2024, 150