Human action recognition in videos with articulated pose information by deep networks

Cited by: 9
Authors
Farrajota, M. [1 ]
Rodrigues, Joao M. F. [1 ]
du Buf, J. M. H. [1 ]
Affiliations
[1] Univ Algarve, Vis Lab, LARSyS, P-8005139 Faro, Portugal
Keywords
Human action; Human pose; ConvNet; Neural networks; Auto-encoders; LSTM
DOI
10.1007/s10044-018-0727-y
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Action recognition is of great importance in understanding human motion from video. It is an important topic in computer vision due to its many applications, such as video surveillance, human-machine interaction and video retrieval. One key problem is to automatically recognize low-level actions and high-level activities of interest. This paper proposes a way to cope with low-level actions by combining information of human body joints to aid action recognition. This is achieved by using high-level features computed by a convolutional neural network which was pre-trained on ImageNet, with articulated body joints as low-level features. These features are then fed to a Long Short-Term Memory network to learn the temporal dependencies of an action. For pose prediction, we focus on articulated relations between body joints. We employ a series of residual auto-encoders to produce multiple predictions which are then combined to provide a likelihood map of body joints. In the network topology, features are processed across all scales which capture the various spatial relationships associated with the body. Repeated bottom-up and top-down processing with intermediate supervision of each auto-encoder network is applied. We demonstrate state-of-the-art results on the popular FLIC, LSP and UCF Sports datasets.
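The pipeline described in the abstract (per-frame CNN features combined with body-joint information, then passed to an LSTM) might be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not say how the two feature types are combined, so plain concatenation is assumed here, and every function name, dimension and weight initialization below is hypothetical.

```python
import numpy as np

def fuse_features(cnn_feats, joints):
    """Concatenate per-frame CNN features with flattened joint coordinates.

    ASSUMPTION: concatenation is used as the fusion step for illustration only.
    cnn_feats: (T, D) array; joints: (T, J, 2) array -> (T, D + 2*J) array.
    """
    T = cnn_feats.shape[0]
    return np.concatenate([cnn_feats, joints.reshape(T, -1)], axis=1)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_forward(seq, W, U, b):
    """Run a single-layer LSTM over seq (T, F); return the final hidden state.

    W: (4H, F), U: (4H, H), b: (4H,). Gate order: input, forget, cell, output.
    """
    H = U.shape[1]
    h = np.zeros(H)
    c = np.zeros(H)
    for x in seq:
        z = W @ x + U @ h + b
        i, f, g, o = z[:H], z[H:2 * H], z[2 * H:3 * H], z[3 * H:]
        c = sigmoid(f) * c + sigmoid(i) * np.tanh(g)  # update cell state
        h = sigmoid(o) * np.tanh(c)                   # update hidden state
    return h

def init_params(feat_dim, hidden, n_classes, seed=0):
    """Random, untrained weights (hypothetical sizes, for shape-checking only)."""
    rng = np.random.default_rng(seed)
    return {
        "W": rng.normal(0.0, 0.1, (4 * hidden, feat_dim)),
        "U": rng.normal(0.0, 0.1, (4 * hidden, hidden)),
        "b": np.zeros(4 * hidden),
        "W_out": rng.normal(0.0, 0.1, (n_classes, hidden)),
        "b_out": np.zeros(n_classes),
    }

def classify(seq, params):
    """Map the last LSTM hidden state to a softmax distribution over actions."""
    h = lstm_forward(seq, params["W"], params["U"], params["b"])
    logits = params["W_out"] @ h + params["b_out"]
    e = np.exp(logits - logits.max())
    return e / e.sum()
```

For example, `classify(fuse_features(cnn_feats, joints), init_params(feat_dim, hidden, n_classes))` returns one probability per action class for a clip; with untrained weights the values are meaningless and only the data flow is illustrated.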
Pages: 1307-1318
Page count: 12
Related papers
50 in total
  • [21] Emotion and Gesture Guided Action Recognition in Videos Using Supervised Deep Networks
    Nigam, Nitika
    Dutta, Tanima
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (05) : 2546 - 2556
  • [22] Joint Dynamic Pose Image and Space Time Reversal for Human Action Recognition from Videos
    Liu, Mengyuan
    Meng, Fanyang
    Chen, Chen
    Wu, Songtao
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8762 - 8769
  • [23] DeepPear: Deep Pose Estimation and Action Recognition
    Jhuang, You-Ying
    Tsai, Wen-Jiin
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7119 - 7125
  • [24] Human Action Recognition Using Deep Neural Networks
    Koli, Rashmi R.
    Bagban, Tanveer I.
    [J]. PROCEEDINGS OF THE 2020 FOURTH WORLD CONFERENCE ON SMART TRENDS IN SYSTEMS, SECURITY AND SUSTAINABILITY (WORLDS4 2020), 2020, : 376 - 380
  • [25] Temporal Segment Networks for Action Recognition in Videos
    Wang, Limin
    Xiong, Yuanjun
    Wang, Zhe
    Qiao, Yu
    Lin, Dahua
    Tang, Xiaoou
    Van Gool, Luc
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (11) : 2740 - 2755
  • [26] Deep metric learning for open-set human action recognition in videos
    Gutoski, Matheus
    Lazzaretti, Andre Eugenio
    Lopes, Heitor Silverio
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (04) : 1207 - 1220
  • [27] Deep metric learning for open-set human action recognition in videos
    Matheus Gutoski
    André Eugênio Lazzaretti
    Heitor Silvério Lopes
    [J]. Neural Computing and Applications, 2021, 33 : 1207 - 1220
  • [28] ACTION RECOGNITION IN STILL IMAGES USING A COMBINATION OF HUMAN POSE AND CONTEXT INFORMATION
    Zheng, Yin
    Zhang, Yu-Jin
    Li, Xue
    Liu, Bao-Di
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 785 - 788
  • [29] HUMAN ACTIVITY DETECTION AND ACTION RECOGNITION IN VIDEOS USING CONVOLUTIONAL NEURAL NETWORKS
    Basavaiah, Jagadeesh
    Patil, Chandrashekar Mohan
    [J]. JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY-MALAYSIA, 2020, 19 (02) : 157 - 183
  • [30] Temporally guided articulated hand pose tracking in surgical videos
    Louis, Nathan
    Zhou, Luowei
    Yule, Steven J.
    Dias, Roger D.
    Manojlovich, Milisa
    Pagani, Francis D.
    Likosky, Donald S.
    Corso, Jason J.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 18 (01) : 117 - 125