Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition

Cited by: 860

Authors:

Liu, Jun [1]

Shahroudy, Amir [1]

Xu, Dong [2]

Wang, Gang [1]

Affiliations:

[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore

[2] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW, Australia

Source:

COMPUTER VISION - ECCV 2016, PT III | 2016 / Vol. 9907

Keywords:

3D action recognition; Recurrent neural networks; Long short-term memory; Trust gate; Spatio-temporal analysis; SEQUENCE;

DOI:

10.1007/978-3-319-46487-9_50

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

3D action recognition - analysis of human actions based on 3D skeleton data - has recently become popular due to its succinctness, robustness, and view-invariant representation. Recent attempts on this problem have suggested developing RNN-based learning methods to model the contextual dependency in the temporal domain. In this paper, we extend this idea to spatio-temporal domains to analyze the hidden sources of action-related information within the input data over both domains concurrently. Inspired by the graphical structure of the human skeleton, we further propose a more powerful tree-structure based traversal method. To handle the noise and occlusion in 3D skeleton data, we introduce a new gating mechanism within LSTM to learn the reliability of the sequential input data and accordingly adjust its effect on updating the long-term context information stored in the memory cell. Our method achieves state-of-the-art performance on 4 challenging benchmark datasets for 3D human action analysis.


Pages: 816-833

Page count: 18
