Human skeleton pose and spatio-temporal feature-based activity recognition using ST-GCN

Cited by: 3
Authors
Lovanshi, Mayank [1]
Tiwari, Vivek [1, 2]
Affiliations
[1] Int Inst Informat Technol IIIT, Naya Raipur, India
[2] ABV Indian Inst Informat Technol & Management, Gwalior, India
Keywords
Activity recognition; Pose estimation; ST-GCN; Spatio-temporal feature; Skeleton joints; SPATIAL-DISTRIBUTION; UNIFIED FRAMEWORK; GRADIENTS; MODEL
DOI
10.1007/s11042-023-16001-9
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Skeleton-based human activity recognition has recently attracted considerable attention because skeleton data is robust to changes in lighting, body size, camera viewpoint, and background clutter. The Spatial-Temporal Graph Convolutional Network (ST-GCN) model has been shown to learn spatial and temporal dependencies effectively from skeleton data. However, making efficient use of in-depth 3D skeleton information remains a significant challenge, particularly for human joint motion patterns and the linkages between joints. This study proposes a custom ST-GCN model that operates on skeleton joints for human activity recognition. Special attention is given to spatial and temporal features, which are then fed to the classification model for better pose estimation. A comparative study of activity recognition is presented on large-scale datasets such as NTU-RGB-D, Kinetics-Skeleton, and Florence 3D. In Top-1 accuracy, the custom ST-GCN model outperforms the state-of-the-art method on NTU-RGB-D, Kinetics-Skeleton, and Florence 3D by 0.7%, 1.25%, and 1.92%, respectively. In Top-5 accuracy, it improves on that method by 0.5%, 0.73%, and 1.52%, respectively. These results suggest that the presented graph-based topologies capture the dynamics of a motion-based skeleton sequence better than some of the other approaches.
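The abstract reports results as Top-1 and Top-5 accuracy. As a reading aid, the following is a minimal sketch of how a Top-k accuracy figure is conventionally computed from per-class scores. It is not taken from the paper: the function name `top_k_accuracy`, the score matrix, and the labels are all hypothetical and chosen only for illustration.

```python
def top_k_accuracy(scores, labels, k=1):
    """Fraction of samples whose true label is among the k highest-scoring classes."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not scores:
        raise ValueError("at least one sample is required")
    hits = 0
    for row, label in zip(scores, labels):
        # Class indices sorted by descending score; only the first k count.
        ranked = sorted(range(len(row)), key=lambda c: row[c], reverse=True)
        if label in ranked[:k]:
            hits += 1
    return hits / len(labels)


# Hypothetical scores for 4 clips over 3 activity classes (illustrative only,
# not data from the paper).
scores = [
    [0.7, 0.2, 0.1],
    [0.1, 0.3, 0.6],
    [0.5, 0.4, 0.1],
    [0.2, 0.5, 0.3],
]
labels = [0, 1, 1, 2]

top1 = top_k_accuracy(scores, labels, k=1)  # only the first clip is ranked first
top2 = top_k_accuracy(scores, labels, k=2)  # every true label is in the top two
```

With k=5 the same function would give the Top-5 figure; the gap between Top-1 and Top-k indicates how often the correct activity is ranked highly without being the first choice.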
Pages: 12705-12730
Page count: 26