Skeleton-Based Square Grid for Human Action Recognition With 3D Convolutional Neural Network

被引:14
|
作者
Ding, Wenwen [1 ]
Ding, Chongyang [2 ]
Li, Guang [2 ]
Liu, Kai [2 ]
机构
[1] Huaibei Normal Univ, Sch Math Sci, Hefei 235000, Anhui, Peoples R China
[2] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Peoples R China
来源
IEEE ACCESS | 2021年 / 9卷
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Skeleton; Three-dimensional displays; Spatiotemporal phenomena; Feature extraction; Convolution; Kernel; Convolutional neural networks; 3D convolutional neural networks; skeleton action recognition; neural network; attention mechanism; ATTENTION;
D O I
10.1109/ACCESS.2021.3059650
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional neural networks (CNNs) can effectively handle grid-structured data but not dynamic skeletons, which are usually expressed as graph structures. In this study, we first propose a skeleton-based square grid (SSG) for transforming dynamic skeletons into three-dimensional (3D) grid-structured data so that CNNs can be applied to such data. Each SSG contains a joint-based square grid (JSG) and a rigid-based square grid (RSG) based on intrinsic and extrinsic dependencies of various body parts, respectively. Next, to enhance the ability of deep features to capture the correlations among 3D grid-structured data, a two-stream 3D CNN is constructed to learn spatiotemporal features using the JSG and RSG sequences. Finally, we introduce a soft attention model that selectively focuses on the informative body parts in the skeleton sequences. We validate our model in terms of action recognition using three datasets: NTU RGB+D, Kinetics Motion, and SBU Kinect Interaction datasets. Our experimental results demonstrate the effectiveness of the proposed approach as well as its superior performance when compared with those of state-of-the-art methods.
引用
收藏
页码:54078 / 54089
页数:12
相关论文
共 50 条
  • [1] 3D skeleton-based action recognition with convolutional neural networks
    Van-Nam Hoang
    Thi-Lan Le
    Thanh-Hai Tran
    Hai-Vu
    Van-Toi Nguyen
    [J]. 2019 INTERNATIONAL CONFERENCE ON MULTIMEDIA ANALYSIS AND PATTERN RECOGNITION (MAPR), 2019,
  • [2] SKELETON-BASED HUMAN ACTION RECOGNITION USING SPATIAL TEMPORAL 3D CONVOLUTIONAL NEURAL NETWORKS
    Tu, Juanhui
    Liu, Mengyuan
    Liu, Hong
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [3] Dynamic Edge Convolutional Neural Network for Skeleton-Based Human Action Recognition
    Tasnim, Nusrat
    Baek, Joong-Hwan
    [J]. SENSORS, 2023, 23 (02)
  • [4] Learning Graph Convolutional Network for Skeleton-Based Human Action Recognition by Neural Searching
    Peng, Wei
    Hong, Xiaopeng
    Chen, Haoyu
    Zhao, Guoying
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 2669 - 2676
  • [5] SKELETON-BASED ACTION RECOGNITION WITH CONVOLUTIONAL NEURAL NETWORKS
    Li, Chao
    Zhong, Qiaoyong
    Xie, Di
    Pu, Shiliang
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
  • [6] Pyramidal Graph Convolutional Network for Skeleton-Based Human Action Recognition
    Li, Fanjia
    Zhu, Aichun
    Liu, Zhongyu
    Huo, Yu
    Xu, Yonggang
    Hua, Gang
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (14) : 16183 - 16191
  • [7] Convolutional relation network for skeleton-based action recognition
    Zhu, Jiagang
    Zou, Wei
    Zhu, Zheng
    Hu, Yiming
    [J]. NEUROCOMPUTING, 2019, 370 : 109 - 117
  • [8] Convolutional neural network with adaptive inferential framework for skeleton-based action recognition
    Huang, Hong'en
    Su, Hang
    Chang, Zhigang
    Yu, Mingyang
    Gao, Jialin
    Li, Xinzhe
    Zheng, Shibao
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 73
  • [9] A 3D graph convolutional networks model for 2D skeleton-based human action recognition
    Weng, Libo
    Lou, Weidong
    Shen, Xin
    Gao, Fei
    [J]. IET IMAGE PROCESSING, 2023, 17 (03) : 773 - 783
  • [10] Human Action Recognition with 3D Convolutional Neural Network
    Lima, Tiago
    Fernandes, Bruno
    Barros, Pablo
    [J]. 2017 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2017,