SkeletonNet: Mining Deep Part Features for 3-D Action Recognition

被引:135
|
作者
Ke, Qiuhong [1 ]
An, Senjian [1 ]
Bennamoun, Mohammed [1 ]
Sohel, Ferdous [2 ]
Boussaid, Farid [3 ]
机构
[1] Univ Western Australia, Sch Comp Sci & Software Engn, Crawley, WA 6009, Australia
[2] Murdoch Univ, Sch Engn & Informat Technol, Murdoch, WA 6150, Australia
[3] Univ Western Australia, Sch Elect Elect & Comp Engn, Crawley, WA 6009, Australia
基金
澳大利亚研究理事会;
关键词
Convolutional neural networks (CNNs); robust features; 3-D action recognition; REAL-TIME; TRACKING; RGB;
D O I
10.1109/LSP.2017.2690339
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This letter presents SkeletonNet, a deep learning framework for skeleton-based 3-D action recognition. Given a skeleton sequence, the spatial structure of the skeleton joints in each frame and the temporal information between multiple frames are two important factors for action recognition. We first extract body-part-based features from each frame of the skeleton sequence. Compared to the original coordinates of the skeleton joints, the proposed features are translation, rotation, and scale invariant. To learn robust temporal information, instead of treating the features of all frames as a time series, we transform the features into images and feed them to the proposed deep learning network, which contains two parts: one to extract general features from the input images, while the other to generate a discriminative and compact representation for action recognition. The proposed method is tested on the SBU kinect interaction dataset, the CMU dataset, and the large-scale NTU RGB+D dataset and achieves state-of-the-art performance.
引用
收藏
页码:731 / 735
页数:5
相关论文
共 50 条
  • [1] Deep 3D Flow Features for Human Action Recognition
    Psaltis, Athanasios
    Papadopoulos, Georgios Th
    Daras, Petros
    2018 16TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2018,
  • [2] Encoding Pose Features to Images With Data Augmentation for 3-D Action Recognition
    Huynh-The, Thien
    Hua, Cam-Hao
    Kim, Dong-Seong
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (05) : 3100 - 3111
  • [3] Body Joint Guided 3-D Deep Convolutional Descriptors for Action Recognition
    Cao, Congqi
    Zhang, Yifan
    Zhang, Chunjie
    Lu, Hanqing
    IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (03) : 1095 - 1108
  • [4] Multiview-Based 3-D Action Recognition Using Deep Networks
    Li, Chuankun
    Hou, Yonghong
    Wang, Pichao
    Li, Wanqing
    IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2019, 49 (01) : 95 - 104
  • [5] Invariant features for 3-D gesture recognition
    Campbell, LW
    Becker, DA
    Azarbayejani, A
    Bobick, AF
    Pentland, A
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, 1996, : 157 - 162
  • [6] 3-D Face Recognition Using Curvelet Local Features
    Elaiwat, S.
    Bennamoun, M.
    Boussaid, F.
    El-Sallam, A.
    IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (02) : 172 - 175
  • [7] THE USE OF FRACTAL FEATURES FOR RECOGNITION OF 3-D DISCHARGE PATTERNS
    KRIVDA, A
    GULSKI, E
    SATISH, L
    ZAENGL, WS
    IEEE TRANSACTIONS ON DIELECTRICS AND ELECTRICAL INSULATION, 1995, 2 (05) : 889 - 892
  • [8] 3-D Palmprint Recognition With Joint Line and Orientation Features
    Li, Wei
    Zhang, David
    Zhang, Lei
    Lu, Guangming
    Yan, Jingqi
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2011, 41 (02): : 274 - 279
  • [9] 3-D face recognition: features, databases, algorithms and challenges
    Hemprasad Patil
    Ashwin Kothari
    Kishor Bhurchandi
    Artificial Intelligence Review, 2015, 44 : 393 - 441
  • [10] Learning the Spherical Harmonic Features for 3-D Face Recognition
    Liu, Peijiang
    Wang, Yunhong
    Huang, Di
    Zhang, Zhaoxiang
    Chen, Liming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (03) : 914 - 925