3D PostureNet: A unified framework for skeleton-based posture recognition

被引:23
|
作者
Liu, Jianbo [1 ,2 ]
Wang, Ying [1 ]
Liu, Yongcheng [1 ,2 ]
Xiang, Shiming [1 ]
Pan, Chunhong [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Human posture recognition; Static hand gesture recognition; Skeleton-based; 3D convolutional neural network; SYSTEM;
D O I
10.1016/j.patrec.2020.09.029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image-based posture recognition is a very challenging problem as it is difficult to acquire rich 3D information from postures in 2D images. Existing methods founded on 3D skeleton cues could alleviate this issue, but they are not particularly efficient due to the application of handcrafted features and traditional classifiers. This paper presents a novel and unified framework for skeleton-based posture recognition, applying powerful 3D Convolutional Neural Network (CNN) to this issue. Technically, bounding-box-based normalization for the raw skeleton data is proposed to eliminate the coordinate differences caused by diverse recording environments and posture displacements. Moreover, Gaussian voxelization for the skeleton is employed to expressively represent the posture configuration. Thereby, an end-to-end framework based on 3D CNN, called 3D PostureNet, is developed for robust posture recognition. To verify its effectiveness, a large-scale writing posture dataset is created and released in this work, including 113,400 samples of 30 subjects with 15 postures. Extensive experiments on the public MSRA hand gesture dataset, body pose dataset and the proposed writing posture dataset demonstrate that 3D PostureNet achieves significantly superior performance on both skeleton-based human posture and hand posture recognition tasks. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:143 / 149
页数:7
相关论文
共 50 条
  • [31] Multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition
    Yu, Lubin
    Tian, Lianfang
    Du, Qiliang
    Bhutto, Jameel Ahmed
    APPLIED INTELLIGENCE, 2023, 53 (12) : 14838 - 14854
  • [32] Unsupervised 3D Skeleton-Based Action Recognition using Cross-Attention with Conditioned Generation Capabilities
    Lerch, David J.
    Zhong, Zeyun
    Martin, Manuel
    Voit, Michael
    Beyerer, Juergen
    2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 202 - 211
  • [33] Multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition
    Lubin Yu
    Lianfang Tian
    Qiliang Du
    Jameel Ahmed Bhutto
    Applied Intelligence, 2023, 53 : 14838 - 14854
  • [34] Behavior Recognition Based on 3D Skeleton Features
    Liu, W. T.
    Lu, T. W.
    Miao, S. J.
    Peng, L.
    Min, F.
    INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENVIRONMENTAL ENGINEERING (CSEE 2015), 2015, : 760 - 765
  • [35] Action recognition using kinematics posture feature on 3D skeleton joint locations
    Ahad, Md Atiqur Rahman
    Ahmed, Masud
    Das Antar, Anindya
    Makihara, Yasushi
    Yagi, Yasushi
    PATTERN RECOGNITION LETTERS, 2021, 145 (145) : 216 - 224
  • [36] Fast 3D-graph convolutional networks for skeleton-based action recognition
    Zhang, Guohao
    Wen, Shuhuan
    Li, Jiaqi
    Che, Haijun
    APPLIED SOFT COMPUTING, 2023, 145
  • [37] A Novel Skeleton-based Model with Spine for 3D Human Pose Estimation
    Li, Zhaoxu
    Liu, Sheng
    Bai, Jue
    Peng, Chenglei
    Li, Yang
    Du, Sidan
    2022 IEEE 12TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2022, : 501 - 506
  • [38] Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction
    Xu, Chenxin
    Tan, Robby T.
    Tan, Yuhong
    Chen, Siheng
    Wang, Xinchao
    Wang, Yanfeng
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9475 - 9486
  • [39] Revisiting Skeleton-based Action Recognition
    Duan, Haodong
    Zhao, Yue
    Chen, Kai
    Lin, Dahua
    Dai, Bo
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2959 - 2968
  • [40] View-invariant 3D Skeleton-based Human Activity Recognition based on Transformer and Spatio-temporal Features
    Snoun, Ahmed
    Bouchrika, Tahani
    Jemai, Olfa
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM), 2021, : 706 - 715