3D PostureNet: A unified framework for skeleton-based posture recognition

被引：23

作者：

Liu, Jianbo ^{[1
,2
]}

Wang, Ying ^{[1
]}

Liu, Yongcheng ^{[1
,2
]}

Xiang, Shiming ^{[1
]}

Pan, Chunhong ^{[1
]}

机构：

[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China

来源：

PATTERN RECOGNITION LETTERS | 2020年 / 140卷 / 140期

基金：

中国国家自然科学基金;

关键词：

Human posture recognition; Static hand gesture recognition; Skeleton-based; 3D convolutional neural network; SYSTEM;

D O I：

10.1016/j.patrec.2020.09.029

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Image-based posture recognition is a very challenging problem as it is difficult to acquire rich 3D information from postures in 2D images. Existing methods founded on 3D skeleton cues could alleviate this issue, but they are not particularly efficient due to the application of handcrafted features and traditional classifiers. This paper presents a novel and unified framework for skeleton-based posture recognition, applying powerful 3D Convolutional Neural Network (CNN) to this issue. Technically, bounding-box-based normalization for the raw skeleton data is proposed to eliminate the coordinate differences caused by diverse recording environments and posture displacements. Moreover, Gaussian voxelization for the skeleton is employed to expressively represent the posture configuration. Thereby, an end-to-end framework based on 3D CNN, called 3D PostureNet, is developed for robust posture recognition. To verify its effectiveness, a large-scale writing posture dataset is created and released in this work, including 113,400 samples of 30 subjects with 15 postures. Extensive experiments on the public MSRA hand gesture dataset, body pose dataset and the proposed writing posture dataset demonstrate that 3D PostureNet achieves significantly superior performance on both skeleton-based human posture and hand posture recognition tasks. (C) 2020 Elsevier B.V. All rights reserved.

引用

页码：143 / 149

页数：7

共 50 条

[41] Focus on temporal graph convolutional networks with unified attention for skeleton-based action recognition
Gao, Bing-Kun
Dong, Le
Bi, Hong-Bo
Bi, Yun-Ze
APPLIED INTELLIGENCE, 2022, 52 (05) : 5608 - 5616
[42] Focus on temporal graph convolutional networks with unified attention for skeleton-based action recognition
Bing-Kun Gao
Le Dong
Hong-Bo Bi
Yun-Ze Bi
Applied Intelligence, 2022, 52 : 5608 - 5616
[43] Automatic 3D Skeleton-based Dynamic Hand Gesture Recognition Using Multi-Layer Convolutional LSTM
Mohammed, Adam A. Q.
Gao, Yuan
Ji, Zhilong
Lv, Jiancheng
Islam, Md Sajjatul
Sang, Yongsheng
2021 7TH INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE, ICRAI 2021, 2021, : 8 - 14
[44] A convolutional autoencoder model with weighted multi-scale attention modules for 3D skeleton-based action recognition
Khezerlou, F.
Baradarani, A.
Balafar, M. A.
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 92
[45] A Fast Positioning Algorithm Based on 3D Posture Recognition
Xu, Xianbing
Peng, Chengbin
Xiao, Jiangjian
Jing, Huimin
Wu, Xiaojie
INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING, 2018, 11266 : 360 - 370
[46] Auto-Learning-GCN: An Ingenious Framework for Skeleton-Based Action Recognition
Xin, Wentian
Liu, Yi
Liu, Ruyi
Miao, Qiguang
Shi, Cheng
Pun, Chi-Man
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 29 - 42
[47] Convolutional neural network with adaptive inferential framework for skeleton-based action recognition
Huang, Hong'en
Su, Hang
Chang, Zhigang
Yu, Mingyang
Gao, Jialin
Li, Xinzhe
Zheng, Shibao
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 73
[48] Skeleton-based canonical forms for non-rigid 3D shape retrieval
David Pickup
Xianfang Sun
Paul L.Rosin
Ralph R.Martin
Computational Visual Media, 2016, 2 (03) : 231 - 243
[49] Skeleton-Based 3D Tracking of Multiple Fish From Two Orthogonal Views
Qian, Zhiming
Shi, Meiling
Wang, Meijiao
Cun, Tianrui
COMPUTER VISION, PT I, 2017, 771 : 25 - 36
[50] Skeleton-based canonical forms for non-rigid 3D shape retrieval
Pickup D.
Sun X.
Rosin P.L.
Martin R.R.
Computational Visual Media, 2016, 2 (3) : 231 - 243

← 1 2 3 4 5 →