A Quad Joint Relational Feature for 3D Skeletal Action Recognition with Circular CNNs

被引：1

作者：

Kishore, P. V. V. ^{[1
]}

Perera, Darshika G. ^{[2
]}

Kumar, M. Tej A. Kiran ^{[1
]}

Kumar, D. Anil ^{[1
]}

Kumar, E. Kiran ^{[1
]}

机构：

[1] KLEF Deemed Univ, Dept Elect & Commun Engn, Guntur, Andhra Pradesh, India

[2] Univ Colorado, Dept Elect & Comp Engn, Colorado Springs, CO 80933 USA

来源：

2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS) | 2020年

关键词：

3D human action recognition; circular CNNs; joint volume features; geometric 3D feature maps; 3D motion capture;

D O I：

10.1109/iscas45731.2020.9180732

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

To deal with the limitations of human action recognition systems that apply deep neural networks (DNNs) to 3D skeletal feature maps, we propose an improved set of features that enable better pattern discrimination when using a spectrally enriched circular convolutional neural network (CCNN). These new features exploit the local relationships between joint movements based on 3D quadrilaterals constructed for all possible sets of four joints. Next, we compute the volumes of these time-varying quadrilaterals, by generating color-coded images, named spatio-temporal quad-joint relative volume feature maps (QjRVMs). To preserve the pixel frequency distribution while training a DNN, which is otherwise lost due to vanishing gradients and random dropouts, we propose a new architecture CCNNs. CCNNs use cyclic multi-resolution filters in a four-stream architecture, requiring only batch normalization and ReLU operations to identify multiple pixel pattern variations simultaneously. Applying the proposed CCNN to QjRVM images illustrates that combining multi-resolution features enhances the overall classification accuracy. Finally, we evaluate our proposed human action framework using our own 102-class, 5-subject action dataset, created using 3D motion capture technology, named KLHA3D-102. We also evaluate our framework using 3 publicly available datasets: CMU, HDM05, and NTU RGB-D.

引用

页数：5

共 50 条

[21] Recognizing Human Actions Using 3D Skeletal Information and CNNs
Papadakis, Antonios
Mathe, Eirini
Vernikos, Ioannis
Maniatis, Apostolos
Spyrou, Evaggelos
Mylonas, Phivos
ENGINEERING APPLICATIONS OF NEURAL NETWORKSX, 2019, 1000 : 511 - 521
[22] 3D mixed CNNs with edge-point feature learning
Du, Zijin
Ye, Hailiang
Cao, Feilong
KNOWLEDGE-BASED SYSTEMS, 2021, 221
[23] 3D TRAJECTORIES FOR ACTION RECOGNITION
Koperski, Michal
Bilinski, Piotr
Bremond, Francois
2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 4176 - 4180
[24] Sign language recognition based on lightweight 3D CNNs and Transformer
Lu F.
Han X.
Cheng X.
Tian G.
Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2023, 51 (05): : 13 - 18
[25] Gestures recognition based on multimodal fusion by using 3D CNNs
Zhu, Yimin
Gao, Qing
Shi, Hongyan
Liu, Jinguo
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (01) : 1647 - 1661
[26] Efficient 3D CNNs with knowledge transfer for sign language recognition
Xiangzu Han
Fei Lu
Guohui Tian
Multimedia Tools and Applications, 2022, 81 : 10071 - 10090
[27] 3D object recognition and pose with relational indexing
Costa, MS
COMPUTER VISION AND IMAGE UNDERSTANDING, 2000, 79 (03) : 364 - 407
[28] Spatio-Temporal Features in Action Recognition Using 3D Skeletal Joints
Trascau, Mihai
Nan, Mihai
Florea, Adina Magda
SENSORS, 2019, 19 (02)
[29] AFE-CNN: 3D Skeleton-based Action Recognition with Action Feature Enhancement
Guan, Shannan
Lu, Haiyan
Zhu, Linchao
Fang, Gengfa
NEUROCOMPUTING, 2022, 514 : 256 - 267
[30] Efficient 3D CNNs with knowledge transfer for sign language recognition
Han, Xiangzu
Lu, Fei
Tian, Guohui
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (07) : 10071 - 10090

← 1 2 3 4 5 →