A Quad Joint Relational Feature for 3D Skeletal Action Recognition with Circular CNNs

被引:1
|
作者
Kishore, P. V. V. [1 ]
Perera, Darshika G. [2 ]
Kumar, M. Tej A. Kiran [1 ]
Kumar, D. Anil [1 ]
Kumar, E. Kiran [1 ]
机构
[1] KLEF Deemed Univ, Dept Elect & Commun Engn, Guntur, Andhra Pradesh, India
[2] Univ Colorado, Dept Elect & Comp Engn, Colorado Springs, CO 80933 USA
关键词
3D human action recognition; circular CNNs; joint volume features; geometric 3D feature maps; 3D motion capture;
D O I
10.1109/iscas45731.2020.9180732
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
To deal with the limitations of human action recognition systems that apply deep neural networks (DNNs) to 3D skeletal feature maps, we propose an improved set of features that enable better pattern discrimination when using a spectrally enriched circular convolutional neural network (CCNN). These new features exploit the local relationships between joint movements based on 3D quadrilaterals constructed for all possible sets of four joints. Next, we compute the volumes of these time-varying quadrilaterals, by generating color-coded images, named spatio-temporal quad-joint relative volume feature maps (QjRVMs). To preserve the pixel frequency distribution while training a DNN, which is otherwise lost due to vanishing gradients and random dropouts, we propose a new architecture CCNNs. CCNNs use cyclic multi-resolution filters in a four-stream architecture, requiring only batch normalization and ReLU operations to identify multiple pixel pattern variations simultaneously. Applying the proposed CCNN to QjRVM images illustrates that combining multi-resolution features enhances the overall classification accuracy. Finally, we evaluate our proposed human action framework using our own 102-class, 5-subject action dataset, created using 3D motion capture technology, named KLHA3D-102. We also evaluate our framework using 3 publicly available datasets: CMU, HDM05, and NTU RGB-D.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Recognizing Human Actions Using 3D Skeletal Information and CNNs
    Papadakis, Antonios
    Mathe, Eirini
    Vernikos, Ioannis
    Maniatis, Apostolos
    Spyrou, Evaggelos
    Mylonas, Phivos
    ENGINEERING APPLICATIONS OF NEURAL NETWORKSX, 2019, 1000 : 511 - 521
  • [22] 3D mixed CNNs with edge-point feature learning
    Du, Zijin
    Ye, Hailiang
    Cao, Feilong
    KNOWLEDGE-BASED SYSTEMS, 2021, 221
  • [23] 3D TRAJECTORIES FOR ACTION RECOGNITION
    Koperski, Michal
    Bilinski, Piotr
    Bremond, Francois
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 4176 - 4180
  • [24] Sign language recognition based on lightweight 3D CNNs and Transformer
    Lu F.
    Han X.
    Cheng X.
    Tian G.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2023, 51 (05): : 13 - 18
  • [25] Gestures recognition based on multimodal fusion by using 3D CNNs
    Zhu, Yimin
    Gao, Qing
    Shi, Hongyan
    Liu, Jinguo
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (01) : 1647 - 1661
  • [26] Efficient 3D CNNs with knowledge transfer for sign language recognition
    Xiangzu Han
    Fei Lu
    Guohui Tian
    Multimedia Tools and Applications, 2022, 81 : 10071 - 10090
  • [27] 3D object recognition and pose with relational indexing
    Costa, MS
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2000, 79 (03) : 364 - 407
  • [28] Spatio-Temporal Features in Action Recognition Using 3D Skeletal Joints
    Trascau, Mihai
    Nan, Mihai
    Florea, Adina Magda
    SENSORS, 2019, 19 (02)
  • [29] AFE-CNN: 3D Skeleton-based Action Recognition with Action Feature Enhancement
    Guan, Shannan
    Lu, Haiyan
    Zhu, Linchao
    Fang, Gengfa
    NEUROCOMPUTING, 2022, 514 : 256 - 267
  • [30] Efficient 3D CNNs with knowledge transfer for sign language recognition
    Han, Xiangzu
    Lu, Fei
    Tian, Guohui
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (07) : 10071 - 10090