A Quad Joint Relational Feature for 3D Skeletal Action Recognition with Circular CNNs

被引:1
|
作者
Kishore, P. V. V. [1 ]
Perera, Darshika G. [2 ]
Kumar, M. Tej A. Kiran [1 ]
Kumar, D. Anil [1 ]
Kumar, E. Kiran [1 ]
机构
[1] KLEF Deemed Univ, Dept Elect & Commun Engn, Guntur, Andhra Pradesh, India
[2] Univ Colorado, Dept Elect & Comp Engn, Colorado Springs, CO 80933 USA
关键词
3D human action recognition; circular CNNs; joint volume features; geometric 3D feature maps; 3D motion capture;
D O I
10.1109/iscas45731.2020.9180732
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
To deal with the limitations of human action recognition systems that apply deep neural networks (DNNs) to 3D skeletal feature maps, we propose an improved set of features that enable better pattern discrimination when using a spectrally enriched circular convolutional neural network (CCNN). These new features exploit the local relationships between joint movements based on 3D quadrilaterals constructed for all possible sets of four joints. Next, we compute the volumes of these time-varying quadrilaterals, by generating color-coded images, named spatio-temporal quad-joint relative volume feature maps (QjRVMs). To preserve the pixel frequency distribution while training a DNN, which is otherwise lost due to vanishing gradients and random dropouts, we propose a new architecture CCNNs. CCNNs use cyclic multi-resolution filters in a four-stream architecture, requiring only batch normalization and ReLU operations to identify multiple pixel pattern variations simultaneously. Applying the proposed CCNN to QjRVM images illustrates that combining multi-resolution features enhances the overall classification accuracy. Finally, we evaluate our proposed human action framework using our own 102-class, 5-subject action dataset, created using 3D motion capture technology, named KLHA3D-102. We also evaluate our framework using 3 publicly available datasets: CMU, HDM05, and NTU RGB-D.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Action Recognition Using Deep 3D CNNs with Sequential Feature Aggregation and Attention
    Anvarov, Fazliddin
    Kim, Dae Ha
    Song, Byung Cheol
    ELECTRONICS, 2020, 9 (01)
  • [2] 3D CNNs on Distance Matrices for Human Action Recognition
    Hernandez Ruiz, Alejandro
    Porzi, Lorenzo
    Bulo, Samuel Rota
    Moreno-Noguer, Francesc
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1087 - 1095
  • [3] Spatiotemporal Multimodal Learning With 3D CNNs for Video Action Recognition
    Wu, Hanbo
    Ma, Xin
    Li, Yibin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) : 1250 - 1261
  • [4] Joint movement similarities for robust 3D action recognition using skeletal data
    Pazhoumand-Dar, Hossein
    Lam, Chiou-Peng
    Masek, Martin
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2015, 30 : 10 - 21
  • [5] Action recognition using kinematics posture feature on 3D skeleton joint locations
    Ahad, Md Atiqur Rahman
    Ahmed, Masud
    Das Antar, Anindya
    Makihara, Yasushi
    Yagi, Yasushi
    PATTERN RECOGNITION LETTERS, 2021, 145 (145) : 216 - 224
  • [6] 3D SPARSE QUANTIZATION FOR FEATURE LEARNING IN ACTION RECOGNITION
    Zhao, Yang
    Cheng, Hong
    Yang, Lu
    2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 263 - 267
  • [7] A New Feature Descriptor for 3D Human Action Recognition
    Asadi-Aghbolaghi, Maryam
    Ramezanpour, Sadegh
    Kasaei, Shohreh
    2014 22ND IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2014, : 1157 - 1161
  • [8] 3D CNNs with Adaptive Temporal Feature Resolutions
    Fayyaz, Mohsen
    Bahrami, Emad
    Diba, Ali
    Noroozi, Mehdi
    Adeli, Ehsan
    Van Gool, Luc
    Gall, Juergen
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4729 - 4738
  • [9] Learning 3D Skeletal Representation From Transformer for Action Recognition
    Cha, Junuk
    Saqlain, Muhammad
    Kim, Donguk
    Lee, Seungeun
    Lee, Seongyeong
    Baek, Seungryul
    IEEE ACCESS, 2022, 10 : 67541 - 67550
  • [10] A hybrid deep learning architecture using 3D CNNs and GRUs for human action recognition
    Savadi Hosseini M.
    Ghaderi F.
    International Journal of Engineering, Transactions B: Applications, 2020, 33 (05): : 959 - 965