CNN-based and DTW features for human activity recognition on depth maps

被引:13
|
作者
Trelinski, Jacek [1 ]
Kwolek, Bogdan [1 ]
机构
[1] AGH Univ Sci & Technol, Dept Comp Sci, 30 Mickiewicza Av,Bldg D-17, PL-30059 Krakow, Poland
来源
NEURAL COMPUTING & APPLICATIONS | 2021年 / 33卷 / 21期
关键词
Convolutional neural networks; Multivariate time-series; Ensembles; Depth-based human action recognition;
D O I
10.1007/s00521-021-06097-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we present a new algorithm for human action recognition on raw depth maps. At the beginning, for each class we train a separate one-against-all convolutional neural network (CNN) to extract class-specific features representing person shape. Each class-specific, multivariate time-series is processed by a Siamese multichannel 1D CNN or a multichannel 1D CNN to determine features representing actions. Afterwards, for the nonzero pixels representing the person shape in each depth map we calculate statistical features. On multivariate time-series of such features we determine Dynamic Time Warping (DTW) features. They are determined on the basis of DTW distances between all training time-series. Finally, each class-specific feature vector is concatenated with the DTW feature vector. For each action category we train a multiclass classifier, which predicts probability distribution of class labels. From pool of such classifiers we select a number of classifiers such that an ensemble built on them achieves the best classification accuracy. Action recognition is performed by a soft voting ensemble that averages distributions calculated by such classifiers with the largest discriminative power. We demonstrate experimentally that on MSR-Action3D and UTD-MHAD datasets the proposed algorithm attains promising results and outperforms several state-of-the-art depth-based algorithms.
引用
收藏
页码:14551 / 14563
页数:13
相关论文
共 50 条
  • [1] CNN-based and DTW features for human activity recognition on depth maps
    Jacek Trelinski
    Bogdan Kwolek
    Neural Computing and Applications, 2021, 33 : 14551 - 14563
  • [2] Embedded Features for 1D CNN-based Action Recognition on Depth Maps
    Trelinski, Jacek
    Kwolek, Bogdan
    VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 4: VISAPP, 2021, : 536 - 543
  • [3] CNN-BASED ACTION RECOGNITION USING ADAPTIVE MULTISCALE DEPTH MOTION MAPS AND STABLE JOINT DISTANCE MAPS
    He, Junyou
    Xia, Hailun
    Feng, Chunyan
    Chu, Yunfei
    2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 439 - 443
  • [4] CNN-based Sensor Fusion Techniques for Multimodal Human Activity Recognition
    Muenzner, Sebastian
    Schmidt, Philip
    Reiss, Attila
    Hanselmann, Michael
    Stiefelhagen, Rainer
    Duerichen, Robert
    PROCEEDINGS OF THE 2017 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS (ISWC 17), 2017, : 158 - 165
  • [5] Hybrid Facial Emotion Recognition Using CNN-Based Features
    Shahzad, H. M.
    Bhatti, Sohail Masood
    Jaffar, Arfan
    Akram, Sheeraz
    Alhajlah, Mousa
    Mahmood, Awais
    APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [6] CNNBiF: CNN-based Bigram Features for Named Entity Recognition
    Sung, Chul
    Goel, Vaibhava
    Marcheret, Etienne
    Rennie, Steven J.
    Nahamoo, David
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1016 - 1021
  • [7] CNN-BASED TEMPLATE MATCHING FOR DETECTING FEATURES FROM HISTORICAL MAPS
    Xia, Xue
    Heitzler, Magnus
    Hurni, Lorenz
    XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 43-B2 : 1167 - 1173
  • [8] CNN-Based Multimodal Human Recognition in Surveillance Environments
    Koo, Ja Hyung
    Cho, Se Woon
    Baek, Na Rae
    Kim, Min Cheol
    Park, Kang Ryoung
    SENSORS, 2018, 18 (09)
  • [9] Beyond Human Recognition: A CNN-Based Framework for Handwritten Character Recognition
    Chen, Li
    Wang, Song
    Fan, Wei
    Sun, Jun
    Naoi, Satoshi
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 695 - 699
  • [10] Static, Dynamic and Acceleration Features for CNN-Based Speech Emotion Recognition
    Khalifa, Intissar
    Ejbali, Ridha
    Napoletano, Paolo
    Schettini, Raimondo
    Zaied, Mourad
    AIXIA 2021 - ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 13196 : 348 - 358