Depth Pooling Based Large-Scale 3-D Action Recognition With Convolutional Neural Networks

被引:125
|
作者
Wang, Pichao [1 ]
Li, Wanqing [1 ]
Gao, Zhimin [1 ]
Tang, Chang [2 ]
Ogunbona, Philip O. [1 ]
机构
[1] Univ Wollongong, Adv Multimedia Res Lab, Wollongong, NSW 2522, Australia
[2] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Hubei, Peoples R China
关键词
Large-scale; depth; action recognition; convolutional neural networks; GESTURE RECOGNITION;
D O I
10.1109/TMM.2018.2818329
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes three simple, compact yet effective representations of depth sequences, referred to respectively as dynamic depth images (DDI), dynamic depth normal images (DDNI), and dynamic depth motion normal images (DDMNI), for both isolated and continuous action recognition. These dynamic images are constructed from a segmented sequence of depth maps using hierarchical bidirectional rank pooling to effectively capture the spatial-temporal information. Specifically, DDI exploits the dynamics of postures over time, and DDNI and DDMNI exploit the 3-D structural information captured by depth maps. Upon the proposed representations, a convolutional neural network (ConvNet)-based method is developed for action recognition. The image-based representations enable us to fine-tune the existing ConvNet models trained on image data without training a large number of parameters from scratch. The proposed method achieved the state-of-art results on three large datasets, namely, the large-scale continuous gesture recognition dataset (means the Jaccard index 0.4109), the large-scale isolated gesture recognition dataset (59.21%), and the NTU RGB+D dataset (87.08% cross-subject and 84.22% cross-view) even though only the depth modality was used.
引用
收藏
页码:1051 / 1061
页数:11
相关论文
共 50 条
  • [31] An efficient attention module for 3d convolutional neural networks in action recognition
    Jiang, Guanghao
    Jiang, Xiaoyan
    Fang, Zhijun
    Chen, Shanshan
    [J]. APPLIED INTELLIGENCE, 2021, 51 (10) : 7043 - 7057
  • [32] Basketball technique action recognition using 3D convolutional neural networks
    Wang, Jingfei
    Zuo, Liang
    Martinez, Carlos Cordente
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01):
  • [33] AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos
    Kar, Amlan
    Rai, Nishant
    Sikka, Karan
    Sharma, Gaurav
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5699 - 5708
  • [34] 3D ACTION RECOGNITION USING DATA VISUALIZATION AND CONVOLUTIONAL NEURAL NETWORKS
    Liu, Mengyuan
    Chen, Chen
    Liu, Hong
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 925 - 930
  • [35] An efficient attention module for 3d convolutional neural networks in action recognition
    Guanghao Jiang
    Xiaoyan Jiang
    Zhijun Fang
    Shanshan Chen
    [J]. Applied Intelligence, 2021, 51 : 7043 - 7057
  • [36] Training Convolutional Neural Network for Sketch Recognition on Large-Scale Dataset
    Zhou, Wen
    Jia, Jinyuan
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2020, 17 (01) : 82 - 89
  • [37] Convolutional Neural Network-Based Action Recognition on Depth Maps
    Trelinski, Jacek
    Kwolek, Bogdan
    [J]. COMPUTER VISION AND GRAPHICS ( ICCVG 2018), 2018, 11114 : 209 - 221
  • [38] Performance Comparison and Analysis for Large-Scale Crowd Counting Based on Convolutional Neural Networks
    Alotaibi, Reem
    Alzahrani, Bander
    Wang, Rui
    Alafif, Tarik
    Barnawi, Ahmed
    Hu, Long
    [J]. IEEE ACCESS, 2020, 8 : 204425 - 204432
  • [39] A High Performance FPGA-based Accelerator for Large-Scale Convolutional Neural Networks
    Li, Huimin
    Fan, Xitian
    Jiao, Li
    Cao, Wei
    Zhou, Xuegong
    Wang, Lingli
    [J]. 2016 26TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2016,
  • [40] Weighted pooling for image recognition of deep convolutional neural networks
    Zhu, Xiaoning
    Meng, Qingyue
    Ding, Bojian
    Gu, Lize
    Yang, Yixian
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 4): : S9371 - S9383