3D CNN for Human Action Recognition

被引:4
|
作者
Boualia, Sameh Neili [1 ,2 ]
Ben Amara, Najoua Essoukri [2 ]
机构
[1] Univ Tunis El Manar, Natl Engn Sch Tunis, Tunis 1002, Tunisia
[2] Univ Sousse, Ecole Natl Ingn Sousse, LATIS Lab Adv Technol & Intelligent Syst, Sousse 4023, Tunisia
关键词
Human Action Recognition; Deep Learning; 3D CNN;
D O I
10.1109/SSD52085.2021.9429429
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recognizing different human actions from still images or videos is an important research area in the computer vision and artificial intelligence domains. It represents a key step for a wide range of applications including: human-computer interaction, ambient assisted living, intelligent driving and video surveillance. However, unless the many research works being involved, there are still many challenges ahead including: the high changes in human body shapes, clothing and viewpoint changes and the conditions of system acquisition (illumination variations, occlusions, etc). With the emergence of new deep learning techniques, many approaches are recently proposed for Human Action Recognition (HAR). Compared with conventional machine learning methods, deep learning techniques have more powerful learning ability. The most wide-spread deep learning approach is the Convolutional Neural Network (CNN/ConvNets). It has shown remarkable achievements due to its precision and robustness. As a branch of neural network, 3D CNN is a relatively new technique in the field of deep learning. In this paper, we propose a HAR approach based on a 3D CNN modet We apply the developed model to recognize human actions of KTH and J-HMDB datasets, and we achieve state of the art performance in comparison to baseline methods.
引用
收藏
页码:276 / 282
页数:7
相关论文
共 50 条
  • [41] Ensembling 3D CNN Framework for Video Recognition
    Huang, Ruolin
    Dong, Hongbin
    Yin, Guisheng
    Fu, Qiang
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [42] Fuzzy Integral-Based CNN Classifier Fusion for 3D Skeleton Action Recognition
    Banerjee, Avinandan
    Singh, Pawan Kumar
    Sarkar, Ram
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (06) : 2206 - 2216
  • [43] Separable 3D residual attention network for human action recognition
    Zhang, Zufan
    Peng, Yue
    Gan, Chenquan
    Abate, Andrea Francesco
    Zhu, Lianxiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (04) : 5435 - 5453
  • [44] Human Action Recognition Based on Quaternion 3D Skeleton Representation
    Xu Haiyang
    Kong Jun
    Jiang Min
    LASER & OPTOELECTRONICS PROGRESS, 2018, 55 (02)
  • [45] Exploring 3D Human Action Recognition: from Offline to Online
    Li, Rui
    Liu, Zhenyu
    Tan, Jianrong
    SENSORS, 2018, 18 (02)
  • [46] Local Surface Geometric Feature for 3D human action recognition
    Zhang, Erhu
    Chen, Wanjun
    Zhang, Zhuomin
    Zhang, Yan
    NEUROCOMPUTING, 2016, 208 : 281 - 289
  • [47] Automatic Key Pose Selection for 3D Human Action Recognition
    Gong, Wenjuan
    Bagdanov, Andrew D.
    Xavier Roca, F.
    Gonzalez, Jordi
    ARTICULATED MOTION AND DEFORMABLE OBJECTS, PROCEEDINGS, 2010, 6169 : 290 - 299
  • [48] Graph Regularized Implicit Pose for 3D Human Action Recognition
    Kerola, Tommi
    Inoue, Nakamasa
    Shinoda, Koichi
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [49] Separable 3D residual attention network for human action recognition
    Zufan Zhang
    Yue Peng
    Chenquan Gan
    Andrea Francesco Abate
    Lianxiang Zhu
    Multimedia Tools and Applications, 2023, 82 : 5435 - 5453
  • [50] Part-wise Spatio-temporal Attention Driven CNN-based 3D Human Action Recognition
    Dhiman, Chhavi
    Vishwakarma, Dinesh Kumar
    Agarwal, Paras
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (03)