Deep Learning for Human Action Recognition: A Comprehensive Review

被引:3
|
作者
Duc-Quang Vu [1 ,2 ]
Trang Phung Thi Thu [3 ]
Ngan Le [4 ]
Wang, Jia-Ching [1 ]
机构
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taoyuan, Taiwan
[2] Thai Nguyen Univ Educ, Thai Nguyen, Vietnam
[3] Thai Nguyen Univ, Thai Nguyen, Vietnam
[4] Univ Arkansas, Dept Comp Sci & Comp Engn, Fayetteville, AR 72701 USA
关键词
Action recognition; supervised learning; self-supervised learning; deep learning; deep neural networks; NETWORKS;
D O I
10.1561/116.00000068
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Over the past several years, we have witnessed remarkable progress in numerous computer vision applications, particularly in human activity analysis. Human action recognition, which aims to automatically examine and recognize the actions taking place in the video, has been widely applied in many applications. This paper presents a comprehensive survey of approaches and techniques in deep learning-based human activity analysis. First, we introduce the problem definition in action recognition together with its challenges. Second, we provide a comprehensive survey of feature representation methods. Third, we categorize human activity methodologies and discuss their advantages and limitations. In particular, we divide human action recognition into three main categories according to training mechanisms, i.e., supervised learning, semi-supervised learning, and self-supervised learning. We further analyze the existing network architectures, their performance, and source code availability for each main category. Fourth, we provide a detailed analysis of the existing, publicly available datasets, including small-scale and large-scale datasets for human action recognition. Finally, we discuss some open issues and future research directions.
引用
收藏
页数:40
相关论文
共 50 条
  • [31] Deep learning-based EEG emotion recognition: a comprehensive review
    Yuxiao Geng
    Shuo Shi
    Xiaoke Hao
    Neural Computing and Applications, 2025, 37 (4) : 1919 - 1950
  • [32] Deep learning pipelines for recognition of gait biometrics with covariates: a comprehensive review
    Parashar, Anubha
    Parashar, Apoorva
    Ding, Weiping
    Shekhawat, Rajveer S. S.
    Rida, Imad
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (08) : 8889 - 8953
  • [33] Deep learning-based welding image recognition: A comprehensive review
    Liu, Tianyuan
    Zheng, Pai
    Bao, Jinsong
    JOURNAL OF MANUFACTURING SYSTEMS, 2023, 68 : 601 - 625
  • [34] Deep Learning Approaches for Continuous Sign Language Recognition: A Comprehensive Review
    Khan, Asma
    Jin, Seyong
    Lee, Geon-Hee
    Arzu, Gul E.
    Dang, L. Minh
    Nguyen, Tan N.
    Choi, Woong
    Moon, Hyeonjoon
    IEEE ACCESS, 2025, 13 : 55524 - 55544
  • [35] Correction: Machine learning for human emotion recognition: a comprehensive review
    Eman M. G. Younis
    Someya Mohsen
    Essam H. Houssein
    Osman Ali Sadek Ibrahim
    Neural Computing and Applications, 2024, 36 : 8949 - 8949
  • [36] A Review of Machine Learning and Deep Learning for Object Detection, Semantic Segmentation, and Human Action Recognition in Machine and Robotic Vision
    Manakitsa, Nikoleta
    Maraslidis, George S.
    Moysis, Lazaros
    Fragulis, George F.
    TECHNOLOGIES, 2024, 12 (02)
  • [37] REALISTIC HUMAN ACTION RECOGNITION: WHEN DEEP LEARNING MEETS VLAD
    Zhang, Lei
    Feng, Yangyang
    Han, Jiqing
    Zhen, Xiantong
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 1352 - 1356
  • [38] Human Action Recognition using Computer Vision and Deep Learning Techniques
    Ganta, Suresh
    Desu, Devi Sri
    Golla, Aishwarya
    Kumar, M. Ashok
    2023 ADVANCED COMPUTING AND COMMUNICATION TECHNOLOGIES FOR HIGH PERFORMANCE APPLICATIONS, ACCTHPA, 2023,
  • [39] A Deep Learning Approach for Human Action Recognition Using Skeletal Information
    Mathe, Eirini
    Maniatis, Apostolos
    Spyrou, Evaggelos
    Mylonas, Phivos
    GENEDIS 2018: COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 1194 : 105 - 114
  • [40] Learning a Deep Model for Human Action Recognition from Novel Viewpoints
    Rahmani, Hossein
    Mian, Ajmal
    Shah, Mubarak
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (03) : 667 - 681