Deep learning and RGB-D based human action, human-human and human-object interaction recognition: A survey?

被引:0
|
作者
Khaire, Pushpajit [1 ]
Kumar, Praveen [1 ]
机构
[1] Visvesvaraya Natl Inst Technol, Dept Comp Sci & Engn, Nagpur, India
关键词
Human action recognition; CNN; LSTM; Human-human interaction; Human-object interaction; Deep learning; RGB-D sensors; Multi-modality; Fusion; Skeleton; GCN; FLOW ESTIMATION; NEURAL-NETWORK; SEQUENCES; STREAMS;
D O I
10.1016/j.jvcir.2022.103531
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human activity recognition is one of the most studied topics in the field of computer vision. In recent years, with the availability of RGB-D sensors and powerful deep learning techniques, research on human activity recognition has gained momentum. From simple human atomic actions, the research has advanced towards recognizing more complex human activities using RGB-D data. This paper presents a comprehensive survey of the advanced deep learning based recognition methods and categorizes them in human atomic action, human-human interaction, human-object interaction. The reviewed methods are further classified based on the individual modality used for recognition i.e. RGB based, depth based, skeleton based, and hybrid. We also review and categorize recent challenging RGB-D datasets for the same. In addition, the paper also briefly reviews RGB-D datasets and methods for online activity recognition. The paper concludes with a discussion on limitations, challenges, and recent trends for promising future directions.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Deep learning and RGB-D based human action, human–human and human–object interaction recognition: A survey
    Khaire, Pushpajit
    Kumar, Praveen
    [J]. Journal of Visual Communication and Image Representation, 2022, 86
  • [2] RGB-D sensing based human action and interaction analysis: A survey
    Liu, Bangli
    Cai, Haibin
    Ju, Zhaojie
    Liu, Honghai
    [J]. PATTERN RECOGNITION, 2019, 94 : 1 - 12
  • [3] Human-Object Interaction Detection: A Survey of Deep Learning-Based Methods
    Li, Fang
    Wang, Shunli
    Wang, Shuaiping
    Zhang, Lihua
    [J]. ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 441 - 452
  • [4] Distillation of human-object interaction contexts for action recognition
    Almushyti, Muna
    Li, Frederick W. B.
    [J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2022, 33 (05)
  • [5] Viewpoint Invariant RGB-D Human Action Recognition
    Liu, Jian
    Akhtar, Naveed
    Mian, Ajmal
    [J]. 2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 261 - 268
  • [6] Cascaded Human-Object Interaction Recognition
    Zhou, Tianfei
    Wang, Wenguan
    Qi, Siyuan
    Ling, Haibin
    Shen, Jianbing
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4262 - 4271
  • [7] Human action recognition in RGB-D videos using motion sequence information and deep learning
    Ijjina, Earnest Paul
    Chalavadi, Krishna Mohan
    [J]. PATTERN RECOGNITION, 2017, 72 : 504 - 516
  • [8] RGB-D-based human motion recognition with deep learning: A survey
    Wang, Pichao
    Li, Wanqing
    Ogunbona, Philip
    Wan, Jun
    Escalera, Sergio
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 171 : 118 - 139
  • [9] A Survey of Human-Object Interaction Detection
    Gong, Xun
    Zhang, Zhiying
    Liu, Lu
    Ma, Bing
    Wu, Kunlun
    [J]. Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2022, 57 (04): : 693 - 704
  • [10] Fusion of Skeleton and RGB Features for RGB-D Human Action Recognition
    Weiyao, Xu
    Muqing, Wu
    Min, Zhao
    Ting, Xia
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (17) : 19157 - 19164