Deep learning and RGB-D based human action, human-human and human-object interaction recognition: A survey?

被引：0

作者：

Khaire, Pushpajit ^{[1
]}

Kumar, Praveen ^{[1
]}

机构：

[1] Visvesvaraya Natl Inst Technol, Dept Comp Sci & Engn, Nagpur, India

来源：

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION | 2022年 / 86卷

关键词：

Human action recognition; CNN; LSTM; Human-human interaction; Human-object interaction; Deep learning; RGB-D sensors; Multi-modality; Fusion; Skeleton; GCN; FLOW ESTIMATION; NEURAL-NETWORK; SEQUENCES; STREAMS;

D O I：

10.1016/j.jvcir.2022.103531

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Human activity recognition is one of the most studied topics in the field of computer vision. In recent years, with the availability of RGB-D sensors and powerful deep learning techniques, research on human activity recognition has gained momentum. From simple human atomic actions, the research has advanced towards recognizing more complex human activities using RGB-D data. This paper presents a comprehensive survey of the advanced deep learning based recognition methods and categorizes them in human atomic action, human-human interaction, human-object interaction. The reviewed methods are further classified based on the individual modality used for recognition i.e. RGB based, depth based, skeleton based, and hybrid. We also review and categorize recent challenging RGB-D datasets for the same. In addition, the paper also briefly reviews RGB-D datasets and methods for online activity recognition. The paper concludes with a discussion on limitations, challenges, and recent trends for promising future directions.

引用

页数：25

共 50 条

[1] Deep learning and RGB-D based human action, human–human and human–object interaction recognition: A survey
Khaire, Pushpajit
Kumar, Praveen
[J]. Journal of Visual Communication and Image Representation, 2022, 86
[2] RGB-D sensing based human action and interaction analysis: A survey
Liu, Bangli
Cai, Haibin
Ju, Zhaojie
Liu, Honghai
[J]. PATTERN RECOGNITION, 2019, 94 : 1 - 12
[3] Human-Object Interaction Detection: A Survey of Deep Learning-Based Methods
Li, Fang
Wang, Shunli
Wang, Shuaiping
Zhang, Lihua
[J]. ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 441 - 452
[4] Distillation of human-object interaction contexts for action recognition
Almushyti, Muna
Li, Frederick W. B.
[J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2022, 33 (05)
[5] Viewpoint Invariant RGB-D Human Action Recognition
Liu, Jian
Akhtar, Naveed
Mian, Ajmal
[J]. 2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 261 - 268
[6] Cascaded Human-Object Interaction Recognition
Zhou, Tianfei
Wang, Wenguan
Qi, Siyuan
Ling, Haibin
Shen, Jianbing
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4262 - 4271
[7] Human action recognition in RGB-D videos using motion sequence information and deep learning
Ijjina, Earnest Paul
Chalavadi, Krishna Mohan
[J]. PATTERN RECOGNITION, 2017, 72 : 504 - 516
[8] RGB-D-based human motion recognition with deep learning: A survey
Wang, Pichao
Li, Wanqing
Ogunbona, Philip
Wan, Jun
Escalera, Sergio
[J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 171 : 118 - 139
[9] A Survey of Human-Object Interaction Detection
Gong, Xun
Zhang, Zhiying
Liu, Lu
Ma, Bing
Wu, Kunlun
[J]. Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2022, 57 (04): : 693 - 704
[10] Fusion of Skeleton and RGB Features for RGB-D Human Action Recognition
Weiyao, Xu
Muqing, Wu
Min, Zhao
Ting, Xia
[J]. IEEE SENSORS JOURNAL, 2021, 21 (17) : 19157 - 19164

← 1 2 3 4 5 →