Human activity recognition in RGB-D videos by dynamic images

被引:0
|
作者
Snehasis Mukherjee
Leburu Anvitha
T. Mohana Lahari
机构
[1] Indian Institute of Information Technology SriCity,
来源
关键词
RGB-D; Activity recognition; Dynamic image; Resnet; Gestalt based perception;
D O I
暂无
中图分类号
学科分类号
摘要
Human Activity Recognition in RGB-D videos has been an active research topic during the last decade. However, only a few efforts have been made, for recognizing human activity in RGB-D videos where several performers are performing simultaneously. In this paper we introduce such a challenging dataset with several performers performing the activities simultaniously. We present a novel method for recognizing human activities performed simultaniously in the same videos. The proposed method aims in capturing the motion information of the whole video by producing a dynamic image corresponding to the input video. We use two parallel ResNet-101 architectures to produce the dynamic images for the RGB video and depth video separately. The dynamic images contain only the motion information of the whole frame, which is the main cue for analyzing the motion of the performer during action. Hence, dynamic images help recognizing human action by concentrating only on the motion information appeared on the frame. We send the two dynamic images through a fully connected layer for classification of activity. The proposed dynamic image reduces the complexity of the recognition process by extracting a sparse matrix from a video, while preserving the motion information required for activity recognition, and produces comparable results with respect to the state-of-the-art.
引用
收藏
页码:19787 / 19801
页数:14
相关论文
共 50 条
  • [21] Point-Based Object Recognition in RGB-D Images
    Wilkowski, Artur
    Kornuta, Tomasz
    Kasprzak, Wlodzimierz
    [J]. INTELLIGENT SYSTEMS'2014, VOL 2: TOOLS, ARCHITECTURES, SYSTEMS, APPLICATIONS, 2015, 323 : 593 - 604
  • [22] Fusion of Skeleton and RGB Features for RGB-D Human Action Recognition
    Weiyao, Xu
    Muqing, Wu
    Min, Zhao
    Ting, Xia
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (17) : 19157 - 19164
  • [23] Combining CNN streams of RGB-D and skeletal data for human activity recognition
    Khaire, Pushpajit
    Kumar, Praveen
    Imran, Javed
    [J]. PATTERN RECOGNITION LETTERS, 2018, 115 : 107 - 116
  • [24] Indoor Human Detection using RGB-D images
    Li, Baopu
    Jin, Haoyang
    Zhang, Qi
    Xia, Wei
    Li, Huiyun
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION (ICIA), 2016, : 1354 - 1360
  • [25] Temporal cues enhanced multimodal learning for action recognition in RGB-D videos
    Liu, Dan
    Meng, Fanrong
    Xia, Qing
    Ma, Zhiyuan
    Mi, Jinpeng
    Gan, Yan
    Ye, Mao
    Zhang, Jianwei
    [J]. NEUROCOMPUTING, 2024, 594
  • [26] Novel Human Action Recognition in RGB-D Videos Based on Powerful View Invariant Features Technique
    Mambou, Sebastien
    Krejcar, Ondrej
    Kuca, Kamil
    Selamat, Ali
    [J]. MODERN APPROACHES FOR INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2018, 769 : 343 - 353
  • [27] Viewpoint Invariant RGB-D Human Action Recognition
    Liu, Jian
    Akhtar, Naveed
    Mian, Ajmal
    [J]. 2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 261 - 268
  • [28] RGB-D Object Recognition Using the Knowledge Transferred from Relevant RGB Images
    Gao, Depeng
    Wu, Rui
    Liu, Jiafeng
    Huang, Qingcheng
    Tang, Xianglong
    Liu, Peng
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT VI, 2017, 10639 : 642 - 651
  • [29] Spatial-temporal texture features for 3D human activity recognition using laser-based RGB-D videos
    Ming, Yue
    Wang, Guangchao
    Hong, Xiaopeng
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (03): : 1595 - 1613
  • [30] Gesture Recognition of RGB and RGB-D Static Images Using Convolutional Neural Networks
    Khari, Manju
    Garg, Aditya Kumar
    Gonzalez Crespo, Ruben
    Verdu, Elena
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2019, 5 (07): : 22 - 27