Human activity recognition in RGB-D videos by dynamic images

被引:0
|
作者
Snehasis Mukherjee
Leburu Anvitha
T. Mohana Lahari
机构
[1] Indian Institute of Information Technology SriCity,
来源
关键词
RGB-D; Activity recognition; Dynamic image; Resnet; Gestalt based perception;
D O I
暂无
中图分类号
学科分类号
摘要
Human Activity Recognition in RGB-D videos has been an active research topic during the last decade. However, only a few efforts have been made, for recognizing human activity in RGB-D videos where several performers are performing simultaneously. In this paper we introduce such a challenging dataset with several performers performing the activities simultaniously. We present a novel method for recognizing human activities performed simultaniously in the same videos. The proposed method aims in capturing the motion information of the whole video by producing a dynamic image corresponding to the input video. We use two parallel ResNet-101 architectures to produce the dynamic images for the RGB video and depth video separately. The dynamic images contain only the motion information of the whole frame, which is the main cue for analyzing the motion of the performer during action. Hence, dynamic images help recognizing human action by concentrating only on the motion information appeared on the frame. We send the two dynamic images through a fully connected layer for classification of activity. The proposed dynamic image reduces the complexity of the recognition process by extracting a sparse matrix from a video, while preserving the motion information required for activity recognition, and produces comparable results with respect to the state-of-the-art.
引用
收藏
页码:19787 / 19801
页数:14
相关论文
共 50 条
  • [31] Learning human activities and object affordances from RGB-D videos
    Koppula, Hema Swetha
    Gupta, Rudhir
    Saxena, Ashutosh
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (08): : 951 - 970
  • [32] Sparse Composition of Body Poses and Atomic Actions for Human Activity Recognition in RGB-D Videos (vol 59, pg 63, 2017)
    Lillo, Ivan
    Niebles, Juan Carlos
    Soto, Alvaro
    [J]. IMAGE AND VISION COMPUTING, 2017, 66 : 48 - 48
  • [33] Pose-Invariant Face Recognition via RGB-D Images
    Sang, Gaoli
    Li, Jing
    Zhao, Qijun
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016
  • [34] Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images
    Gupta, Saurabh
    Arbelaez, Pablo
    Malik, Jitendra
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 564 - 571
  • [35] Perception Subsystem for Object Recognition and Pose Estimation in RGB-D Images
    Kornuta, Tomasz
    Laszkowski, Michal
    [J]. CHALLENGES IN AUTOMATION, ROBOTICS AND MEASUREMENT TECHNIQUES, 2016, 440 : 597 - 607
  • [36] Jointly Learning Heterogeneous Features for RGB-D Activity Recognition
    Hu, Jian-Fang
    Zheng, Wei-Shi
    Lai, Jianhuang
    Zhang, Jianguo
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 5344 - 5352
  • [37] Social Activity Recognition on Continuous RGB-D Video Sequences
    Coppola, Claudio
    Cosar, Serhan
    Faria, Diego R.
    Bellotto, Nicola
    [J]. INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2020, 12 (01) : 201 - 215
  • [38] Jointly Learning Heterogeneous Features for RGB-D Activity Recognition
    Hu, Jian-Fang
    Zheng, Wei-Shi
    Lai, Jianhuang
    Zhang, Jianguo
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (11) : 2186 - 2200
  • [39] INTER PERSON ACTIVITY RECOGNITION USING RGB-D DATA
    Sardeshmukh, M. M.
    Kolte, M. T.
    Sardeshmukh, V. M.
    [J]. JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2020, 15 (06): : 3601 - 3614
  • [40] Social Activity Recognition on Continuous RGB-D Video Sequences
    Claudio Coppola
    Serhan Cosar
    Diego R. Faria
    Nicola Bellotto
    [J]. International Journal of Social Robotics, 2020, 12 : 201 - 215