Depth-based end-to-end deep network for human action recognition

被引:20
|
作者
Chaudhary, Sachin [1 ]
Murala, Subrahmanyam [1 ]
机构
[1] IIT Ropar, Comp Vis & Pattern Recognit Lab, Rupnagar, Punjab, India
关键词
motion estimation; image recognition; depth-based end-to-end deep network; single channel depth map; human action recognition; single image depth estimation; depth-based HAR algorithms; depth motion map estimation; foreground motion segregation; SIDE; frame-wise depth estimation; RGB frame processing; HMDB51 benchmark datasets; UCF101 benchmark datasets; JHMDB benchmark datasets; FUSION;
D O I
10.1049/iet-cvi.2018.5020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognition of human actions from videos can be improved if depth information is available. Depth information certainly helps in segregating foreground motion from the background. Single image depth estimation (SIDE) is a commonly used method for the analysis of weather degraded images. In this study, the idea of SIDE is extended to human action recognition (HAR) on datasets where depth information is not available. Several depth-based HAR algorithms are available but all of them are using the depth information given with the dataset. Some other methods are using depth motion map which refers to the depth of motion in a temporal direction. Here, a new depth-based end-to-end deep network is proposed for HAR in which the frame-wise depth is estimated and this estimated depth is used for processing instead of RGB frame. As colour information is not required for estimating motion, a single channel depth map is used for estimating motion in the video. It makes the system computationally efficient. The proposed method is tested and verified on three benchmark datasets namely JHMDB, HMDB51 and UCF101. The proposed method outperforms the existing state-of-the-art methods for HAR on all the three tested datasets.
引用
收藏
页码:15 / 22
页数:8
相关论文
共 50 条
  • [1] END-TO-END LEARNING OF DEEP CONVOLUTIONAL NEURAL NETWORK FOR 3D HUMAN ACTION RECOGNITION
    Li, Chao
    Sun, Shouqian
    Min, Xin
    Lin, Wenqian
    Nie, Binling
    Zhang, Xianfu
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
  • [2] Sample Fusion Network: An End-to-End Data Augmentation Network for Skeleton-Based Human Action Recognition
    Meng, Fanyang
    Liu, Hong
    Liang, Yongsheng
    Tu, Juanhui
    Liu, Mengyuan
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (11) : 5281 - 5295
  • [3] End-to-end temporal attention extraction and human action recognition
    Zhang, Hong
    Xin, Miao
    Wang, Shuhang
    Yang, Yifan
    Zhang, Lei
    Wang, Helong
    [J]. MACHINE VISION AND APPLICATIONS, 2018, 29 (07) : 1127 - 1142
  • [4] End-to-end temporal attention extraction and human action recognition
    Hong Zhang
    Miao Xin
    Shuhang Wang
    Yifan Yang
    Lei Zhang
    Helong Wang
    [J]. Machine Vision and Applications, 2018, 29 : 1127 - 1142
  • [5] End-to-end residual learning-based deep neural network model deployment for human activity recognition
    Alok Negi
    Krishan Kumar
    [J]. International Journal of Multimedia Information Retrieval, 2023, 12
  • [6] End-to-end residual learning-based deep neural network model deployment for human activity recognition
    Negi, Alok
    Kumar, Krishan
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2023, 12 (01)
  • [7] FOSNet: An End-to-End Trainable Deep Neural Network for Scene Recognition
    Seong, Hongje
    Hyun, Junhyuk
    Kim, Euntai
    [J]. IEEE ACCESS, 2020, 8 (08) : 82066 - 82077
  • [8] An End-to-end Transient Recognition Method for VSC-HVDC Based on Deep Belief Network
    Guomin Luo
    Jiaxin Hei
    Changyuan Yao
    Jinghan He
    Meng Li
    [J]. Journal of Modern Power Systems and Clean Energy, 2020, 8 (06) : 1070 - 1079
  • [9] An End-to-end Transient Recognition Method for VSC-HVDC Based on Deep Belief Network
    Luo, Guomin
    Hei, Jiaxin
    Yao, Changyuan
    He, Jinghan
    Li, Meng
    [J]. JOURNAL OF MODERN POWER SYSTEMS AND CLEAN ENERGY, 2020, 8 (06) : 1070 - 1079
  • [10] End-to-End Speech Emotion Recognition Based on Neural Network
    Zhu, Bing
    Zhou, Wenkai
    Wang, Yutian
    Wang, Hui
    Cai, Juan Juan
    [J]. 2017 17TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT 2017), 2017, : 1634 - 1638