Combining CNN streams of dynamic image and depth data for action recognition

被引:0
|
作者
Roshan Singh
Rajat Khurana
Alok Kumar Singh Kushwaha
Rajeev Srivastava
机构
[1] IIT (BHU),Department of Computer Science and Engineering
[2] IKG Punjab Technical University,Department of Computer Science and Engineering
来源
Multimedia Systems | 2020年 / 26卷
关键词
Human activity recognition; RGB-D; CNN; VGG; Multi-stream CNN models; Transfer learning;
D O I
暂无
中图分类号
学科分类号
摘要
RGB-D sensors have been in great demand due to its capability of producing large amount of multimodal data like RGB images and depth maps, useful for better training of deep learning models. In this paper, a deep learning model for recognizing human activities in a video sequence by combining multiple CNN streams has been proposed. The proposed work comprises the use of dynamic images generated from RGB images and depth map for three different dimensions. The proposed model is trained using these four streams on VGG Net for action recognition purpose. Further, it is evaluated and compared with the other state-of-the-art methods available in literature, on three challenging datasets, namely MSR daily Activity, UTD MHAD and CAD 60, in terms of accuracy, error, recall, specificity, precision and f-score. From obtained results, it has been observed that the proposed method outperforms other methods.
引用
收藏
页码:313 / 322
页数:9
相关论文
共 50 条
  • [1] Combining CNN streams of dynamic image and depth data for action recognition
    Singh, Roshan
    Khurana, Rajat
    Kushwaha, Alok Kumar Singh
    Srivastava, Rajeev
    MULTIMEDIA SYSTEMS, 2020, 26 (03) : 313 - 322
  • [2] Fusion of spatial and dynamic CNN streams for action recognition
    Newlin Shebiah Russel
    Arivazhagan Selvaraj
    Multimedia Systems, 2021, 27 : 969 - 984
  • [3] Fusion of spatial and dynamic CNN streams for action recognition
    Russel, Newlin Shebiah
    Selvaraj, Arivazhagan
    MULTIMEDIA SYSTEMS, 2021, 27 (05) : 969 - 984
  • [4] Combining CNN streams of RGB-D and skeletal data for human activity recognition
    Khaire, Pushpajit
    Kumar, Praveen
    Imran, Javed
    PATTERN RECOGNITION LETTERS, 2018, 115 : 107 - 116
  • [5] Sign recognition using depth image streams
    Honda Research Institute USA, Mountain View, CA
    不详
    Proc. Int. Conf. Autom. Face Gesture Recog., (381-386):
  • [6] Sign recognition using depth image streams
    Fujimura, Kikuo
    Liu, Xia
    PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION - PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE, 2006, : 381 - +
  • [7] Dynamic human object recognition by combining color and depth information with a clothing image histogram
    Wang, Yen-Han
    Wang, Tzu-Wei
    Yen, Jia-Yush
    Wang, Fu-Cheng
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2019, 16 (01)
  • [8] Action Recognition Based on Depth Image Sequence
    Liao, Liangcan
    Cao, Guitao
    Cao, Wenming
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 1583 - 1587
  • [9] Action recognition using optimized deep autoencoder and CNN for surveillance data streams of non-stationary environments
    Ullah, Amin
    Muhammad, Khan
    Ul Haq, Ijaz
    Baik, Sung Wook
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 96 : 386 - 397
  • [10] Action Recognition with Dynamic Image Networks
    Bilen, Hakan
    Fernando, Basura
    Gavves, Efstratios
    Vedaldi, Andrea
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (12) : 2799 - 2813