ReadingAct RGB-D action dataset and human action recognition from local features

Cited by: 14
Authors
Chen, Lulu [1]
Wei, Hong [1]
Ferryman, James [1]
Affiliation
[1] University of Reading, School of Systems Engineering, Computational Vision Group, Reading RG6 6AY, Berkshire, England
Keywords
Human action recognition; Depth sensor; Spatio-temporal local features; Dynamic time warping; ReadingAct action dataset; DENSE;
DOI
10.1016/j.patrec.2013.09.004
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
For general home monitoring, a system should automatically interpret people's actions. The system should be non-intrusive and able to cope with cluttered backgrounds and loose clothing. An approach based on spatio-temporal local features and a Bag-of-Words (BoW) model is proposed for single-person action recognition from combined intensity and depth images. To restore the temporal structure lost in the traditional BoW method, a dynamic time alignment technique with temporal binning is applied, which has not previously been implemented in the literature for human action recognition on depth imagery. A novel human action dataset with depth data has been created using two Microsoft Kinect sensors. The ReadingAct dataset contains 20 subjects and 19 actions for a total of 2340 videos. To investigate the effect of using depth images and of the proposed method, testing was conducted on three depth datasets, and the proposed method was compared to traditional Bag-of-Words methods. Results showed that the proposed method improves recognition accuracy when depth is added to the conventional intensity data, and has advantages when dealing with long actions. (C) 2013 Elsevier B.V. All rights reserved.
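The abstract does not spell out how temporal binning and dynamic time alignment are combined, so the following is only a generic sketch of the idea, not the authors' implementation. It assumes each local feature has already been quantised to a visual-word index, splits a video into equal-width temporal bins, builds one L1-normalised BoW histogram per bin, and compares two videos with textbook dynamic time warping using a Euclidean cost. The function names, the equal-width binning, and the distance choice are all illustrative assumptions.

```python
from math import sqrt


def binned_histograms(features, n_frames, n_bins, vocab_size):
    """Build one BoW histogram per temporal bin.

    features: list of (frame_index, word_id) pairs for one video
    (hypothetical input format; word_id is the quantised local feature).
    """
    hists = [[0.0] * vocab_size for _ in range(n_bins)]
    for frame, word in features:
        # Equal-width bins over the video length (an assumption).
        b = min(frame * n_bins // n_frames, n_bins - 1)
        hists[b][word] += 1.0
    # L1-normalise every non-empty bin.
    for h in hists:
        total = sum(h)
        if total > 0:
            for i in range(vocab_size):
                h[i] /= total
    return hists


def euclidean(p, q):
    return sqrt(sum((x - y) ** 2 for x, y in zip(p, q)))


def dtw_distance(seq_a, seq_b):
    """Classic DTW cost between two sequences of histograms."""
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = euclidean(seq_a[i - 1], seq_b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # advance in seq_a only
                                 cost[i][j - 1],      # advance in seq_b only
                                 cost[i - 1][j - 1])  # advance in both
    return cost[n][m]


# Toy example with a two-word vocabulary.
slow = binned_histograms([(0, 0), (1, 0), (2, 0), (3, 0), (4, 1), (5, 1)], 6, 3, 2)
fast = binned_histograms([(0, 0), (1, 0), (2, 1), (3, 1)], 4, 2, 2)
reversed_order = binned_histograms([(0, 1), (1, 1), (2, 0), (3, 0)], 4, 2, 2)

same_action = dtw_distance(fast, slow)
different_order = dtw_distance(fast, reversed_order)
```

In this toy case `fast` and `reversed_order` contain the same words in equal proportion, so a single whole-video BoW histogram could not tell them apart, while the binned sequences differ and the alignment cost is non-zero. `fast` and `slow` show the same word order at different speeds and align at zero cost.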
Pages: 159-169
Page count: 11
Related papers
50 in total
  • [31] Discriminative Relational Representation Learning for RGB-D Action Recognition
    Kong, Yu
    Fu, Yun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (06) : 2856 - 2865
  • [32] Action Tube Extraction based 3D-CNN for RGB-D Action Recognition
    Xu, Zineng
    Vilaplana, Veronica
    Ramon Morros, Josep
    2018 16TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2018,
  • [33] RGB-D Data-Based Action Recognition: A Review
    Shaikh, Muhammad Bilal
    Chai, Douglas
    SENSORS, 2021, 21 (12)
  • [34] Bilinear Heterogeneous Information Machine for RGB-D Action Recognition
    Kong, Yu
    Fu, Yun
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 1054 - 1062
  • [35] A Robust Approach for Action Recognition Based on Spatio-Temporal Features in RGB-D Sequences
    Ly Quoc Ngoc
    Vo Hoai Viet
    Tran Thai Son
    Pham Minh Hoang
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (05) : 166 - 177
  • [36] Spatio-temporal feature extraction and representation for RGB-D human action recognition
    Luo, Jiajia
    Wang, Wei
    Qi, Hairong
    PATTERN RECOGNITION LETTERS, 2014, 50 : 139 - 148
  • [37] Human Action Recognition Using RGB-D Sensor and Deep Convolutional Neural Networks
    Imran, Javed
    Kumar, Praveen
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 144 - 148
  • [38] LEARNED SPATIO-TEMPORAL TEXTURE DESCRIPTORS FOR RGB-D HUMAN ACTION RECOGNITION
    Zhai, Zhengyuan
    Fan, Chunxiao
    Ming, Yue
    COMPUTING AND INFORMATICS, 2018, 37 (06) : 1339 - 1362
  • [39] Trear: Transformer-Based RGB-D Egocentric Action Recognition
    Li, Xiangyu
    Hou, Yonghong
    Wang, Pichao
    Gao, Zhimin
    Xu, Mingliang
    Li, Wanqing
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (01) : 246 - 252
  • [40] Structure-Preserving Binary Representations for RGB-D Action Recognition
    Yu, Mengyang
    Liu, Li
    Shao, Ling
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (08) : 1651 - 1664