Collecting public RGB-D datasets for human daily activity recognition

被引:7
|
作者
Wu, Hanbo [1 ]
Ma, Xin [1 ]
Zhang, Zhimeng [1 ]
Wang, Haibo [1 ]
Li, Yibin [1 ]
机构
[1] Shandong Univ, Sch Control Sci & Engn, 17923 Jingshi Rd, Jinan, Shandong, Peoples R China
来源
关键词
Human daily activity recognition; public RGB-D data sets merging; large-scale RGB-D activity data set; depth motion maps; depth cuboid similarity feature; curvature space scale; OBJECT RECOGNITION; FUSION; MODEL;
D O I
10.1177/1729881417709079
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Human daily activity recognition has been a hot spot in the field of computer vision for many decades. Despite best efforts, activity recognition in naturally uncontrolled settings remains a challenging problem. Recently, by being able to perceive depth and visual cues simultaneously, RGB-D cameras greatly boost the performance of activity recognition. However, due to some practical difficulties, the publicly available RGB-D data sets are not sufficiently large for benchmarking when considering the diversity of their activities, subjects, and background. This severely affects the applicability of complicated learning-based recognition approaches. To address the issue, this article provides a large-scale RGB-D activity data set by merging five public RGB-D data sets that differ from each other on many aspects such as length of actions, nationality of subjects, or camera angles. This data set comprises 4528 samples depicting 7 action categories (up to 46 subcategories) performed by 74 subjects. To verify the challengeness of the data set, three feature representation methods are evaluated, which are depth motion maps, spatiotemporal depth cuboid similarity feature, and curvature space scale. Results show that the merged large-scale data set is more realistic and challenging and therefore more suitable for benchmarking.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [41] Automation of "Ground Truth" Annotation for Multi-View RGB-D Object Instance Recognition Datasets
    Aldoma, Aitor
    Faeulhammer, Thomas
    Vincze, Markus
    2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014), 2014, : 5016 - 5023
  • [42] Learning Coupled Classifiers with RGB images for RGB-D object recognition
    Li, Xiao
    Fang, Min
    Zhang, Ju-Jie
    Wu, Jinqiao
    PATTERN RECOGNITION, 2017, 61 : 433 - 446
  • [43] Coupled hidden conditional random fields for RGB-D human action recognition
    Liu, An-An
    Nie, Wei-Zhi
    Su, Yu-Ting
    Ma, Li
    Hao, Tong
    Yang, Zhao-Xuan
    SIGNAL PROCESSING, 2015, 112 : 74 - 82
  • [44] Human Action Recognition Based on RGB-D and Local Interactive Regions Detection
    Liu, Suolan
    Kong, Lizhi
    PROCEEDINGS OF THE 2018 8TH INTERNATIONAL CONFERENCE ON SOCIAL SCIENCE AND EDUCATION RESEARCH (SSER 2018), 2018, 238 : 81 - 85
  • [45] Evaluating fusion of RGB-D and inertial sensors for multimodal human action recognition
    Javed Imran
    Balasubramanian Raman
    Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 189 - 208
  • [46] Recognition of Human Actions from RGB-D Videos Using a Reject Option
    Carletti, Vincenzo
    Foggia, Pasquale
    Percannella, Gennaro
    Saggese, Alessia
    Vento, Mario
    NEW TRENDS IN IMAGE ANALYSIS AND PROCESSING - ICIAP 2013, 2013, 8158 : 436 - 445
  • [47] Evolutionary joint selection to improve human action recognition with RGB-D devices
    Andre Chaaraoui, Alexandros
    Ramon Padilla-Lopez, Jose
    Climent-Perez, Pau
    Florez-Revuelta, Francisco
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (03) : 786 - 794
  • [48] Research on Human Body Recognition and Position Measurement Based on AdaBoost and RGB-D
    Jian, Zhuozhu
    Zhu, Fangcheng
    Tang, Lingxuan
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 5184 - 5189
  • [49] Evaluating fusion of RGB-D and inertial sensors for multimodal human action recognition
    Imran, Javed
    Raman, Balasubramanian
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (01) : 189 - 208
  • [50] 3D Texture Recognition for RGB-D Images
    Zhong, Guoqiang
    Mao, Xin
    Shi, Yaxin
    Dong, Junyu
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2015, PT II, 2015, 9257 : 518 - 528