Collecting public RGB-D datasets for human daily activity recognition

Cited by: 7
Authors
Wu, Hanbo [1]
Ma, Xin [1]
Zhang, Zhimeng [1]
Wang, Haibo [1]
Li, Yibin [1]
Affiliation
[1] Shandong Univ, Sch Control Sci & Engn, 17923 Jingshi Rd, Jinan, Shandong, Peoples R China
Source
Keywords
Human daily activity recognition; public RGB-D data sets merging; large-scale RGB-D activity data set; depth motion maps; depth cuboid similarity feature; curvature space scale; OBJECT RECOGNITION; FUSION; MODEL;
DOI
10.1177/1729881417709079
Chinese Library Classification number
TP24 [Robotics]
Discipline classification code
080202; 1405
Abstract
Human daily activity recognition has been an active research topic in computer vision for many decades. Despite sustained effort, activity recognition in natural, uncontrolled settings remains a challenging problem. Recently, RGB-D cameras, which perceive depth and visual cues simultaneously, have greatly improved activity recognition performance. However, owing to practical difficulties, the publicly available RGB-D data sets are not large enough for benchmarking when the diversity of their activities, subjects, and backgrounds is considered. This severely limits the applicability of complex learning-based recognition approaches. To address this issue, this article provides a large-scale RGB-D activity data set built by merging five public RGB-D data sets that differ from each other in many aspects, such as the length of actions, the nationality of subjects, and camera angles. The data set comprises 4528 samples depicting 7 action categories (up to 46 subcategories) performed by 74 subjects. To verify how challenging the data set is, three feature representation methods are evaluated: depth motion maps, the spatiotemporal depth cuboid similarity feature, and curvature space scale. Results show that the merged large-scale data set is more realistic and challenging and therefore more suitable for benchmarking.
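As a rough illustration of the first of the three evaluated features, the sketch below computes a front-view depth motion map by accumulating absolute differences between consecutive depth frames. It assumes the formulation commonly used in the depth motion map literature and is not taken from the article: the function name `front_view_dmm` and the synthetic clip are hypothetical, and the side-view and top-view projections that a full depth motion map descriptor would also use are omitted.

```python
import numpy as np


def front_view_dmm(depth_frames):
    """Accumulate absolute frame-to-frame depth differences (front view only).

    depth_frames: array-like of shape (T, H, W), one depth image per frame.
    Returns an (H, W) map in which larger values mark pixels with more motion.
    """
    frames = np.asarray(depth_frames, dtype=np.float64)
    if frames.ndim != 3 or frames.shape[0] < 2:
        raise ValueError("expected an array of shape (T, H, W) with T >= 2")
    # Difference between consecutive frames, then summed over time.
    return np.abs(np.diff(frames, axis=0)).sum(axis=0)


# Synthetic 3-frame clip (illustrative data, not from any data set):
# only the top-left pixel changes depth over time.
clip = np.zeros((3, 2, 2))
clip[1, 0, 0] = 5.0
clip[2, 0, 0] = 2.0

dmm = front_view_dmm(clip)
print(dmm)
```

In this toy clip only the top-left pixel moves, so it is the only non-zero entry of the resulting map; static background pixels contribute nothing.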
Pages: 1-12
Page count: 12