Collecting public RGB-D datasets for human daily activity recognition

Cited by: 7
Authors
Wu, Hanbo [1]
Ma, Xin [1]
Zhang, Zhimeng [1]
Wang, Haibo [1]
Li, Yibin [1]
Affiliations
[1] Shandong Univ, Sch Control Sci & Engn, 17923 Jingshi Rd, Jinan, Shandong, Peoples R China
Keywords
Human daily activity recognition; public RGB-D data sets merging; large-scale RGB-D activity data set; depth motion maps; depth cuboid similarity feature; curvature space scale; OBJECT RECOGNITION; FUSION; MODEL;
DOI
10.1177/1729881417709079
Chinese Library Classification number
TP24 [Robotics];
Discipline classification code
080202; 1405;
Abstract
Human daily activity recognition has been an active research topic in computer vision for many decades. Despite considerable effort, activity recognition in natural, uncontrolled settings remains a challenging problem. Recently, RGB-D cameras, which perceive depth and visual cues simultaneously, have greatly improved the performance of activity recognition. However, owing to practical difficulties, the publicly available RGB-D data sets are not sufficiently large for benchmarking when considering the diversity of their activities, subjects, and backgrounds. This severely limits the applicability of complicated learning-based recognition approaches. To address the issue, this article provides a large-scale RGB-D activity data set by merging five public RGB-D data sets that differ from each other in many aspects, such as length of actions, nationality of subjects, and camera angles. The data set comprises 4528 samples depicting 7 action categories (up to 46 subcategories) performed by 74 subjects. To verify how challenging the data set is, three feature representation methods are evaluated: depth motion maps, the spatiotemporal depth cuboid similarity feature, and curvature space scale. Results show that the merged large-scale data set is more realistic and challenging and therefore more suitable for benchmarking.
Pages: 1-12
Number of pages: 12