MetaVD: A Meta Video Dataset for enhancing human action recognition datasets

被引:6
|
作者
Yoshikawa, Yuya [1 ]
Shigeto, Yutaro [1 ]
Takeuchi, Akikazu [1 ]
机构
[1] Chiba Inst Technol, Software Technol & Artificial Intelligence Res La, Chiba, Japan
关键词
Human action recognition; Video datasets;
D O I
10.1016/j.cviu.2021.103276
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Numerous practical datasets have been developed to recognize human actions from videos. However, many of them were constructed by collecting videos within a limited domain; thus, a model trained using one of the existing datasets often fails to classify videos in a different domain accurately. A possible solution for this drawback is to enhance the domain of each action label, i.e., to import videos associated with a given action label from the other datasets, and then, to train a model using the enhanced dataset. To realize this solution, we constructed a meta video dataset from the existing datasets for human action recognition, referred to as MetaVD. MetaVD comprises six popular human action recognition datasets, which we integrated by annotating 568,015 relation labels in total. These relation labels reflect equality, similarity, and hierarchy between action labels of the original datasets. We further present simple yet effective dataset enhancement methods using MetaVD, which are useful for training models with higher generalization performance, as established by experiments on human action classification. As a further contribution of MetaVD, we show that its analysis can provide useful insight into the datasets.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] A survey of video datasets for human action and activity recognition
    Chaquet, Jose M.
    Carmona, Enrique J.
    Fernandez-Caballero, Antonio
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (06) : 633 - 659
  • [2] Human action recognition approaches with video datasets-A survey
    Ozyer, Tansel
    Ak, Duygu Selin
    Alhajj, Reda
    KNOWLEDGE-BASED SYSTEMS, 2021, 222
  • [3] Exploring Action Recognition in Endoscopy Video Datasets
    Tian, Yuchen
    Paheding, Sidike
    Azimi, Ehsan
    Lee, Eung-Joo
    REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2024, 2024, 13034
  • [4] Human action recognition on depth dataset
    Gao, Zan
    Zhang, Hua
    Liu, Anan A.
    Xu, Guangping
    Xue, Yanbing
    NEURAL COMPUTING & APPLICATIONS, 2016, 27 (07): : 2047 - 2054
  • [5] Human action recognition on depth dataset
    Zan Gao
    Hua Zhang
    Anan A. Liu
    Guangping Xu
    Yanbing Xue
    Neural Computing and Applications, 2016, 27 : 2047 - 2054
  • [6] Video benchmarks of human action datasets: a review
    Tej Singh
    Dinesh Kumar Vishwakarma
    Artificial Intelligence Review, 2019, 52 : 1107 - 1154
  • [7] Video benchmarks of human action datasets: a review
    Singh, Tej
    Vishwakarma, Dinesh Kumar
    ARTIFICIAL INTELLIGENCE REVIEW, 2019, 52 (02) : 1107 - 1154
  • [8] Human Action Recognition in Video
    Singh, Dushyant Kumar
    ADVANCED INFORMATICS FOR COMPUTING RESEARCH, ICAICR 2018, PT I, 2019, 955 : 54 - 66
  • [9] A Survey on Video Action Recognition in Sports: Datasets, Methods and Applications
    Wu, Fei
    Wang, Qingzhong
    Bian, Jiang
    Ding, Ning
    Lu, Feixiang
    Cheng, Jun
    Dou, Dejing
    Xiong, Haoyi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7943 - 7966
  • [10] A survey on video-based Human Action Recognition: recent updates, datasets, challenges, and applications
    Preksha Pareek
    Ankit Thakkar
    Artificial Intelligence Review, 2021, 54 : 2259 - 2322