MetaVD: A Meta Video Dataset for enhancing human action recognition datasets

被引:6
|
作者
Yoshikawa, Yuya [1 ]
Shigeto, Yutaro [1 ]
Takeuchi, Akikazu [1 ]
机构
[1] Chiba Inst Technol, Software Technol & Artificial Intelligence Res La, Chiba, Japan
关键词
Human action recognition; Video datasets;
D O I
10.1016/j.cviu.2021.103276
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Numerous practical datasets have been developed to recognize human actions from videos. However, many of them were constructed by collecting videos within a limited domain; thus, a model trained using one of the existing datasets often fails to classify videos in a different domain accurately. A possible solution for this drawback is to enhance the domain of each action label, i.e., to import videos associated with a given action label from the other datasets, and then, to train a model using the enhanced dataset. To realize this solution, we constructed a meta video dataset from the existing datasets for human action recognition, referred to as MetaVD. MetaVD comprises six popular human action recognition datasets, which we integrated by annotating 568,015 relation labels in total. These relation labels reflect equality, similarity, and hierarchy between action labels of the original datasets. We further present simple yet effective dataset enhancement methods using MetaVD, which are useful for training models with higher generalization performance, as established by experiments on human action classification. As a further contribution of MetaVD, we show that its analysis can provide useful insight into the datasets.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] HUMAN ACTION RECOGNITION WITH OPTIMIZED VIDEO DENSELY SAMPLING
    Wang, Bin
    Liu, Yu
    Xiao, Wenhua
    Xiong, Zhihui
    Wang, Wei
    Zhang, Maojun
    2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013), 2013,
  • [42] Compact Video Analysis Human Action Recognition Approach
    Aly, Cherry Aly
    Abas, Fazly Salleh
    PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS (IEEE ICSIPA 2019), 2019, : 329 - 334
  • [43] Human Action Recognition in Surveillance Video of a Computer Laboratory
    Yussiff, Abdul-Lateef
    Yong, Suet Peng
    Baharudin, Baharum
    2016 3RD INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCOINS), 2016, : 418 - 423
  • [44] Firearm-related action recognition and object detection dataset for video surveillance systems
    Ruiz-Santaquiteria, Jesus
    Munoz, Juan D.
    Maigler, Francisco J.
    Deniz, Oscar
    Bueno, Gloria
    DATA IN BRIEF, 2024, 52
  • [45] HACS: Human Action Clips and Segments Dataset for Recognition and Temporal Localization
    Zhao, Hang
    Torralba, Antonio
    Torresani, Lorenzo
    Yan, Zhicheng
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8667 - 8677
  • [46] Enhancing Action Recognition in Vehicle Environments With Human Pose Information
    Konstantinou, Michaela
    Retsinas, George
    Maragos, Petros
    PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS, PETRA 2023, 2023, : 197 - 205
  • [47] A Novel Fuzzy HMM Approach for Human Action Recognition in Video
    Mozafari, Kourosh
    Charkari, Nasrollah Moghadam
    Boroujeni, Hamidreza Shayegh
    Behrouzifar, Mohammad
    KNOWLEDGE TECHNOLOGY, 2012, 295 : 184 - 193
  • [48] Multi-surface analysis for human action recognition in video
    Zhang, Hong-Bo
    Lei, Qing
    Zhong, Bi-Neng
    Du, Ji-Xiang
    Peng, Jialin
    Hsiao, Tsung-Chih
    Chen, Duan-Sheng
    SPRINGERPLUS, 2016, 5
  • [49] Evaluation of Human Action Recognition Techniques intended for Video Analytics
    Rashmi, S. R.
    Bhat, Shubha
    Sushmitha, V. C.
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES FOR SMART NATION (SMARTTECHCON), 2017, : 357 - 362
  • [50] Multi modal human action recognition for video content matching
    Jun Guo
    Hao Bai
    Zhanyong Tang
    Pengfei Xu
    Daguang Gan
    Baoying Liu
    Multimedia Tools and Applications, 2020, 79 : 34665 - 34683