MetaVD: A Meta Video Dataset for enhancing human action recognition datasets

被引:6
|
作者
Yoshikawa, Yuya [1 ]
Shigeto, Yutaro [1 ]
Takeuchi, Akikazu [1 ]
机构
[1] Chiba Inst Technol, Software Technol & Artificial Intelligence Res La, Chiba, Japan
关键词
Human action recognition; Video datasets;
D O I
10.1016/j.cviu.2021.103276
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Numerous practical datasets have been developed to recognize human actions from videos. However, many of them were constructed by collecting videos within a limited domain; thus, a model trained using one of the existing datasets often fails to classify videos in a different domain accurately. A possible solution for this drawback is to enhance the domain of each action label, i.e., to import videos associated with a given action label from the other datasets, and then, to train a model using the enhanced dataset. To realize this solution, we constructed a meta video dataset from the existing datasets for human action recognition, referred to as MetaVD. MetaVD comprises six popular human action recognition datasets, which we integrated by annotating 568,015 relation labels in total. These relation labels reflect equality, similarity, and hierarchy between action labels of the original datasets. We further present simple yet effective dataset enhancement methods using MetaVD, which are useful for training models with higher generalization performance, as established by experiments on human action classification. As a further contribution of MetaVD, we show that its analysis can provide useful insight into the datasets.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] 3-D Dataset for Human Activity Recognition in Video Surveillance
    Sardsehmukh, M. M.
    Kolte, M. T.
    Chatur, P. N.
    Chaudhari, D. S.
    2014 IEEE GLOBAL CONFERENCE ON WIRELESS COMPUTING AND NETWORKING (GCWCN), 2014, : 75 - 78
  • [32] Enhancing Human Action Recognition through Temporal Saliency
    Adeli, Vida
    Fazl-Ersi, Ehsan
    Harati, Ahad
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (ICPRAI 2018), 2018, : 176 - 181
  • [33] Human Action Recognition Technology in Dance Video Image
    Qiao, Lei
    Shen, QiuHao
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [34] Motion Feature Combination for Human Action Recognition in Video
    Meng, Hongying
    Pears, Nick
    Bailey, Chris
    COMPUTER VISION AND COMPUTER GRAPHICS, 2008, 21 : 151 - +
  • [35] A DISTRIBUTION BASED VIDEO REPRESENTATION FOR HUMAN ACTION RECOGNITION
    Song, Yan
    Tang, Sheng
    Zheng, Yan-Tao
    Chua, Tat-Seng
    Zhang, Yongdong
    Lin, Shouxun
    2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 772 - 777
  • [36] Human Action Recognition on Simple and Complex Background in Video
    Tuan Le-Viet
    Ngoc Ly-Quoc
    2012 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (ICCAIS), 2012, : 114 - 119
  • [37] Analysis of CNN Architectures for Human Action Recognition in Video
    Silva, David
    Manzo-Martinez, Alain
    Gaxiola, Fernando
    Gonzalez-Gurrola, Luis
    Ramirez-Alonso, Graciela
    COMPUTACION Y SISTEMAS, 2022, 26 (02): : 623 - 641
  • [38] Temporal segment dropout for human action video recognition
    Zhang, Yu
    Chen, Zhengjie
    Xu, Tianyu
    Zhao, Junjie
    Mi, Siya
    Geng, Xin
    Zhang, Min-Ling
    PATTERN RECOGNITION, 2024, 146
  • [39] On the Effects of Low Video Quality in Human Action Recognition
    See, John
    Rahman, Saimunur
    2015 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2015, : 574 - 581
  • [40] Human Body Articulation for Action Recognition in Video Sequences
    Thi, Tuan Hue
    Lu, Sijun
    Zhang, Jian
    Cheng, Li
    Wang, Li
    AVSS: 2009 6TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, 2009, : 92 - +