MMA: a multi-view and multi-modality benchmark dataset for human action recognition

被引:0
|
作者
Zan Gao
Tao-tao Han
Hua Zhang
Yan-bing Xue
Guang-ping Xu
机构
[1] Tianjin University of Technology Ministry of Education,Key Laboratory of Computer Vision and System
[2] Tianjin University of Technology,Tianjin Key Laboratory of Intelligence Computing and Novel Software Technology
来源
关键词
Action recognition; Benchmark dataset; Multi-view; Multi-modalidy; Cross-view; Multi-task; Cross-domain;
D O I
暂无
中图分类号
学科分类号
摘要
Human action recognition is an active research topic in both computer vision and machine learning communities, which has broad applications including surveillance, biometrics and human computer interaction. In the past decades, although some famous action datasets have been released, there still exist limitations, including the limited action categories and samples, camera views and variety of scenarios. Moreover, most of them are designed for a subset of the learning problems, such as single-view learning problem, cross-view learning problem and multi-task learning problem. In this paper, we introduce a multi-view, multi-modality benchmark dataset for human action recognition (abbreviated to MMA). MMA consists of 7080 action samples from 25 action categories, including 15 single-subject actions and 10 double-subject interactive actions in three views of two different scenarios. Further, we systematically benchmark the state-of-the-art approaches on MMA with respective to all three learning problems by different temporal-spatial feature representations. Experimental results demonstrate that MMA is challenging on all three learning problems due to significant intra-class variations, occlusion issues, views and scene variations, and multiple similar action categories. Meanwhile, we provide the baseline for the evaluation of existing state-of-the-art algorithms.
引用
收藏
页码:29383 / 29404
页数:21
相关论文
共 50 条
  • [21] MMED: A multi-domain and Multi-modality event dataset
    Yang Zhenguo
    Lin Zehang
    Guo Lingni
    Li Qing
    Liu Wenyin
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)
  • [22] Multi-view Regularized Extreme Learning Machine for Human Action Recognition
    Iosifidis, Alexandros
    Tefas, Anastasios
    Pitas, Ioannis
    ARTIFICIAL INTELLIGENCE: METHODS AND APPLICATIONS, 2014, 8445 : 84 - 94
  • [23] Feature Extraction and Representation for Distributed Multi-View Human Action Recognition
    Luo, Jiajia
    Wang, Wei
    Qi, Hairong
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2013, 3 (02) : 145 - 154
  • [24] Human action recognition using multi-view image sequences features
    Ahmad, Mohiuddin
    Lee, Seong-Whan
    PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION - PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE, 2006, : 523 - +
  • [25] Silhouette-Based Multi-View Human Action Recognition in Video
    Aryanfar, Alihossein
    Yaakob, Razali
    Halin, Alfian Abdul
    Sulaiman, Md Nasir
    Kasmiran, Khairul Azhar
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND TECHNOLOGY (ICCST), 2014,
  • [26] Human action recognition using hull convexity defect features with multi-modality setups
    Youssef, M. M.
    Asari, V. K.
    PATTERN RECOGNITION LETTERS, 2013, 34 (15) : 1971 - 1979
  • [27] View knowledge transfer network for multi-view action recognition
    Liang, Zixi
    Yin, Ming
    Gao, Junli
    He, Yicheng
    Huang, Weitian
    IMAGE AND VISION COMPUTING, 2022, 118
  • [28] Joint Transferable Dictionary Learning and View Adaptation for Multi-view Human Action Recognition
    Sun, Bin
    Kong, Dehui
    Wang, Shaofan
    Wang, Lichun
    Yin, Baocai
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2021, 15 (02)
  • [29] Multi-View and Multi-Modal Action Recognition with Learned Fusion
    Ardianto, Sandy
    Hang, Hsueh-Ming
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1601 - 1604
  • [30] Regularized Multi-View Multi-Metric Learning for Action Recognition
    Wu, Xuqing
    Shah, Shishir K.
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 471 - 476