Multimodal Multipart Learning for Action Recognition in Depth Videos

被引:76
|
作者
Shahroudy, Amir [1 ,2 ]
Ng, Tian-Tsong [2 ]
Yang, Qingxiong [3 ]
Wang, Gang [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Inst Infocomm Res, 1 Fusionopolis Way, Singapore 138632, Singapore
[3] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
基金
新加坡国家研究基金会;
关键词
Action recognition; kinect; joint sparse regression; mixed norms; structured sparsity; group feature selection; MULTITASK; FEATURES; SELECTION; TRACKING; SPARSITY; MODEL;
D O I
10.1109/TPAMI.2015.2505295
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The articulated and complex nature of human actions makes the task of action recognition difficult. One approach to handle this complexity is dividing it to the kinetics of body parts and analyzing the actions based on these partial descriptors. We propose a joint sparse regression based learning method which utilizes the structured sparsity to model each action as a combination of multimodal features from a sparse set of body parts. To represent dynamics and appearance of parts, we employ a heterogeneous set of depth and skeleton based features. The proper structure of multimodal multipart features are formulated into the learning framework via the proposed hierarchical mixed norm, to regularize the structured features of each part and to apply sparsity between them, in favor of a group feature selection. Our experimental results expose the effectiveness of the proposed learning method in which it outperforms other methods in all three tested datasets while saturating one of them by achieving perfect accuracy.
引用
收藏
页码:2123 / 2129
页数:7
相关论文
共 50 条
  • [31] Action Recognition From Thermal Videos
    Batchuluun, Ganbayar
    Nguyen, Dat Tien
    Tuyen Danh Pham
    Park, Chanhum
    Park, Kang Ryoung
    IEEE ACCESS, 2019, 7 : 103893 - 103917
  • [32] Human Action Recognition from Depth Videos Using Multi-Projection based Representation
    Le, Chien-Quang
    Thanh Duc Ngo
    Duy-Dinh Le
    Satoh, Shin'ichi
    Duc Anh Duong
    2015 IEEE 17TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2015,
  • [33] Deep ChaosNet for Action Recognition in Videos
    Chen, Huafeng
    Zhang, Maosheng
    Gao, Zhengming
    Zhao, Yunhong
    COMPLEXITY, 2021, 2021
  • [34] ACTION RECOGNITION IN UNCONSTRAINED AMATEUR VIDEOS
    Liu, Jingen
    Luo, Jiebo
    Shah, Mubarak
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3549 - +
  • [35] Group Action Recognition in Soccer Videos
    Kong, Yu
    Zhan, Xiaoqin
    Wei, Qingdi
    Hu, Weiming
    Jia, Yunde
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 249 - +
  • [36] Accelerated action recognition and segmentation in videos
    Ghodhbani, Emna
    Mefteh, Ahmed
    Benazza-Benyahia, Amel
    2020 10TH INTERNATIONAL SYMPOSIUM ON SIGNAL, IMAGE, VIDEO AND COMMUNICATIONS (ISIVC), 2021,
  • [37] Human Action Recognition in Videos Using Kinematic Features and Multiple Instance Learning
    Ali, Saad
    Shah, Mubarak
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (02) : 288 - 303
  • [38] Deep metric learning for open-set human action recognition in videos
    Gutoski, Matheus
    Lazzaretti, Andre Eugenio
    Lopes, Heitor Silverio
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (04): : 1207 - 1220
  • [39] Deep metric learning for open-set human action recognition in videos
    Matheus Gutoski
    André Eugênio Lazzaretti
    Heitor Silvério Lopes
    Neural Computing and Applications, 2021, 33 : 1207 - 1220
  • [40] Enhancing Anomaly Detection in Surveillance Videos with Transfer Learning from Action Recognition
    Liu, Kun
    Zhu, Minzhi
    Fu, Huiyuan
    Ma, Huadong
    Chua, Tat-Seng
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4664 - 4668