Multi-view region-adaptive multi-temporal DMM and RGB action recognition

被引:9
|
作者
Al-Faris, Mahmoud [1 ]
Chiverton, John P. [1 ]
Yang, Yanyan [2 ]
Ndzi, David L. [3 ]
机构
[1] Univ Portsmouth, Sch Energy & Elect Engn, Portsmouth PO1 3DJ, Hants, England
[2] Univ Portsmouth, Sch Comp, Portsmouth PO1 3HE, Hants, England
[3] Univ West Scotland, Sch Comp Engn & Phys Sci, Paisley PA1 2BE, Renfrew, Scotland
关键词
Action recognition; DMM; 3D CNN; Region adaptive; ENSEMBLE;
D O I
10.1007/s10044-020-00886-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human action recognition remains an important yet challenging task. This work proposes a novel action recognition system. It uses a novel multi-view region-adaptive multi-resolution-in-time depth motion map (MV-RAMDMM) formulation combined with appearance information. Multi-stream 3D convolutional neural networks (CNNs) are trained on the different views and time resolutions of the region-adaptive depth motion maps. Multiple views are synthesised to enhance the view invariance. The region-adaptive weights, based on localised motion, accentuate and differentiate parts of actions possessing faster motion. Dedicated 3D CNN streams for multi-time resolution appearance information are also included. These help to identify and differentiate between small object interactions. A pre-trained 3D-CNN is used here with fine-tuning for each stream along with multi-class support vector machines. Average score fusion is used on the output. The developed approach is capable of recognising both human action and human-object interaction. Three public-domain data-sets, namely MSR 3D Action, Northwestern UCLA multi-view actions and MSR 3D daily activity, are used to evaluate the proposed solution. The experimental results demonstrate the robustness of this approach compared with state-of-the-art algorithms.
引用
收藏
页码:1587 / 1602
页数:16
相关论文
共 50 条
  • [41] MMA: a multi-view and multi-modality benchmark dataset for human action recognition
    Zan Gao
    Tao-tao Han
    Hua Zhang
    Yan-bing Xue
    Guang-ping Xu
    Multimedia Tools and Applications, 2018, 77 : 29383 - 29404
  • [42] Multi-View Inpainting for RGB-D Sequence
    Li, Feiran
    Ricardez, Gustavo Alfonso Garcia
    Takamatsu, Jun
    Ogasawara, Tsukasa
    2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 464 - 473
  • [43] Fine-grained action recognition using multi-view attentions
    Zhu, Yisheng
    Liu, Guangcan
    VISUAL COMPUTER, 2020, 36 (09): : 1771 - 1781
  • [44] Multi-view Regularized Extreme Learning Machine for Human Action Recognition
    Iosifidis, Alexandros
    Tefas, Anastasios
    Pitas, Ioannis
    ARTIFICIAL INTELLIGENCE: METHODS AND APPLICATIONS, 2014, 8445 : 84 - 94
  • [45] A Multi-View Face Recognition System
    张永越
    彭振云
    游素亚
    徐光佑
    JournalofComputerScienceandTechnology, 1997, (05) : 400 - 407
  • [46] MULTI-VIEW DESCRIPTOR MINING VIA CODEWORD NET FOR ACTION RECOGNITION
    Liu, Jingyu
    Huang, Yongzhen
    Peng, Xiaojiang
    Wang, Liang
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 793 - 797
  • [47] Feature Extraction and Representation for Distributed Multi-View Human Action Recognition
    Luo, Jiajia
    Wang, Wei
    Qi, Hairong
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2013, 3 (02) : 145 - 154
  • [48] Cross-modality online distillation for multi-view action recognition
    Xu, Chao
    Wu, Xia
    Li, Yachun
    Jin, Yining
    Wang, Mengmeng
    Liu, Yong
    NEUROCOMPUTING, 2021, 456 : 384 - 393
  • [49] Simultaneous Action Recognition and Localization Based on Multi-View Hough Voting
    Hara, Kensho
    Hirayama, Takatsugu
    Mase, Kenji
    2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013), 2013, : 616 - 620
  • [50] Fine-grained action recognition using multi-view attentions
    Yisheng Zhu
    Guangcan Liu
    The Visual Computer, 2020, 36 : 1771 - 1781