Multi-view region-adaptive multi-temporal DMM and RGB action recognition

被引:9
|
作者
Al-Faris, Mahmoud [1 ]
Chiverton, John P. [1 ]
Yang, Yanyan [2 ]
Ndzi, David L. [3 ]
机构
[1] Univ Portsmouth, Sch Energy & Elect Engn, Portsmouth PO1 3DJ, Hants, England
[2] Univ Portsmouth, Sch Comp, Portsmouth PO1 3HE, Hants, England
[3] Univ West Scotland, Sch Comp Engn & Phys Sci, Paisley PA1 2BE, Renfrew, Scotland
关键词
Action recognition; DMM; 3D CNN; Region adaptive; ENSEMBLE;
D O I
10.1007/s10044-020-00886-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human action recognition remains an important yet challenging task. This work proposes a novel action recognition system. It uses a novel multi-view region-adaptive multi-resolution-in-time depth motion map (MV-RAMDMM) formulation combined with appearance information. Multi-stream 3D convolutional neural networks (CNNs) are trained on the different views and time resolutions of the region-adaptive depth motion maps. Multiple views are synthesised to enhance the view invariance. The region-adaptive weights, based on localised motion, accentuate and differentiate parts of actions possessing faster motion. Dedicated 3D CNN streams for multi-time resolution appearance information are also included. These help to identify and differentiate between small object interactions. A pre-trained 3D-CNN is used here with fine-tuning for each stream along with multi-class support vector machines. Average score fusion is used on the output. The developed approach is capable of recognising both human action and human-object interaction. Three public-domain data-sets, namely MSR 3D Action, Northwestern UCLA multi-view actions and MSR 3D daily activity, are used to evaluate the proposed solution. The experimental results demonstrate the robustness of this approach compared with state-of-the-art algorithms.
引用
收藏
页码:1587 / 1602
页数:16
相关论文
共 50 条
  • [21] Multi-View Action Recognition using Contrastive Learning
    Shah, Ketul
    Shah, Anshul
    Lau, Chun Pong
    de Melo, Celso M.
    Chellappa, Rama
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3370 - 3380
  • [22] Multi-View Action Recognition One Camera At a Time
    Spurlock, Scott
    Souvenir, Richard
    2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 604 - 609
  • [23] Compositional action recognition with multi-view feature fusion
    Zhao, Zhicheng
    Liu, Yingan
    Ma, Lei
    PLOS ONE, 2022, 17 (04):
  • [24] Adaptive multi-view graph convolutional networks for skeleton-based action recognition
    Liu, Xing
    Li, Yanshan
    Xia, Rongjie
    NEUROCOMPUTING, 2021, 444 : 288 - 300
  • [25] View-invariant human action recognition via robust locally adaptive multi-view learning
    Feng, Jia-geng
    Xiao, Jun
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2015, 16 (11) : 917 - 929
  • [26] View-invariant human action recognition via robust locally adaptive multi-view learning
    Jia-geng Feng
    Jun Xiao
    Frontiers of Information Technology & Electronic Engineering, 2015, 16 : 917 - 929
  • [27] Change Detection in Urban Areas by Direct Comparison of Multi-view and Multi-temporal ALS Data
    Hebel, Marcus
    Arens, Michael
    Stilla, Uwe
    PHOTOGRAMMETRIC IMAGE ANALYSIS, 2011, 6952 : 185 - +
  • [28] MULTI-TASK LINEAR DISCRIMINANT ANALYSIS FOR MULTI-VIEW ACTION RECOGNITION
    Yan, Yan
    Liu, Gaowen
    Ricci, Elisa
    Sebe, Nicu
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 2837 - 2841
  • [29] Multi-view graph convolution network for the recognition of human action with spatial and temporal occlusion problems*
    Chen, Yang
    Wang, Ling
    Hu, Dekun
    Cheng, Hong
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 97
  • [30] Temporal Self-Similarity for Appearance-Based Action Recognition in Multi-View Setups
    Koerner, Marco
    Denzler, Joachim
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PT I, 2013, 8047 : 163 - 171