Multi-view region-adaptive multi-temporal DMM and RGB action recognition

被引：9

作者：

Al-Faris, Mahmoud ^{[1
]}

Chiverton, John P. ^{[1
]}

Yang, Yanyan ^{[2
]}

Ndzi, David L. ^{[3
]}

机构：

[1] Univ Portsmouth, Sch Energy & Elect Engn, Portsmouth PO1 3DJ, Hants, England

[2] Univ Portsmouth, Sch Comp, Portsmouth PO1 3HE, Hants, England

[3] Univ West Scotland, Sch Comp Engn & Phys Sci, Paisley PA1 2BE, Renfrew, Scotland

来源：

PATTERN ANALYSIS AND APPLICATIONS | 2020年 / 23卷 / 04期

关键词：

Action recognition; DMM; 3D CNN; Region adaptive; ENSEMBLE;

D O I：

10.1007/s10044-020-00886-5

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Human action recognition remains an important yet challenging task. This work proposes a novel action recognition system. It uses a novel multi-view region-adaptive multi-resolution-in-time depth motion map (MV-RAMDMM) formulation combined with appearance information. Multi-stream 3D convolutional neural networks (CNNs) are trained on the different views and time resolutions of the region-adaptive depth motion maps. Multiple views are synthesised to enhance the view invariance. The region-adaptive weights, based on localised motion, accentuate and differentiate parts of actions possessing faster motion. Dedicated 3D CNN streams for multi-time resolution appearance information are also included. These help to identify and differentiate between small object interactions. A pre-trained 3D-CNN is used here with fine-tuning for each stream along with multi-class support vector machines. Average score fusion is used on the output. The developed approach is capable of recognising both human action and human-object interaction. Three public-domain data-sets, namely MSR 3D Action, Northwestern UCLA multi-view actions and MSR 3D daily activity, are used to evaluate the proposed solution. The experimental results demonstrate the robustness of this approach compared with state-of-the-art algorithms.

引用

页码：1587 / 1602

页数：16

共 50 条

[1] Multi-view region-adaptive multi-temporal DMM and RGB action recognition
Mahmoud Al-Faris
John P. Chiverton
Yanyan Yang
David Ndzi
Pattern Analysis and Applications, 2020, 23 : 1587 - 1602
[2] Action Recognition with a Multi-View Temporal Attention Network
Dengdi Sun
Zhixiang Su
Zhuanlian Ding
Bin Luo
Cognitive Computation, 2022, 14 : 1082 - 1095
[3] Action Recognition with a Multi-View Temporal Attention Network
Sun, Dengdi
Su, Zhixiang
Ding, Zhuanlian
Luo, Bin
COGNITIVE COMPUTATION, 2022, 14 (03) : 1082 - 1095
[4] Multi-view representation learning for multi-view action recognition
Hao, Tong
Wu, Dan
Wang, Qian
Sun, Jin-Sheng
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 48 : 453 - 460
[5] Dilated Multi-Temporal Modeling for Action Recognition
Zhang, Tao
Wu, Yifan
Li, Xiaoqiang
APPLIED SCIENCES-BASEL, 2023, 13 (12):
[6] Multi-View Super Vector for Action Recognition
Cai, Zhuowei
Wang, Limin
Peng, Xiaojiang
Qiao, Yu
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 596 - 603
[7] Multi-view human action recognition: A survey
Iosifidis, Alexandros
Tefas, Anastasios
Pitas, Ioannis
2013 NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2013), 2013, : 522 - 525
[8] Continuous Multi-View Human Action Recognition
Wang, Qiang
Sun, Gan
Dong, Jiahua
Wang, Qianqian
Ding, Zhengming
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (06) : 3603 - 3614
[9] Generative Multi-View Human Action Recognition
Wang, Lichen
Ding, Zhengming
Tao, Zhiqiang
Liu, Yunyu
Fu, Yun
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6221 - 6230
[10] Action Recognition Using Multi-Temporal DMMs Based on Adaptive Vague Division
Jiang, Min
Jin, Ke
Kong, Jun
PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON IMAGE AND GRAPHICS PROCESSING (ICIGP 2018), 2018, : 8 - 13

← 1 2 3 4 5 →