Feature Fusion of Deep Spatial Features and Handcrafted Spatiotemporal Features for Human Action Recognition

被引:16
|
作者
Uddin, Md Azher [1 ]
Lee, Young-Koo [1 ]
机构
[1] Kyung Hee Univ, Dept Comp Sci & Engn, Global Campus, Yongin 17104, South Korea
关键词
deep spatial features; spatiotemporal features; Inception-Resnet-v2; Weber's law based volume local gradient ternary pattern; RECOGNIZING HUMAN ACTIONS;
D O I
10.3390/s19071599
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Human action recognition plays a significant part in the research community due to its emerging applications. A variety of approaches have been proposed to resolve this problem, however, several issues still need to be addressed. In action recognition, effectively extracting and aggregating the spatial-temporal information plays a vital role to describe a video. In this research, we propose a novel approach to recognize human actions by considering both deep spatial features and handcrafted spatiotemporal features. Firstly, we extract the deep spatial features by employing a state-of-the-art deep convolutional network, namely Inception-Resnet-v2. Secondly, we introduce a novel handcrafted feature descriptor, namely Weber's law based Volume Local Gradient Ternary Pattern (WVLGTP), which brings out the spatiotemporal features. It also considers the shape information by using gradient operation. Furthermore, Weber's law based threshold value and the ternary pattern based on an adaptive local threshold is presented to effectively handle the noisy center pixel value. Besides, a multi-resolution approach for WVLGTP based on an averaging scheme is also presented. Afterward, both these extracted features are concatenated and feed to the Support Vector Machine to perform the classification. Lastly, the extensive experimental analysis shows that our proposed method outperforms state-of-the-art approaches in terms of accuracy.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Atomic Action Features: A New Feature for Action Recognition
    Zhou, Qiang
    Wang, Gang
    [J]. COMPUTER VISION - ECCV 2012: WORKSHOPS AND DEMONSTRATIONS, PT I, 2012, 7583 : 291 - 300
  • [42] Touch Gesture Recognition Using Spatiotemporal Fusion Features
    Li, Yun-Kai
    Meng, Qing-Hao
    Zhang, Hong-Wei
    [J]. IEEE SENSORS JOURNAL, 2022, 22 (01) : 428 - 437
  • [43] Human Action Recognition by Decision-Making Level Fusion Based on Spatial-Temporal Features
    Li Yandi
    Xu Xiping
    [J]. ACTA OPTICA SINICA, 2018, 38 (08)
  • [44] FEC: A Feature Fusion Framework for SAR Target Recognition Based on Electromagnetic Scattering Features and Deep CNN Features
    Zhang, Jinsong
    Xing, Mengdao
    Xie, Yiyuan
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (03): : 2174 - 2187
  • [45] Human Action Recognition: A Paradigm of Best Deep Learning Features Selection and Serial Based Extended Fusion
    Khan, Seemab
    Khan, Muhammad Attique
    Alhaisoni, Majed
    Tariq, Usman
    Yong, Hwan-Seung
    Armghan, Ammar
    Alenezi, Fayadh
    [J]. SENSORS, 2021, 21 (23)
  • [46] An efficient human action recognition framework with pose-based spatiotemporal features
    Agahian, Saeid
    Negin, Farhood
    Kose, Cemal
    [J]. ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2020, 23 (01): : 196 - 203
  • [47] Human action recognition in videos based on spatiotemporal features and bag-of-poses
    da Silva, Murilo Varges
    Marana, Aparecido Nilceu
    [J]. APPLIED SOFT COMPUTING, 2020, 95 (95)
  • [48] Human action recognition using fusion of features for unconstrained video sequences
    Patel, Chirag I.
    Garg, Sanjay
    Zaveri, Tanish
    Banerjee, Asim
    Patel, Ripal
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2018, 70 : 284 - 301
  • [49] DMMs-Based Multiple Features Fusion for Human Action Recognition
    Bulbul, Mohammad Farhad
    Jiang, Yunsheng
    Ma, Jinwen
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT, 2015, 6 (04): : 23 - 39
  • [50] A deep multimodal network based on bottleneck layer features fusion for action recognition
    Tej Singh
    Dinesh Kumar Vishwakarma
    [J]. Multimedia Tools and Applications, 2021, 80 : 33505 - 33525