Evaluating a bag-of-visual features approach using spatio-temporal features for action recognition

被引:32
|
作者
Nazir, Saima [1 ]
Yousaf, Muhammad Haroon [1 ]
Velastin, Sergio A. [2 ,3 ]
机构
[1] Univ Engn & Technol Taxila, Taxila, Pakistan
[2] Univ Carlos III Madrid, Getafe, Spain
[3] Queen Mary Univ London, London, England
关键词
Human action recognition; Local spatio-temporal features; Bag-of-visual features; Hollywood-2; dataset;
D O I
10.1016/j.compeleceng.2018.01.037
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The detection of the spatial-temporal interest points has a key role in human action recognition algorithms. This research work aims to exploit the existing strength of bag-of-visual features and presents a method for automatic action recognition in realistic and complex scenarios. This paper provides a better feature representation by combining the benefit of both a well-known feature detector and descriptor i.e. the 3D Harris space-time interest point detector and the 3D Scale-Invariant Feature Transform descriptor. Finally, action videos are represented using a histogram of visual features by following the traditional bag-of-visual feature approach. Apart from video representation, a support vector machine (SVM) classifier is used for training and testing. A large number of experiments show the effectiveness of our method on existing benchmark datasets and shows state-of-the-art performance. This article reports 68.1% mean Average Precision (mAP), 94% and 91.8% average accuracy for Hollywood-2, UCF Sports and KTH datasets respectively. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:660 / 669
页数:10
相关论文
共 50 条
  • [1] Spatio-Temporal Frames in a Bag-of-visual-features Approach for Human Actions Recognition
    Lopes, Ana Paula B.
    Oliveira, Rodrigo S.
    de Almeida, Jussara M.
    Araujo, Arnaldo de A.
    2009 XXII BRAZILIAN SYMPOSIUM ON COMPUTER GRAPHICS AND IMAGE PROCESSING (SIBGRAPI 2009), 2009, : 315 - 321
  • [2] Action recognition using spatio-temporal regularity based features
    Goodhart, Taylor
    Yan, Pingkun
    Shah, Mubarak
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 745 - 748
  • [3] Action Recognition Using Discriminative Spatio-Temporal Neighborhood Features
    Cheng, Shi-Lei
    Yang, Jiang-Feng
    Ma, Zheng
    Xie, Mei
    INTERNATIONAL CONFERENCE ON COMPUTER NETWORKS AND INFORMATION SECURITY (CNIS 2015), 2015, : 166 - 172
  • [4] Learning Bag of Spatio-Temporal Features for Human Interaction Recognition
    Slimani, Khadidja Nour El Houda
    Benezeth, Yannick
    Souami, Feryel
    TWELFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2019), 2020, 11433
  • [5] Graph-based approach for human action recognition using spatio-temporal features
    Ben Aoun, Najib
    Mejdoub, Mahmoud
    Ben Amar, Chokri
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2014, 25 (02) : 329 - 338
  • [6] SKELETON ACTION RECOGNITION BASED ON SPATIO-TEMPORAL FEATURES
    Huang, Qian
    Xie, Mengting
    Li, Xing
    Wang, Shuaichen
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3284 - 3288
  • [7] Spatio-temporal Semantic Features for Human Action Recognition
    Liu, Jia
    Wang, Xiaonian
    Li, Tianyu
    Yang, Jie
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2012, 6 (10): : 2632 - 2649
  • [8] Human Action Recognition Based on Spatio-temporal Features
    Sawant, Nikhil
    Biswas, K. K.
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2009, 5909 : 357 - 362
  • [9] Accelerated Learning of Discriminative Spatio-temporal Features for Action Recognition
    Varshney, Munender
    Rameshan, Renu
    2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
  • [10] ACTION RECOGNITION BY ORTHOGONALIZED SUBSPACES OF LOCAL SPATIO-TEMPORAL FEATURES
    Raytchev, Bisser
    Shigenaka, Ryosuke
    Tamaki, Toru
    Kaneda, Kazufumi
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 4387 - 4391