Evaluating a bag-of-visual features approach using spatio-temporal features for action recognition

被引:32
|
作者
Nazir, Saima [1 ]
Yousaf, Muhammad Haroon [1 ]
Velastin, Sergio A. [2 ,3 ]
机构
[1] Univ Engn & Technol Taxila, Taxila, Pakistan
[2] Univ Carlos III Madrid, Getafe, Spain
[3] Queen Mary Univ London, London, England
关键词
Human action recognition; Local spatio-temporal features; Bag-of-visual features; Hollywood-2; dataset;
D O I
10.1016/j.compeleceng.2018.01.037
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The detection of the spatial-temporal interest points has a key role in human action recognition algorithms. This research work aims to exploit the existing strength of bag-of-visual features and presents a method for automatic action recognition in realistic and complex scenarios. This paper provides a better feature representation by combining the benefit of both a well-known feature detector and descriptor i.e. the 3D Harris space-time interest point detector and the 3D Scale-Invariant Feature Transform descriptor. Finally, action videos are represented using a histogram of visual features by following the traditional bag-of-visual feature approach. Apart from video representation, a support vector machine (SVM) classifier is used for training and testing. A large number of experiments show the effectiveness of our method on existing benchmark datasets and shows state-of-the-art performance. This article reports 68.1% mean Average Precision (mAP), 94% and 91.8% average accuracy for Hollywood-2, UCF Sports and KTH datasets respectively. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:660 / 669
页数:10
相关论文
共 50 条
  • [21] A fast human action recognition network based on spatio-temporal features
    Xu, Jie
    Song, Rui
    Wei, Haoliang
    Guo, Jinhong
    Zhou, Yifei
    Huang, Xiwei
    Neurocomputing, 2021, 441 : 350 - 358
  • [22] Human Action Recognition by SOM Considering the Probability of Spatio-temporal Features
    Ji, Yanli
    Shimada, Atsushi
    Taniguchi, Rin-ichiro
    NEURAL INFORMATION PROCESSING: MODELS AND APPLICATIONS, PT II, 2010, 6444 : 391 - 398
  • [23] Learning to Represent Spatio-Temporal Features for Fine Grained Action Recognition
    Sakhalkar, Kaustubh
    Bremond, Francois
    2018 IEEE THIRD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, APPLICATIONS AND SYSTEMS (IPAS), 2018, : 268 - 272
  • [24] Learning spatio-temporal features for action recognition from the side of the video
    Lishen Pei
    Mao Ye
    Xuezhuan Zhao
    Tao Xiang
    Tao Li
    Signal, Image and Video Processing, 2016, 10 : 199 - 206
  • [25] Human Action Recognition in Video by Fusion of Structural and Spatio-temporal Features
    Borzeshi, Ehsan Zare
    Concha, Oscar Perez
    Piccardi, Massimo
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2012, 7626 : 474 - 482
  • [26] Human Interaction Recognition Using Improved Spatio-Temporal Features
    Sivarathinabala, M.
    Abirami, S.
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND INFORMATICS (ICACNI 2015), VOL 1, 2016, 43 : 191 - 199
  • [27] Affective interaction recognition using spatio-temporal features and context
    Liang, Jinglian
    Xu, Chao
    Feng, Zhiyong
    Ma, Xirong
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2016, 144 : 155 - 165
  • [28] A Robust Approach for Action Recognition Based on Spatio-Temporal Features in RGB-D Sequences
    Ly Quoc Ngoc
    Vo Hoai Viet
    Tran Thai Son
    Pham Minh Hoang
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (05) : 166 - 177
  • [29] Bag of Spatio-temporal Synonym Sets for Human Action Recognition
    Pang, Lin
    Cao, Juan
    Guo, Junbo
    Lin, Shouxun
    Song, Yan
    ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2010, 5916 : 422 - 432
  • [30] Fast Realistic Multi-Action Recognition using Mined Dense Spatio-temporal Features
    Gilbert, Andrew
    Illingworth, John
    Bowden, Richard
    2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 925 - 931