Evaluating a bag-of-visual features approach using spatio-temporal features for action recognition

Cited by: 32
Authors
Nazir, Saima [1]
Yousaf, Muhammad Haroon [1]
Velastin, Sergio A. [2, 3]
Affiliations
[1] Univ Engn & Technol Taxila, Taxila, Pakistan
[2] Univ Carlos III Madrid, Getafe, Spain
[3] Queen Mary Univ London, London, England
Keywords
Human action recognition; Local spatio-temporal features; Bag-of-visual features; Hollywood-2 dataset
DOI
10.1016/j.compeleceng.2018.01.037
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The detection of spatio-temporal interest points plays a key role in human action recognition algorithms. This work exploits the strengths of the bag-of-visual-features approach and presents a method for automatic action recognition in realistic and complex scenarios. It improves the feature representation by combining a well-known detector and descriptor: the 3D Harris space-time interest point detector and the 3D Scale-Invariant Feature Transform (3D SIFT) descriptor. Action videos are then represented as histograms of visual features, following the traditional bag-of-visual-features approach, and a support vector machine (SVM) classifier is used for training and testing. Extensive experiments on existing benchmark datasets demonstrate the effectiveness of the method and show state-of-the-art performance. The article reports 68.1% mean Average Precision (mAP) on Hollywood-2, and average accuracies of 94% and 91.8% on UCF Sports and KTH, respectively. (C) 2018 Elsevier Ltd. All rights reserved.
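The histogram step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes local descriptors (for example 3D SIFT vectors computed at Harris3D interest points) have already been extracted, uses a plain k-means codebook, and L1-normalises the histogram. The function names `build_codebook`, `assign_words`, and `bovf_histogram`, and the parameters `k`, `n_iter`, and `seed`, are hypothetical choices made for this illustration.

```python
import numpy as np


def assign_words(descriptors, centers):
    """Return the index of the nearest codebook entry for each descriptor."""
    # Squared Euclidean distances, shape (n_descriptors, n_words).
    d2 = ((descriptors[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)


def build_codebook(descriptors, k, n_iter=20, seed=0):
    """Cluster training descriptors into k visual words (plain Lloyd's k-means)."""
    rng = np.random.default_rng(seed)
    # Initialise the centres from k distinct training descriptors.
    idx = rng.choice(len(descriptors), size=k, replace=False)
    centers = descriptors[idx].astype(float)
    for _ in range(n_iter):
        labels = assign_words(descriptors, centers)
        for j in range(k):
            members = descriptors[labels == j]
            if len(members) > 0:  # keep the old centre if a cluster is empty
                centers[j] = members.mean(axis=0)
    return centers


def bovf_histogram(descriptors, centers):
    """Represent one video as an L1-normalised histogram of visual words."""
    labels = assign_words(descriptors, centers)
    hist = np.bincount(labels, minlength=len(centers)).astype(float)
    total = hist.sum()
    return hist / total if total > 0 else hist
```

In a full pipeline of the kind the abstract outlines, one such histogram per video would be passed to an SVM classifier for training and testing; the codebook size and the SVM kernel are not stated in this record and would have to be chosen separately.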
Pages: 660-669
Page count: 10