Statistics of Pairwise Co-occurring Local Spatio-temporal Features for Human Action Recognition

被引:0
|
作者
Bilinski, Piotr [1 ]
Bremond, Francois [1 ]
机构
[1] INRIA Sophia Antipolis, STARS Team, 2004 Route Lucioles, F-06902 Sophia Antipolis, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The bag-of-words approach with local spatio-temporal features have become a popular video representation for action recognition in videos. Together these techniques have demonstrated high recognition results for a number of action classes. Recent approaches have typically focused on capturing global statistics of features. However, existing methods ignore relations between features and thus may not be discriminative enough. Therefore, we propose a novel feature representation which captures statistics of pairwise co-occurring local spatio-temporal features. Our representation captures not only global distribution of features but also focuses on geometric and appearance (both visual and motion) relations among the features. Calculating a set of bag-of-words representations with different geometrical arrangement among the features, we keep an important association between appearance and geometric information. Using two benchmark datasets for human action recognition, we demonstrate that our representation enhances the discriminative power of features and improves action recognition performance.
引用
收藏
页码:311 / 320
页数:10
相关论文
共 50 条
  • [31] Action recognition using lie algebrized gaussians over dense local spatio-temporal features
    Chen, Meng
    Gong, Liyu
    Wang, Tianjiang
    Feng, Qi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (06) : 2127 - 2142
  • [32] Higher-Level Representation of Local Spatio-Temporal Features for Human Action Recognition Using Subspace Matching Kernels
    Raytchev, Bisser
    Kawamoto, Hideaki
    Tamaki, Toru
    Kaneda, Kazufumi
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3862 - 3867
  • [33] Spatio-Temporal Action Localization for Human Action Recognition in Large Dataset
    Megrhi, Sameh
    Jmal, Marwa
    Beghdadi, Azeddine
    Mseddi, Wided
    VIDEO SURVEILLANCE AND TRANSPORTATION IMAGING APPLICATIONS 2015, 2015, 9407
  • [34] Local descriptors for spatio-temporal recognition
    Laptev, Ivan
    Lindeberg, Tony
    SPATIAL COHERENCE FOR VISUAL MOTION ANALYSIS, 2006, 3667 : 91 - 103
  • [35] Adaptive Pooling of the Most Relevant Spatio-Temporal Features for Action Recognition
    Ahmed, Faisal
    Paul, Padma Polash
    Gavrilova, Marina
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 177 - 180
  • [36] Learning Bag of Spatio-Temporal Features for Human Interaction Recognition
    Slimani, Khadidja Nour El Houda
    Benezeth, Yannick
    Souami, Feryel
    TWELFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2019), 2020, 11433
  • [37] Learning spatio-temporal features for action recognition from the side of the video
    Pei, Lishen
    Ye, Mao
    Zhao, Xuezhuan
    Xiang, Tao
    Li, Tao
    SIGNAL IMAGE AND VIDEO PROCESSING, 2016, 10 (01) : 199 - 206
  • [38] Learning to Represent Spatio-Temporal Features for Fine Grained Action Recognition
    Sakhalkar, Kaustubh
    Bremond, Francois
    2018 IEEE THIRD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, APPLICATIONS AND SYSTEMS (IPAS), 2018, : 268 - 272
  • [39] Spatio-Temporal VLAD Encoding for Human Action Recognition in Videos
    Duta, Ionut C.
    Ionescu, Bogdan
    Aizawa, Kiyoharu
    Sebe, Nicu
    MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 : 365 - 378
  • [40] Multimodal human action recognition based on spatio-temporal action representation recognition model
    Wu, Qianhan
    Huang, Qian
    Li, Xing
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (11) : 16409 - 16430