Log-Euclidean bag of words for human action recognition

被引:33
|
作者
Faraki, Masoud [1 ]
Palhang, Maziar [1 ]
Sanderson, Conrad [2 ,3 ]
机构
[1] Isfahan Univ Technol, Artificial Intelligence Lab, Dept Elect & Comp Engn, Esfahan, Iran
[2] Queensland Univ Technol, Brisbane, Qld 4000, Australia
[3] NICTA, Brisbane, Qld 4001, Australia
基金
澳大利亚研究理事会;
关键词
REGION COVARIANCE; CLASSIFICATION; DESCRIPTORS; ROBUST; DENSE;
D O I
10.1049/iet-cvi.2014.0018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Representing videos by densely extracted local space-time features has recently become a popular approach for analysing actions. In this study, the authors tackle the problem of categorising human actions by devising bag of words (BoWs) models based on covariance matrices of spatiotemporal features, with the features formed from histograms of optical flow. Since covariance matrices form a special type of Riemannian manifold, the space of symmetric positive definite (SPD) matrices, non-Euclidean geometry should be taken into account while discriminating between covariance matrices. To this end, the authors propose to embed SPD manifolds to Euclidean spaces via a diffeomorphism and extend the BoW approach to its Riemannian version. The proposed BoW approach takes into account the manifold geometry of SPD matrices during the generation of the codebook and histograms. Experiments on challenging human action datasets show that the proposed method obtains notable improvements in discrimination accuracy, in comparison with several state-of-the-art methods.
引用
收藏
页码:331 / 339
页数:9
相关论文
共 50 条
  • [1] Human Action Recognition under Log-Euclidean Riemannian Metric
    Yuan, Chunfeng
    Hu, Weiming
    Li, Xi
    Maybank, Stephen
    Luo, Guan
    [J]. COMPUTER VISION - ACCV 2009, PT I, 2010, 5994 : 343 - +
  • [2] Action recognition based on spatio-temporal log-euclidean covariance matrix
    [J]. 1600, Science and Engineering Research Support Society (09):
  • [3] Iris Recognition Using Ordinal Encoding of Log-Euclidean Covariance Matrices
    Li, Peihua
    Wu, Guolong
    [J]. 2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 2420 - 2423
  • [4] Double constrained bag of words for human action recognition
    Wu, Chao
    Li, Yaqian
    Zhang, Yaru
    Liu, Bin
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 98
  • [5] A log-euclidean framework for statistics on diffeomorphisms
    Arsigny, Vincent
    Commowick, Olivier
    Pennec, Xavier
    Ayache, Nicholas
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2006, PT 1, 2006, 4190 : 924 - 931
  • [6] An extension of kernel learning methods using a modified Log-Euclidean distance for fast and accurate skeleton-based Human Action Recognition
    Ghorbel, Enjie
    Boonaert, Jacques
    Boutteau, Rami
    Lecoeuche, Stephane
    Savatier, Xavier
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 175 : 32 - 43
  • [7] Discriminant Bag of Words based representation for human action recognition
    Iosifidis, Alexandros
    Tefas, Anastastios
    Pitas, Ioannis
    [J]. PATTERN RECOGNITION LETTERS, 2014, 49 : 185 - 192
  • [8] Log-Euclidean Metrics for Contrast Preserving Decolorization
    Liu, Qiegen
    Shao, Guangpu
    Wang, Yuhao
    Gao, Junbin
    Leung, Henry
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (12) : 5772 - 5783
  • [9] Log-Euclidean free-form deformation
    Modat, Marc
    Ridgway, Gerard R.
    Daga, Pankaj
    Cardoso, M. Jorge
    Hawkes, David J.
    Ashburner, John
    Ourselin, Sebastien
    [J]. MEDICAL IMAGING 2011: IMAGE PROCESSING, 2011, 7962
  • [10] A new bag of visual words encoding method for human action recognition
    Cortes, Xavier
    Conte, Donatello
    Cardot, Hubert
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2480 - 2485