Action Recognition Using Rate-Invariant Analysis of Skeletal Shape Trajectories

被引:151
|
作者
Ben Amor, Boulbaba [1 ]
Su, Jingyong [2 ]
Srivastava, Anuj [3 ]
机构
[1] Inst Mines Telecom Telecom Lille, CRIStAL UMR 9189, Lille, France
[2] Florida State Univ, Dept Stat, Tallahassee, FL 32306 USA
[3] Texas Tech Univ, Dept Math & Stat, Lubbock, TX 79409 USA
基金
美国国家科学基金会;
关键词
Action recognition; Riemannian geometry; manifold trajectories; depth sensors; skeletal data; ENSEMBLE; LATENCY;
D O I
10.1109/TPAMI.2015.2439257
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the problem of classifying actions of human subjects using depth movies generated by Kinect or other depth sensors. Representing human body as dynamical skeletons, we study the evolution of their (skeletons') shapes as trajectories on Kendall's shape manifold. The action data is typically corrupted by large variability in execution rates within and across subjects and, thus, causing major problems in statistical analyses. To address that issue, we adopt a recently-developed framework of Su et al. [1], [2] to this problem domain. Here, the variable execution rates correspond to re-parameterizations of trajectories, and one uses a parameterization-invariant metric for aligning, comparing, averaging, and modeling trajectories. This is based on a combination of transported square-root vector fields (TSRVFs) of trajectories and the standard Euclidean norm, that allows computational efficiency. We develop a comprehensive suite of computational tools for this application domain: smoothing and denoising skeleton trajectories using median filtering, up-and down-sampling actions in time domain, simultaneous temporal-registration of multiple actions, and extracting invertible Euclidean representations of actions. Due to invertibility these Euclidean representations allow both discriminative and generative models for statistical analysis. For instance, they can be used in a SVM-based classification of original actions, as demonstrated here using MSR Action-3D, MSR Daily Activity and 3D Action Pairs datasets. Using only the skeletal information, we achieve state-of-the-art classification results on these datasets.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [1] Human object interaction recognition using rate-invariant shape analysis of inter joint distances trajectories
    Meng, Meng
    Drira, Hassen
    Daoudi, Mohamed
    Boonaert, Jacques
    [J]. PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 999 - 1004
  • [2] Rate-Invariant Analysis of Covariance Trajectories
    Zhengwu Zhang
    Jingyong Su
    Eric Klassen
    Huiling Le
    Anuj Srivastava
    [J]. Journal of Mathematical Imaging and Vision, 2018, 60 : 1306 - 1323
  • [3] Rate-Invariant Analysis of Covariance Trajectories
    Zhang, Zhengwu
    Su, Jingyong
    Klassen, Eric
    Le, Huiling
    Srivastava, Anuj
    [J]. JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2018, 60 (08) : 1306 - 1323
  • [4] Rate-Invariant Analysis of Trajectories on Riemannian Manifolds with Application in Visual Speech Recognition
    Su, Jingyong
    Srivastava, Anuj
    de Souza, Fillipe D. M.
    Sarkar, Sudeep
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 620 - 627
  • [5] Rate-Invariant Recognition of Humans and Their Activities
    Veeraraghavan, Ashok
    Srivastava, Anuj
    Roy-Chowdhury, Amit K.
    Chellappa, Rama
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2009, 18 (06) : 1326 - 1339
  • [6] Rate-Invariant Modeling in Lie Algebra for Activity Recognition
    Boujebli, Malek
    Drira, Hassen
    Mestiri, Makram
    Farah, Imed Riadh
    [J]. ELECTRONICS, 2020, 9 (11) : 1 - 16
  • [7] Rate-Invariant Comparisons of Covariance Paths for Visual Speech Recognition
    Su, Jingyong
    Srivastava, Anuj
    Souza, Fillipe
    Sarkar, Sudeep
    [J]. 2013 FOURTH NATIONAL CONFERENCE ON COMPUTER VISION, PATTERN RECOGNITION, IMAGE PROCESSING AND GRAPHICS (NCVPRIPG), 2013,
  • [8] ORIGIN OF THE EUKARYOTIC NUCLEUS DETERMINED BY RATE-INVARIANT ANALYSIS OF RIBOSOMAL-RNA SEQUENCES
    LAKE, JA
    [J]. NATURE, 1988, 331 (6152) : 184 - 186
  • [9] Rate Invariant Action Recognition in Lie Algebra
    Boujebli, Malek
    Drira, Hassen
    Mestiri, Makram
    Farah, I. R.
    [J]. 2017 3RD INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2017, : 207 - 213
  • [10] View and scale invariant action recognition using multiview shape-flow models
    Natarajan, Pradeep
    Nevatia, Ramakant
    [J]. 2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 2916 - 2923