Non-Linear Temporal Subspace Representations for Activity Recognition

被引:27
|
作者
Cherian, Anoop [1 ,3 ]
Sra, Suvrit [2 ]
Gould, Stephen [3 ]
Hartley, Richard [3 ]
机构
[1] MERL, Cambridge, MA 02139 USA
[2] MIT, Cambridge, MA 02139 USA
[3] ANU Canberra, ACRV, Canberra, ACT, Australia
关键词
D O I
10.1109/CVPR.2018.00234
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Representations that can compactly and effectively capture the temporal evolution of semantic content are important to computer vision and machine learning algorithms that operate on multi-variate time-series data. We investigate such representations motivated by the task of human action recognition. Here each data instance is encoded by a multivariate feature (such as via a deep CNN) where action dynamics are characterized by their variations in time. As these features are often non-linear, we propose a novel pooling method, kernelized rank pooling, that represents a given sequence compactly as the pre-image of the parameters of a hyperplane in a reproducing kernel Hilbert space, projections of data onto which captures their temporal order. We develop this idea further and show that such a pooling scheme can be cast as an order-constrained kernelized PCA objective. We then propose to use the parameters of a kernelized low-rank feature subspace as the representation of the sequences. We cast our formulation as an optimization problem on generalized Grassmann manifolds and then solve it efficiently using Riemannian optimization techniques. We present experiments on several action recognition datasets using diverse feature modalities and demonstrate state-of-the-art results.
引用
收藏
页码:2197 / 2206
页数:10
相关论文
共 50 条
  • [1] Subspace Clustering for Action Recognition with Covariance Representations and Temporal Pruning
    Paoletti, Giancarlo
    Cavazza, Jacopo
    Beyan, Cigdem
    Del Bue, Alessio
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 6035 - 6042
  • [2] Performance of different models in non-linear subspace
    Bodapati, Jyostna Devi
    Veeranjaneyulu, N.
    [J]. 2016 INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (ICONSIP), 2016,
  • [3] NON-LINEAR REPRESENTATIONS OF LIE GROUPS
    FLATO, M
    PINCZON, G
    SIMON, J
    [J]. ANNALES SCIENTIFIQUES DE L ECOLE NORMALE SUPERIEURE, 1977, 10 (03): : 405 - 418
  • [4] Feature Extraction Using Linear and Non-linear Subspace Techniques
    Teixeira, Ana R.
    Tome, Ana Maria
    Lang, E. W.
    [J]. ARTIFICIAL NEURAL NETWORKS - ICANN 2009, PT II, 2009, 5769 : 115 - +
  • [5] Video event segmentation and visualisation in non-linear subspace
    Tziakos, Ioannis
    Cavallaro, Andrea
    Xu, Li-Qun
    [J]. PATTERN RECOGNITION LETTERS, 2009, 30 (02) : 123 - 131
  • [6] Recursive subspace identification of linear and non-linear Wiener type models
    Lovera, M
    Verhaegen, R
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON CONTROL APPLICATIONS, VOLS 1 AND 2, 1996, : 1250 - 1254
  • [7] REPRESENTATIONS OF NON-LINEAR SYSTEMS - THE NARMAX MODEL
    CHEN, S
    BILLINGS, SA
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 1989, 49 (03) : 1013 - 1032
  • [8] NON-LINEAR RECURRENT ALGORITHMS FOR RESTORATION OF REPRESENTATIONS
    BAKUT, PA
    SIDELNIKOV, VN
    [J]. ENGINEERING CYBERNETICS, 1979, 17 (05): : 110 - 115
  • [9] SPARSE REPRESENTATIONS IN NESTED NON-LINEAR MODELS
    Dremeau, Angelique
    Heas, Patrick
    Herzet, Cedric
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [10] LAGRANGE AND EULER REPRESENTATIONS OF NON-LINEAR ACOUSTICS
    POIREE, B
    [J]. JOURNAL DE PHYSIQUE, 1979, 41 : 48 - 52