Spatio-Temporal Convolutional Sparse Auto-Encoder for Sequence Classification

被引:43
|
作者
Baccouche, Moez [1 ]
Mamalet, Franck [1 ]
Wolf, Christian [2 ]
Garcia, Christophe [2 ]
Baskurt, Atilla [2 ]
机构
[1] Orange Labs R&D, 4 Rue Clos Courtel, F-35510 Lyon, France
[2] Univ Lyon, CNRS INSA Lyon, LIRIS, UMR, F-69621 Villeurbanne, France
关键词
SCALE;
D O I
10.5244/C.26.124
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present in this paper a novel learning-based approach for video sequence classification. Contrary to the dominant methodology, which relies on hand-crafted features that are manually engineered to be optimal for a specific task, our neural model automatically learns a sparse shift-invariant representation of the local 2D + t salient information, without any use of prior knowledge. To that aim, a spatio-temporal convolutional sparse auto-encoder is trained to project a given input in a feature space, and to reconstruct it from its projection coordinates. Learning is performed in an unsupervised manner by minimizing a global parametrized objective function. The sparsity is ensured by adding a sparsifying logistic between the encoder and the decoder, while the shift-invariance is handled by including an additional hidden variable to the objective function. The temporal evolution of the obtained sparse features is learned by a long short-term memory recurrent neural network trained to classify each sequence. We show that, since the feature learning process is problem-independent, the model achieves outstanding performances when applied to two different problems, namely human action and facial expression recognition. Obtained results are superior to the state of the art on the GEMEP-FERA dataset and among the very best on the KTH dataset.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] The Unsupervised Hierarchical Convolutional Sparse Auto-Encoder for Neuroimaging Data Classification
    Han, Xiaobing
    Zhong, Yanfei
    He, Lifang
    Yu, Philip S.
    Zhang, Liangpei
    [J]. BRAIN INFORMATICS AND HEALTH (BIH 2015), 2015, 9250 : 156 - 166
  • [2] Data imputation in IoT using Spatio-Temporal Variational Auto-Encoder
    Zhang, Shuo
    Chen, Jinyi
    Chen, Jiayuan
    Chen, Xiaofei
    Huang, Hejiao
    [J]. NEUROCOMPUTING, 2023, 529 : 23 - 32
  • [3] Joint Sparse Auto-encoder: A Semi-supervised Spatio-temporal Approach in Mapping Large-scale Croplands
    Jia, Xiaowei
    Hu, Yifan
    Khandelwal, Ankush
    Karpatne, Anuj
    Kumar, Vipin
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 1173 - 1182
  • [4] Unsupervised Hierarchical Convolutional Sparse Auto-encoder For High Spatial Resolution Imagery Scene Classification
    Han, Xiaobing
    Zhong, Yanfei
    Zhao, Bei
    Zhang, Liangpei
    [J]. 2015 11TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2015, : 42 - 46
  • [5] Scene classification based on a hierarchical convolutional sparse auto-encoder for high spatial resolution imagery
    Han, Xiaobing
    Zhong, Yanfei
    Zhao, Bei
    Zhang, Liangpei
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2017, 38 (02) : 514 - 536
  • [6] An Ensemble Net of Convolutional Auto-Encoder and Graph Auto-Encoder for Auto-Diagnosis
    Li, Jianqiang
    Ji, Changping
    Yan, Guokai
    You, Linlin
    Chen, Jie
    [J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2021, 13 (01) : 189 - 199
  • [7] Efficient sparse spiking auto-encoder for reconstruction, denoising and classification
    Walters, Ben
    Kalatehbali, Hamid Rahimian
    Cai, Zhengyu
    Genov, Roman
    Amirsoleimani, Amirali
    Eshraghian, Jason
    Azghadi, Mostafa Rahimi
    [J]. NEUROMORPHIC COMPUTING AND ENGINEERING, 2024, 4 (03):
  • [8] Tobacco leaf maturity classification based on sparse auto-encoder
    Wang, Jie
    Jia, Yuheng
    Zhao, Xin
    [J]. Tobacco Science and Technology, 2014, (09): : 18 - 22
  • [9] Locality-Constrained Sparse Auto-Encoder for Image Classification
    Luo, Wei
    Yang, Jian
    Xu, Wei
    Fu, Tao
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (08) : 1070 - 1073
  • [10] Convolutional sparse auto-encoder for image super-resolution reconstruction
    Zhang X.
    Zhou W.
    Duan Z.
    Wei H.
    [J]. Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2019, 48 (01):