Pairwise-Covariance Multi-view Discriminant Analysis for Robust Cross-View Human Action Recognition

被引:3
|
作者
Tran, Hoang-Nhat [1 ]
Nguyen, Hong-Quan [1 ,2 ]
Doan, Huong-Giang [3 ]
Tran, Thanh-Hai [1 ,4 ]
Le, Thi-Lan [1 ,4 ]
Vu, Hai [1 ,4 ]
机构
[1] Hanoi Univ Sci & Technol, Internat Res Inst MICA, Hanoi 10000, Vietnam
[2] Viet Hung Univ, Fac Informat Technol, Dept Informat Technol, Hanoi 10000, Vietnam
[3] Elect Power Univ, Fac Control & Automat, Dept Measurement Engn, Hanoi 10000, Vietnam
[4] Hanoi Univ Sci & Technol, Sch Elect & Telecommun, Hanoi 10000, Vietnam
来源
IEEE ACCESS | 2021年 / 9卷
关键词
Feature extraction; Training; Three-dimensional displays; Neural networks; Cameras; Deep learning; Correlation; Multi-view analysis; action recognition; deep learning; cross-view recognition;
D O I
10.1109/ACCESS.2021.3082142
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human action recognition (HAR) under different camera viewpoints is the most critical requirement for practical deployment. In this paper, we propose a novel method that leverages successful deep learning-based features for action representation and multi-view analysis to accomplish robust HAR under viewpoint changes. Specifically, we investigate various deep learning techniques, from 2D CNNs to 3D CNNs to capture spatial and temporal characteristics of actions at each separated camera view. A common feature space is then constructed to keep view-invariant features among extracted streams. This is carried out by learning a set of linear transformations that project private features into the common space in which the classes are well distinguished from each other. To this end, we first adopt Multi-view Discriminant Analysis (MvDA). The original MvDA suffers from odd situations in which the most class-discrepant common space could not be found because its objective is overly concentrated on pushing classes from the global mean but unaware of the distance between specific pairs of adjoining classes. We then introduce a pairwise-covariance maximizing extension that takes pairwise distances between classes into account, namely pc-MvDA. The novel method also differs in the way that could be more favorably applied for large high-dimensional multi-view datasets. Extensive experimental results on four datasets (IXMAS, MuHAVi, MICAGes, NTU RGB+D) show that pc-MvDA achieves consistent performance gain, especially for harder classes. The code is publicly available for research purpose at https://github.com/inspiros/pcmvda.
引用
收藏
页码:76097 / 76111
页数:15
相关论文
共 50 条
  • [1] Multi-view common component discriminant analysis for cross-view classification
    You, Xinge
    Xu, Jiamiao
    Yuan, Wei
    Jing, Xiao-Yuan
    Tao, Dacheng
    Zhang, Taiping
    [J]. PATTERN RECOGNITION, 2019, 92 : 37 - 51
  • [2] Heterogeneous discriminant analysis for cross-view action recognition
    Sui, Wanchen
    Wu, Xinxiao
    Feng, Yang
    Jia, Yunde
    [J]. NEUROCOMPUTING, 2016, 191 : 286 - 295
  • [3] Multi-view Discriminant Analysis with Tensor Representation and Its Application to Cross-view Gait Recognition
    Makihara, Yasushi
    Al Mansur
    Muramatsu, Daigo
    Uddin, Zasim
    Yagi, Yasushi
    [J]. 2015 11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), VOL. 1, 2015,
  • [4] Heterogeneous Discriminant Analysis for Cross-View Action Recognition
    Sui, Wanchen
    Wu, Xinxiao
    Feng, Yang
    Liang, Wei
    Jia, Yunde
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2015, PT IV, 2015, 9492 : 566 - 573
  • [5] Multi-View Gait Image Generation for Cross-View Gait Recognition
    Chen, Xin
    Luo, Xizhao
    Weng, Jian
    Luo, Weiqi
    Li, Huiting
    Tian, Qi
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3041 - 3055
  • [6] Robust multi-view discriminant analysis with view-consistency
    Yang, Xiang-Fei
    Li, Chun-Na
    Shao, Yuan-Hai
    [J]. INFORMATION SCIENCES, 2022, 596 : 153 - 168
  • [7] VIEW-INDEPENDENT HUMAN ACTION RECOGNITION BASED ON MULTI-VIEW ACTION IMAGES AND DISCRIMINANT LEARNING
    Iosifidis, Alexandros
    Tefas, Anastasios
    Pitas, Ioannis
    [J]. 2013 IEEE 11TH IVMSP WORKSHOP: 3D IMAGE/VIDEO TECHNOLOGIES AND APPLICATIONS (IVMSP 2013), 2013,
  • [8] Discriminative Multi-View Dynamic Image Fusion for Cross-View 3-D Action Recognition
    Wang, Yancheng
    Xiao, Yang
    Lu, Junyi
    Tan, Bo
    Cao, Zhiguo
    Zhang, Zhenjun
    Zhou, Joey Tianyi
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5332 - 5345
  • [9] MULTI-TASK LINEAR DISCRIMINANT ANALYSIS FOR MULTI-VIEW ACTION RECOGNITION
    Yan, Yan
    Liu, Gaowen
    Ricci, Elisa
    Sebe, Nicu
    [J]. 2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 2837 - 2841
  • [10] Multi-view Deep Network for Cross-view Classification
    Kan, Meina
    Shan, Shiguang
    Chen, Xilin
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4847 - 4855