Multiview Regularized Discriminant Canonical Correlation Analysis: Sequential Extraction of Relevant Features From Multiblock Data

被引:3
|
作者
Mandal, Ankita [1 ]
Maji, Pradipta [1 ]
机构
[1] Indian Stat Inst, Machine Intelligence Unit, Biomed Imaging & Bioinformat Lab, Kolkata 700108, India
关键词
Feature extraction; Correlation; Covariance matrices; Data mining; Data analysis; Optimization; Statistical analysis; Canonical correlation analysis (CCA); feature extraction; multimodal data analysis; ridge regression optimization; SETS;
D O I
10.1109/TCYB.2022.3155875
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the important issues associated with real-life high-dimensional data analysis is how to extract significant and relevant features from multiview data. The multiset canonical correlation analysis (MCCA) is a well-known statistical method for multiview data integration. It finds a linear subspace that maximizes the correlations among different views. However, the existing methods to find the multiset canonical variables are computationally very expensive, which restricts the application of the MCCA in real-life big data analysis. The covariance matrix of each high-dimensional view may also suffer from the singularity problem due to the limited number of samples. Moreover, the MCCA-based existing feature extraction algorithms are, in general, unsupervised in nature. In this regard, a new supervised feature extraction algorithm is proposed, which integrates multimodal multidimensional data sets by solving maximal correlation problem of the MCCA. A new block matrix representation is introduced to reduce the computational complexity for computing the canonical variables of the MCCA. The analytical formulation enables efficient computation of the multiset canonical variables under supervised ridge regression optimization technique. It deals with the ``curse of dimensionality'' problem associated with high-dimensional data and facilitates the sequential generation of relevant features with significantly lower computational cost. The effectiveness of the proposed multiblock data integration algorithm, along with a comparison with other existing methods, is demonstrated on several benchmark and real-life cancer data.
引用
收藏
页码:5497 / 5509
页数:13
相关论文
共 44 条
  • [21] Extraction of correlated gene clusters from multiple genomic data by generalized kernel canonical correlation analysis
    Yamanishi, Y.
    Vert, J. -P.
    Nakaya, A.
    Kanehisa, M.
    BIOINFORMATICS, 2003, 19 : i323 - i330
  • [22] Sparse regularized regression identifies behaviorally-relevant stimulus features from psychophysical data
    Schoenfelder, Vinzenz H.
    Wichmann, Felix A.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (05): : 3953 - 3969
  • [23] HIGHLIGHTING RELATIONSHIPS BETWEEN HETEROGENEOUS BIOLOGICAL DATA THROUGH GRAPHICAL DISPLAYS BASED ON REGULARIZED CANONICAL CORRELATION ANALYSIS
    Gonzalez, I.
    Dejean, S.
    Martin, P. G. P.
    Goncalves, O.
    Besse, P.
    Baccini, A.
    JOURNAL OF BIOLOGICAL SYSTEMS, 2009, 17 (02) : 173 - 199
  • [24] A Tutorial on Multiblock Discriminant Correspondence Analysis (MUDICA): A New Method for Analyzing Discourse Data From Clinical Populations
    Williams, Lynne J.
    Abdi, Herve
    French, Rebecca
    Orange, Joseph B.
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2010, 53 (05): : 1372 - 1393
  • [25] ACQUISITION, SPECTRAL ANALYSIS AND EXTRACTION OF CLINICALLY RELEVANT FEATURES FROM SLOW-WAVE EEGS
    GOTMAN, J
    GLOOR, P
    ELECTROENCEPHALOGRAPHY AND CLINICAL NEUROPHYSIOLOGY, 1973, 34 (07): : 748 - 748
  • [26] Data-Driven Distributed Local Fault Detection for Large-Scale Processes Based on the GA-Regularized Canonical Correlation Analysis
    Jiang, Qingchao
    Ding, Steven X.
    Wang, Yang
    Yan, Xuefeng
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2017, 64 (10) : 8148 - 8157
  • [27] Estimation of customer questionnaire responses from purchase transaction data using canonical correlation analysis
    Sano, Natsuki
    Kimura, Fuminori
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS, 2017, 112 : 1855 - 1862
  • [28] Calibration transfer of near-infrared spectra for extraction of informative components from spectra with canonical correlation analysis
    Zheng, Kaiyi
    Zhang, Xuan
    Iqbal, Jibran
    Fan, Wei
    Wu, Ting
    Du, Yiping
    Liang, Yizeng
    JOURNAL OF CHEMOMETRICS, 2014, 28 (10) : 773 - 784
  • [29] A Generalized Canonical Correlation Analysis Based Method for Blind Source Separation from Related Data Sets
    Karhunen, Juha
    Hao, Tele
    Ylipaavalniemi, Jarkko
    2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [30] 3D and infrared face reconstruction from RGB data using Canonical Correlation Analysis
    Reiter, Michael
    Donner, Rene
    Langs, Georg
    Bischof, Horst
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2006, : 425 - +