Nonlinear Sparse Component Analysis with a Reference: Variable Selection in Genomics and Proteomics

被引:0
|
作者
Kopriva, Ivica [1 ]
Kapitanovic, Sanja [2 ]
Cacev, Tamara [2 ]
机构
[1] Rudjer Boskovic Inst, Div Laser & Atom R&D, Zagreb 10000, Croatia
[2] Rudjer Boskovic Inst, Div Mol Med, Zagreb 10000, Croatia
关键词
Variable selection; Nonlinear mixture model; Empirical kernel maps; Sparse component analysis; CANCER; CLASSIFICATION; ALGORITHMS; PATTERNS; DISCOVERY; SERUM;
D O I
10.1007/978-3-319-22482-4_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many scenarios occurring in genomics and proteomics involve small number of labeled data and large number of variables. To create prediction models robust to overfitting variable selection is necessary. We propose variable selection method using nonlinear sparse component analysis with a reference representing either negative (healthy) or positive (cancer) class. Thereby, component comprised of cancer related variables is automatically inferred from the geometry of nonlinear mixture model with a reference. Proposed method is compared with 3 supervised and 2 unsupervised variable selection methods on two-class problems using 2 genomic and 2 proteomic datasets. Obtained results, which include analysis of biological relevance of selected genes, are comparable with those achieved by supervised methods. Thus, proposed method can possibly perform better on unseen data of the same cancer type.
引用
收藏
页码:168 / 175
页数:8
相关论文
共 50 条
  • [1] Sparse supervised principal component analysis (SSPCA) for dimension reduction and variable selection
    Sharifzadeh, Sara
    Ghodsi, Ali
    Clemmensen, Line H.
    Ersboll, Bjarne K.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 65 : 168 - 177
  • [2] Leveraging pleiotropic association using sparse group variable selection in genomics data
    Matthew Sutton
    Pierre-Emmanuel Sugier
    Therese Truong
    Benoit Liquet
    BMC Medical Research Methodology, 22
  • [3] Leveraging pleiotropic association using sparse group variable selection in genomics data
    Sutton, Matthew
    Sugier, Pierre-Emmanuel
    Truong, Therese
    Liquet, Benoit
    BMC MEDICAL RESEARCH METHODOLOGY, 2022, 22 (01)
  • [4] Sparse variable principal component analysis with application to fMRI
    Ulfarsson, Magnus O.
    Solo, Victor
    2007 4TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING : MACRO TO NANO, VOLS 1-3, 2007, : 460 - +
  • [5] SPARSE PRINCIPAL COMPONENT ANALYSIS VIA VARIABLE PROJECTION
    Erichson, N. Benjamin
    Zheng, Peng
    Manohar, Krithika
    Brunton, Steven L.
    Kutz, J. Nathan
    Aravkin, Aleksandr Y.
    SIAM JOURNAL ON APPLIED MATHEMATICS, 2020, 80 (02) : 977 - 1002
  • [6] NATIONAL REFERENCE CENTRE FOR GENOMICS AND PROTEOMICS - MACPROGEN
    Plaseska-Karanfilska, D.
    BALKAN JOURNAL OF MEDICAL GENETICS, 2012, 15 : 9 - 12
  • [7] Sparse Regression in Cancer Genomics: Comparing Variable Selection and Predictions in Real World Data
    O'Shea, Robert J.
    Tsoka, Sophia
    Cook, Gary J. R.
    Goh, Vicky
    CANCER INFORMATICS, 2021, 20
  • [8] Clustering and feature selection using sparse principal component analysis
    Ronny Luss
    Alexandre d’Aspremont
    Optimization and Engineering, 2010, 11 : 145 - 157
  • [9] Clustering and feature selection using sparse principal component analysis
    Luss, Ronny
    d'Aspremont, Alexandre
    OPTIMIZATION AND ENGINEERING, 2010, 11 (01) : 145 - 157
  • [10] Improving independent component analysis performances by variable selection
    Vrins, F
    Lee, JA
    Verleysen, M
    Vigneron, V
    Jutten, C
    2003 IEEE XIII WORKSHOP ON NEURAL NETWORKS FOR SIGNAL PROCESSING - NNSP'03, 2003, : 359 - 368