Adaptive Sparse Multi-Block PLS Discriminant Analysis: An Integrative Method for Identifying Key Biomarkers from Multi-Omics Data

被引:2
|
作者
Zhang, Runzhi [1 ]
Datta, Susmita [1 ]
机构
[1] Univ Florida, Dept Biostat, Gainesville, FL 32603 USA
关键词
data integration; multi-omics; asmbPLS-DA; classification; BREAST-CANCER CELLS; VARIABLE SELECTION; EXPRESSION; FAMILY; OVEREXPRESSION; REGULARIZATION; PROLIFERATION; METASTASIS; MIGRATION; TARGET;
D O I
10.3390/genes14050961
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
With the growing use of high-throughput technologies, multi-omics data containing various types of high-dimensional omics data is increasingly being generated to explore the association between the molecular mechanism of the host and diseases. In this study, we present an adaptive sparse multi-block partial least square discriminant analysis (asmbPLS-DA), an extension of our previous work, asmbPLS. This integrative approach identifies the most relevant features across different types of omics data while discriminating multiple disease outcome groups. We used simulation data with various scenarios and a real dataset from the TCGA project to demonstrate that asmbPLS-DA can identify key biomarkers from each type of omics data with better biological relevance than existing competitive methods. Moreover, asmbPLS-DA showed comparable performance in the classification of subjects in terms of disease status or phenotypes using integrated multi-omics molecular profiles, especially when combined with other classification algorithms, such as linear discriminant analysis and random forest. We have made the R package called asmbPLS that implements this method publicly available on GitHub. Overall, asmbPLS-DA achieved competitive performance in terms of feature selection and classification. We believe that asmbPLS-DA can be a valuable tool for multi-omics research.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Multi-Omics Analysis Identifying Key Biomarkers in Ovarian Cancer
    Li, Ju-Yueh
    Li, Chia-Jung
    Lin, Li-Te
    Tsui, Kuan-Hao
    CANCER CONTROL, 2020, 27 (01)
  • [2] Multi-block PLS discriminant analysis for the joint analysis of metabolomic and epidemiological data
    Marion Brandolini-Bunlon
    Mélanie Pétéra
    Pierrette Gaudreau
    Blandine Comte
    Stéphanie Bougeard
    Estelle Pujos-Guillot
    Metabolomics, 2019, 15
  • [3] Multi-block PLS discriminant analysis for the joint analysis of metabolomic and epidemiological data
    Brandolini-Bunlon, Marion
    Petera, Melanie
    Gaudreau, Pierrette
    Comte, Blandine
    Bougeard, Stephanie
    Pujos-Guillot, Estelle
    METABOLOMICS, 2019, 15 (10)
  • [4] Integrative Analysis of Multi-Omics Data Based on Blockwise Sparse Principal Components
    Park, Mira
    Kim, Doyoen
    Moon, Kwanyoung
    Park, Taesung
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2020, 21 (21) : 1 - 17
  • [5] Integrative analysis of multi-omics data for identifying multi-markers for diagnosing pancreatic cancer
    Min-Seok Kwon
    Yongkang Kim
    Seungyeoun Lee
    Junghyun Namkung
    Taegyun Yun
    Sung Gon Yi
    Sangjo Han
    Meejoo Kang
    Sun Whe Kim
    Jin-Young Jang
    Taesung Park
    BMC Genomics, 16
  • [6] Integrative analysis of multi-omics data for identifying multi-markers for diagnosing pancreatic cancer
    Kwon, Min-Seok
    Kim, Yongkang
    Lee, Seungyeoun
    Namkung, Junghyun
    Yun, Taegyun
    Yi, Sung Gon
    Han, Sangjo
    Kang, Meejoo
    Kim, Sun Whe
    Jang, Jin-Young
    Park, Taesung
    BMC GENOMICS, 2015, 16
  • [7] Sparse Overlapping Group Lasso for Integrative Multi-Omics Analysis
    Park, Heewon
    Niida, Atushi
    Miyano, Satoru
    Imoto, Seiya
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2015, 22 (02) : 73 - 84
  • [8] DIABLO: an integrative approach for identifying key molecular drivers from multi-omics assays
    Singh, Amrit
    Shannon, Casey P.
    Gautier, Benoit
    Rohart, Florian
    Vacher, Michael
    Tebbutt, Scott J.
    Le Cao, Kim-Anh
    BIOINFORMATICS, 2019, 35 (17) : 3055 - 3062
  • [9] A powerful framework for an integrative study with heterogeneous omics data: from univariate statistics to multi-block analysis
    Durufle, Harold
    Selmani, Merwann
    Ranocha, Philippe
    Jamet, Elisabeth
    Dunand, Christophe
    Dejean, Sebastien
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (03)
  • [10] Combining SO-PLS and linear discriminant analysis for multi-block classification
    Biancolillo, Alessandra
    Mage, Ingrid
    Naes, Tormod
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2015, 141 : 58 - 67