Sparse reduced-rank regression for integrating omics data

被引:3
|
作者
Hilafu, Haileab [1 ]
Safo, Sandra E. [2 ]
Haine, Lillian [2 ]
机构
[1] Univ Tennessee, Dept Business Analyt & Stat, Knoxville, TN 37996 USA
[2] Univ Minnesota, Div Biostat, Minneapolis, MN 55455 USA
关键词
Integrative analysis; Multi-view data; Reduced rank regression; High dimensional data; SIMULTANEOUS DIMENSION REDUCTION; GENOMICS; DISEASE; METABOLOMICS; ESTIMATORS; BIOMARKERS; SELECTION; MATRIX;
D O I
10.1186/s12859-020-03606-2
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background The problem of assessing associations between multiple omics data including genomics and metabolomics data to identify biomarkers potentially predictive of complex diseases has garnered considerable research interest nowadays. A popular epidemiology approach is to consider an association of each of the predictors with each of the response using a univariate linear regression model, and to select predictors that meet a priori specified significance level. Although this approach is simple and intuitive, it tends to require larger sample size which is costly. It also assumes variables for each data type are independent, and thus ignores correlations that exist between variables both within each data type and across the data types. Results We consider a multivariate linear regression model that relates multiple predictors with multiple responses, and to identify multiple relevant predictors that are simultaneously associated with the responses. We assume the coefficient matrix of the responses on the predictors is both row-sparse and of low-rank, and propose a group Dantzig type formulation to estimate the coefficient matrix. Conclusion Extensive simulations demonstrate the competitive performance of our proposed method when compared to existing methods in terms of estimation, prediction, and variable selection. We use the proposed method to integrate genomics and metabolomics data to identify genetic variants that are potentially predictive of atherosclerosis cardiovascular disease (ASCVD) beyond well-established risk factors. Our analysis shows some genetic variants that increase prediction of ASCVD beyond some well-established factors of ASCVD, and also suggest a potential utility of the identified genetic variants in explaining possible association between certain metabolites and ASCVD.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Sparse reduced-rank regression for integrating omics data
    Haileab Hilafu
    Sandra E. Safo
    Lillian Haine
    [J]. BMC Bioinformatics, 21
  • [2] Sparse reduced-rank regression with covariance estimation
    Lisha Chen
    Jianhua Z. Huang
    [J]. Statistics and Computing, 2016, 26 : 461 - 470
  • [3] Fast Algorithms for Sparse Reduced-Rank Regression
    Dubois, Benjamin
    Delmas, Jean-Francois
    Obozinski, Guillaume
    [J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [4] Sparse reduced-rank regression with covariance estimation
    Chen, Lisha
    Huang, Jianhua Z.
    [J]. STATISTICS AND COMPUTING, 2016, 26 (1-2) : 461 - 470
  • [5] Sparse reduced-rank regression for exploratory visualisation of paired multivariate data
    Kobak, Dmitry
    Bernaerts, Yves
    Weis, Marissa A.
    Scala, Federico
    Tolias, Andreas
    Berens, Philipp
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2021, 70 (04) : 980 - 1000
  • [6] Efficient Sparse Reduced-Rank Regression With Covariance Estimation
    Li, Fengpei
    Zhao, Ziping
    [J]. 2023 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP, SSP, 2023, : 46 - 50
  • [7] Robust Sparse Reduced-Rank Regression with Response Dependency
    Liu, Wenchen
    Liu, Guanfu
    Tang, Yincai
    [J]. SYMMETRY-BASEL, 2022, 14 (08):
  • [8] A Sparse Reduced-Rank Regression Approach for Hyperspectral Image Unmixing
    Giampouras, Paris V.
    Rontogiannis, Athanasios A.
    Koutroumbas, Konstantinos D.
    Themelis, Konstantinos E.
    [J]. 2015 3RD INTERNATIONAL WORKSHOP ON COMPRESSED SENSING THEORY AND ITS APPLICATION TO RADAR, SONAR, AND REMOTE SENSING (COSERA), 2015, : 139 - 143
  • [9] A fully Bayesian approach to sparse reduced-rank multivariate regression
    Yang, Dunfu
    Goh, Gyuhyeong
    Wang, Haiyan
    [J]. STATISTICAL MODELLING, 2022, 22 (03) : 199 - 220
  • [10] Rank estimation in reduced-rank regression
    Bura, E
    Cook, RD
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2003, 87 (01) : 159 - 176