Sparse reduced-rank regression for integrating omics data

Cited by: 3
Authors
Hilafu, Haileab [1]
Safo, Sandra E. [2]
Haine, Lillian [2]
Affiliations
[1] Univ Tennessee, Dept Business Analyt & Stat, Knoxville, TN 37996 USA
[2] Univ Minnesota, Div Biostat, Minneapolis, MN 55455 USA
Keywords
Integrative analysis; Multi-view data; Reduced rank regression; High dimensional data; Simultaneous dimension reduction; Genomics; Disease; Metabolomics; Estimators; Biomarkers; Selection; Matrix
DOI
10.1186/s12859-020-03606-2
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: The problem of assessing associations between multiple omics data types, including genomics and metabolomics data, to identify biomarkers potentially predictive of complex diseases has garnered considerable research interest. A popular approach in epidemiology is to model the association of each predictor with each response using a univariate linear regression model, and to select the predictors that meet an a priori specified significance level. Although this approach is simple and intuitive, it tends to require a larger sample size, which is costly. It also assumes that the variables for each data type are independent, and thus ignores correlations that exist between variables both within each data type and across data types.
Results: We consider a multivariate linear regression model that relates multiple predictors to multiple responses in order to identify multiple relevant predictors that are simultaneously associated with the responses. We assume the coefficient matrix of the responses on the predictors is both row-sparse and of low rank, and propose a group Dantzig-type formulation to estimate the coefficient matrix.
Conclusion: Extensive simulations demonstrate the competitive performance of our proposed method when compared to existing methods in terms of estimation, prediction, and variable selection. We use the proposed method to integrate genomics and metabolomics data to identify genetic variants that are potentially predictive of atherosclerotic cardiovascular disease (ASCVD) beyond well-established risk factors. Our analysis identifies some genetic variants that improve prediction of ASCVD beyond some well-established risk factors for ASCVD, and also suggests a potential utility of the identified genetic variants in explaining possible associations between certain metabolites and ASCVD.
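To make the model structure in the abstract concrete, the following is a minimal sketch of a multivariate regression Y = XB + E whose coefficient matrix B is both row-sparse and of low rank. It does not implement the authors' group Dantzig-type estimator; it fits the classical reduced-rank regression estimator (ordinary least squares followed by a rank-truncated projection) as a stand-in, and only inspects row norms instead of performing variable selection. The function names, dimensions, noise level, and seed are all assumptions chosen for illustration.

```python
import numpy as np


def simulate(n=200, p=10, q=6, rank=2, active_rows=3, noise=0.1, seed=0):
    """Simulate Y = X B + E with a row-sparse, low-rank B (illustrative sizes)."""
    rng = np.random.default_rng(seed)
    X = rng.standard_normal((n, p))
    B = np.zeros((p, q))
    # Only the first `active_rows` predictors are relevant (row sparsity),
    # and those rows are a product of two thin factors (rank <= `rank`).
    U = rng.standard_normal((active_rows, rank))
    V = rng.standard_normal((q, rank))
    B[:active_rows] = U @ V.T
    Y = X @ B + noise * rng.standard_normal((n, q))
    return X, Y, B


def reduced_rank_fit(X, Y, rank):
    """Classical reduced-rank regression (identity weighting), not the paper's method."""
    # Step 1: ordinary least-squares coefficients (requires n > p here).
    B_ols, *_ = np.linalg.lstsq(X, Y, rcond=None)
    # Step 2: project onto the top right singular vectors of the fitted values.
    _, _, Vt = np.linalg.svd(X @ B_ols, full_matrices=False)
    V_r = Vt[:rank].T
    return B_ols @ V_r @ V_r.T


X, Y, B_true = simulate()
B_hat = reduced_rank_fit(X, Y, rank=2)

# Row norms of the estimate: relevant predictors should stand out,
# although this estimator does not set any row exactly to zero.
row_norms = np.linalg.norm(B_hat, axis=1)
print(np.round(row_norms, 3))
```

Because the classical estimator imposes only the rank constraint, every row of the estimate is generally nonzero; a row-sparse formulation such as the one proposed in the abstract is what would set irrelevant rows to zero and thereby select predictors.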
Pages: 17