A penalized linear mixed model with generalized method of moments for prediction analysis on high-dimensional multi-omics data

被引:3
|
作者
Wang, Xiaqiong [1 ]
Wen, Yalu [1 ]
机构
[1] Univ Auckland, Dept Stat, 38 Princes St, Auckland 1010, New Zealand
关键词
generalized method of moments; high dimensionality; penalized linear mixed models; risk prediction; ALZHEIMERS-DISEASE; INTEGRATIVE ANALYSIS; RISK PREDICTION; ASSOCIATION; GENOTYPE; POLYMORPHISM; GENE; PHENOTYPES; PROGRESS; TRAITS;
D O I
10.1093/bib/bbac193
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
With the advances in high-throughput biotechnologies, high-dimensional multi-layer omics data become increasingly available. They can provide both confirmatory and complementary information to disease risk and thus have offered unprecedented opportunities for risk prediction studies. However, the high-dimensionality and complex inter/intra-relationships among multi-omics data have brought tremendous analytical challenges. Here we present a computationally efficient penalized linear mixed model with generalized method of moments estimator (MpLMMGMM) for the prediction analysis on multi-omics data. Our method extends the widely used linear mixed model proposed for genomic risk predictions to model multi-omics data, where kernel functions are used to capture various types of predictive effects from different layers of omics data and penalty terms are introduced to reduce the impact of noise. Compared with existing penalized linear mixed models, the proposed method adopts the generalized method of moments estimator and it is much more computationally efficient. Through extensive simulation studies and the analysis of positron emission tomography imaging outcomes, we have demonstrated that MpLMMGMM can simultaneously consider a large number of variables and efficiently select those that are predictive from the corresponding omics layers. It can capture both linear and nonlinear predictive effects and achieves better prediction performance than competing methods.
引用
下载
收藏
页数:11
相关论文
共 50 条
  • [21] Lasso penalized model selection criteria for high-dimensional multivariate linear regression analysis
    Katayama, Shota
    Imori, Shinpei
    JOURNAL OF MULTIVARIATE ANALYSIS, 2014, 132 : 138 - 150
  • [22] Statistical analysis for a penalized EM algorithm in high-dimensional mixture linear regression model
    Wang, Ning
    Zhang, Xin
    Mai, Qing
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
  • [23] High-dimensional generalized median adaptive lasso with application to omics data
    Liu, Yahang
    Gao, Qian
    Wei, Kecheng
    Huang, Chen
    Wang, Ce
    Yu, Yongfu
    Qin, Guoyou
    Wang, Tong
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (02)
  • [24] Homogeneity detection for the high-dimensional generalized linear model
    Jeon, Jong-June
    Kwon, Sunghoon
    Choi, Hosik
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2017, 114 : 61 - 74
  • [25] Penalized regression calibration: A method for the prediction of survival outcomes using complex longitudinal and high-dimensional data
    Signorelli, Mirko
    Spitali, Pietro
    Szigyarto, Cristina Al-Khalili
    Tsonaka, Roula
    STATISTICS IN MEDICINE, 2021, 40 (27) : 6178 - 6196
  • [26] Double penalized variable selection for high-dimensional partial linear mixed effects models
    Yang, Yiping
    Luo, Chuanqin
    Yang, Weiming
    JOURNAL OF MULTIVARIATE ANALYSIS, 2024, 204
  • [27] Statistical quality control analysis of high-dimensional omics data
    Kim, Yongkang
    Kim, Gyu-Tae
    Kwon, Min-Seok
    Park, Taesung
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2017, 18 (03) : 210 - 222
  • [28] Multiset sparse redundancy analysis for high-dimensional omics data
    Csala, Attila
    Hof, Michel H.
    Zwinderman, Aeilko H.
    BIOMETRICAL JOURNAL, 2019, 61 (02) : 406 - 423
  • [29] Generalized Bayesian Factor Analysis for Integrative Clustering with Applications to Multi-Omics Data
    Min, Eun Jeong
    Chang, Changgee
    Long, Qi
    2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, : 109 - 119
  • [30] Generalized autoregressive linear models for discrete high-dimensional data
    Pandit P.
    Sahraee-Ardakan M.
    Amini A.A.
    Rangan S.
    Fletcher A.K.
    IEEE Journal on Selected Areas in Information Theory, 2020, 1 (03): : 884 - 896