Penalized linear mixed models for structured genetic data

被引:1
|
作者
Reisetter, Anna C. [1 ]
Breheny, Patrick [1 ]
机构
[1] Univ Iowa, Dept Biostat, Iowa City, IA 52242 USA
关键词
confounding; lasso; linear mixed model; penalized regression; population stratification; POPULATION-STRUCTURE; CRYPTIC RELATEDNESS; ASSOCIATION ANALYSIS; VARIABLE SELECTION; HAN CHINESE; STRATIFICATION; HERITABILITY; REGRESSION; ACCOUNT;
D O I
10.1002/gepi.22384
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Many genetic studies that aim to identify genetic variants associated with complex phenotypes are subject to unobserved confounding factors arising from environmental heterogeneity. This poses a challenge to detecting associations of interest and is known to induce spurious associations when left unaccounted for. Penalized linear mixed models (LMMs) are an attractive method to correct for unobserved confounding. These methods correct for varying levels of relatedness and population structure by modeling it as a random effect with a covariance structure estimated from observed genetic data. Despite an extensive literature on penalized regression and LMMs separately, the two are rarely discussed together. The aim of this review is to do so while examining the statistical properties of penalized LMMs in the genetic association setting. Specifically, the ability of penalized LMMs to accurately estimate genetic effects in the presence of environmental confounding has not been well studied. To clarify the important yet subtle distinction between population structure and environmental heterogeneity, we present a detailed review of relevant concepts and methods. In addition, we evaluate the performance of penalized LMMs and competing methods in terms of estimation and selection accuracy in the presence of a number of confounding structures.
引用
收藏
页码:427 / 444
页数:18
相关论文
共 50 条
  • [1] Linear mixed models and penalized least squares
    Bates, DM
    DebRoy, S
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2004, 91 (01) : 1 - 17
  • [2] Efficient penalized generalized linear mixed models for variable selection and genetic risk prediction in high-dimensional data
    St-Pierre, Julien
    Oualkacha, Karim
    Bhatnagar, Sahir Rai
    [J]. BIOINFORMATICS, 2023, 39 (02)
  • [3] Genetic Mixed Linear Models for Twin Survival Data
    Il Do Ha
    Youngjo Lee
    Yudi Pawitan
    [J]. Behavior Genetics, 2007, 37 : 621 - 630
  • [4] Genetic mixed linear models for twin survival data
    Ha, Il Do
    Lee, Youngjo
    Pawitan, Yudi
    [J]. BEHAVIOR GENETICS, 2007, 37 (04) : 621 - 630
  • [5] glmmPen: High Dimensional Penalized Generalized Linear Mixed Models
    Heiling, Hillary M.
    Rashid, Naim U.
    Li, Quefeng
    Ibrahim, Joseph G.
    [J]. R JOURNAL, 2023, 15 (04): : 106 - 128
  • [6] Polygenic Risk Scores Based on Penalized Generalized Linear Mixed Models
    St-Pierre, Julien
    [J]. GENETIC EPIDEMIOLOGY, 2022, 46 (07) : 535 - 535
  • [7] On the use of a penalized quasilikelihood information criterion for generalized linear mixed models
    Hui, Francis K. C.
    [J]. BIOMETRIKA, 2021, 108 (02) : 353 - 365
  • [8] Nonconcave penalized estimation for partially linear models with longitudinal data
    Yang, Yiping
    Li, Gaorong
    Lian, Heng
    [J]. STATISTICS, 2016, 50 (01) : 43 - 59
  • [9] Penalized empirical likelihood for generalized linear models with longitudinal data
    Tan, Xiaoyan
    Yan, Li
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2021, 50 (02) : 608 - 623
  • [10] Penalized contrast estimation in functional linear models with circular data
    Brunel, E.
    Roche, A.
    [J]. STATISTICS, 2015, 49 (06) : 1298 - 1321