Principal Component Regression and Linear Mixed Model in Association Analysis of Structured Samples: Competitors or Complements?

被引：31

作者：

Zhang, Yiwei ^{[1
]}

Pan, Wei ^{[1
]}

机构：

[1] Univ Minnesota, Sch Publ Hlth, Div Biostat, Minneapolis, MN 55455 USA

来源：

GENETIC EPIDEMIOLOGY | 2015年 / 39卷 / 03期

基金：

欧洲研究理事会;

关键词：

association testing; confounding; environmental risk; population stratification; probabilistic principal component analysis; POPULATION STRATIFICATION; VARIANTS; SCALE;

D O I：

10.1002/gepi.21879

中图分类号：

Q3 [遗传学];

学科分类号：

071007 ; 090102 ;

摘要：

Genome-wide association studies (GWAS) have been established as a major tool to identify genetic variants associated with complex traits, such as common diseases. However, GWAS may suffer from false positives and false negatives due to confounding population structures, including known or unknown relatedness. Another important issue is unmeasured environmental risk factors. Among many methods for adjusting for population structures, two approaches stand out: one is principal component regression (PCR) based on principal component analysis, which is perhaps the most popular due to its early appearance, simplicity, and general effectiveness; the other is based on a linear mixed model (LMM) that has emerged recently as perhaps the most flexible and effective, especially for samples with complex structures as in model organisms. As shown previously, the PCR approach can be regarded as an approximation to an LMM; such an approximation depends on the number of the top principal components (PCs) used, the choice of which is often difficult in practice. Hence, in the presence of population structure, the LMM appears to outperform the PCR method. However, due to the different treatments of fixed vs. random effects in the two approaches, we show an advantage of PCR over LMM: in the presence of an unknown but spatially confined environmental confounder (e.g., environmental pollution or lifestyle), the PCs may be able to implicitly and effectively adjust for the confounder whereas the LMM cannot. Accordingly, to adjust for both population structures and nongenetic confounders, we propose a hybrid method combining the use and, thus, strengths of PCR and LMM. We use real genotype data and simulated phenotypes to confirm the above points, and establish the superior performance of the hybrid method across all scenarios.

引用

页码：149 / 155

页数：7

共 50 条

[21] Linear Principal Component Discriminant Analysis
Pei, Yan
2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 2108 - 2113
[22] Face Expression Recognition Based on Equable Principal Component Analysis and Linear Regression Classification
Zhu, Yani
Li, Xiaoxin
Wu, Guohua
2016 3RD INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2016, : 876 - 880
[23] ADAPTIVE FUNCTIONAL LINEAR REGRESSION VIA FUNCTIONAL PRINCIPAL COMPONENT ANALYSIS AND BLOCK THRESHOLDING
Cai, T. Tony
Zhang, Linjun
Zhou, Harrison H.
STATISTICA SINICA, 2018, 28 (04) : 2455 - 2468
[24] Predicting pellet quality using multiple linear regression with Principal Component Analysis (PCA)
You, Jihao
Tulpan, Dan
Ellis, Jennifer L.
JOURNAL OF ANIMAL SCIENCE, 2024, 102
[25] Predicting pellet quality using multiple linear regression with Principal Component Analysis (PCA)
You, Jihao
Tulpan, Dan
Ellis, Jennifer L.
JOURNAL OF ANIMAL SCIENCE, 2024, 102 : 154 - 155
[26] Common principle for the decomposition of the total variability in principal component analysis and in linear regression - Solution
Rolle, JD
ECONOMETRIC THEORY, 2000, 16 (06) : 1044 - 1045
[27] PRINCIPAL COMPONENT ESTIMATORS IN REGRESSION-ANALYSIS
CHENG, DC
IGLARSH, HJ
REVIEW OF ECONOMICS AND STATISTICS, 1976, 58 (02) : 229 - 234
[28] POLAROGRAPHIC ANALYSIS OF PYRAZINES BY PRINCIPAL COMPONENT REGRESSION
Yong Nian NI Department of Chemistry
Mark Selby and Mark Hodgkinson School of Chemistry
Chinese Chemical Letters, 1992, (09) : 721 - 722
[29] Application of Principal Component Regression Analysis in Economic Analysis
Chen Ming-ming
Ma Jing-lian
PROCEEDINGS OF THE 2015 3RD INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE, EDUCATION TECHNOLOGY, ARTS, SOCIAL SCIENCE AND ECONOMICS (MSETASSE 2015), 2015, 41 : 1205 - 1208
[30] On the Principal Component Liu-type Estimator in Linear Regression
Wu, Jibo
Yang, Hu
COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2015, 44 (08) : 2061 - 2072

← 1 2 3 4 5 →