Analysis of multiple SNPs in genetic association studies: Comparison of three multi-locus methods to prioritize and select SNPs

被引:34
|
作者
Heidema, A. Geert [1 ,2 ]
Feskens, Edith J. M. [3 ]
Doevendans, Pieter A. F. M. [4 ]
Ruven, Henk J. T. [5 ]
Van Houwelingen, Hans C. [1 ,2 ,6 ]
Mariman, Edwin C. M.
Boer, Jolanda M. A. [1 ,2 ]
机构
[1] Maastricht Univ, Dept Human Biol, NL-6200 MD Maastricht, Netherlands
[2] Natl Inst Publ Hlth & Environm, Ctr Nutr & Hlth, NL-3720 BA Bilthoven, Netherlands
[3] Univ Wageningen & Res Ctr, Div Human Nutr, Wageningen, Netherlands
[4] Univ Utrecht, Med Ctr, Heart Lung Ctr Utrecht, Utrecht, Netherlands
[5] St Antonius Hosp, Dept Clin Chem, Nieuwegein, Netherlands
[6] Leiden Univ, Med Ctr, Dept Med Stat, Leiden, Netherlands
关键词
multi-locus methods; set association; random forests; MDR;
D O I
10.1002/gepi.20251
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Nonparametric approaches have been developed that are able to analyze large numbers of single nucleoticle polymorphisms (SNPs) in modest sample sizes. These approaches have different selection features and may not provide similar results when applied to the same dataset. Therefore, we compared the results of three approaches (set association, random forests and multifactor dimensionality reduction [MDR]) to select from a total of 93 candidate SNPs a subset of SNPs that are important in determining high-density lipoprotein (HDL)-cholesterol levels. The study population consisted of a random sample from a Dutch monitoring project for cardiovascular disease risk factors and was dichotomized into cases (low HDL-cholesterol, n = 533) and non-cases (high HDL-cholesterol, n = 545) based on gender-specific median values for HDL cholesterol. Clearly, all three approaches prioritized three SNPs as important (CETP Taq1B, CETP-629 C/A and LPL Ser447X). Two SNPs with weaker main effects were additionally prioritized by random forests (APOC3 3175 G/C and CCR2 Va162Ile), whereas MTHFR 677 C/T was selected in combination with CETP Taq1B as best model by MDR. Obtained p-values for the selected models were significant for the set association approach (p =.0019), random forests (p <.01) and MDR (p <.02). In conclusion, the application of a combination of multi-locus methods is a useful approach in genetic association studies to select a well-defined set of important SNPs for further statistical and epidemiological interpretation, providing increased confidence and more information compared with the application of only one method.
引用
收藏
页码:910 / 921
页数:12
相关论文
共 47 条
  • [1] Prioritize and select SNPs for association studies with multi-stage designs
    Li, Jing
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2008, 15 (03) : 241 - 257
  • [2] Application of three approaches to select relevant SNPs in genetic association studies
    Heidema, A. G.
    Feskens, E. J. M.
    van Houwe-Lingen, J. C.
    Doevendans, P. A. F. M.
    Mariman, E. C. M.
    Boer, J. M. A.
    GENETIC EPIDEMIOLOGY, 2007, 31 (05) : 450 - 450
  • [3] A Novel Method to Select Informative SNPs and Their Application in Genetic Association Studies
    Liao, Bo
    Li, Xiong
    Zhu, Wen
    Cao, Zhi
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (05) : 1529 - 1534
  • [4] Transferability of tag SNPs in genetic association studies in multiple populations
    Paul I W de Bakker
    Noël P Burtt
    Robert R Graham
    Candace Guiducci
    Roman Yelensky
    Jared A Drake
    Todd Bersaglieri
    Kathryn L Penney
    Johannah Butler
    Stanton Young
    Robert C Onofrio
    Helen N Lyon
    Daniel O Stram
    Christopher A Haiman
    Matthew L Freedman
    Xiaofeng Zhu
    Richard Cooper
    Leif Groop
    Laurence N Kolonel
    Brian E Henderson
    Mark J Daly
    Joel N Hirschhorn
    David Altshuler
    Nature Genetics, 2006, 38 : 1298 - 1303
  • [5] Transferability of tag SNPs in genetic association studies in multiple populations
    de Bakker, Paul I. W.
    Burtt, Noel P.
    Graham, Robert R.
    Guiducci, Candace
    Yelensky, Roman
    Drake, Jared A.
    Bersaglieri, Todd
    Penney, Kathryn L.
    Butler, Johannah
    Young, Stanton
    Onofrio, Robert C.
    Lyon, Helen N.
    O Stram, Daniel
    Haiman, Christopher A.
    Freedman, Matthew L.
    Zhu, Xiaofeng
    Cooper, Richard
    Groop, Leif
    Kolonel, Laurence N.
    Henderson, Brian E.
    Daly, Mark J.
    Hirschhorn, Joel N.
    Altshuler, David
    NATURE GENETICS, 2006, 38 (11) : 1298 - 1303
  • [6] How to Select Tag SNPs in Genetic Association Studies? The CLONTagger Method with Parameter Optimization
    Ilhan, Ilhan
    Tezel, Gulay
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2013, 17 (07) : 368 - 383
  • [7] Meta-Analysis of Genetic Association Studies and Adjustment for Multiple Testing of Correlated SNPs and Traits
    Conneely, Karen N.
    Boehnke, Michael
    GENETIC EPIDEMIOLOGY, 2010, 34 (07) : 739 - 746
  • [8] A Joint Association Test for Multiple SNPs in Genetic Case-Control Studies
    Wang, Tao
    Jacob, Howard
    Ghosh, Soumitra
    Wang, Xujing
    Zeng, Zhao-Bang
    GENETIC EPIDEMIOLOGY, 2009, 33 (02) : 151 - 163
  • [9] Genetic Association Analysis and Meta-Analysis of Imputed SNPs in Longitudinal Studies
    Subirana, Isaac
    Gonzalez, Juan R.
    GENETIC EPIDEMIOLOGY, 2013, 37 (05) : 465 - 477
  • [10] Discrimination Between Correlated SNPs in Genetic Association Studies: Comparison Between Case-Control and Familial Studies
    Dandine-Roulland, Claire
    Clerget-Darpoux, Francoise
    Perdry, Herve
    HUMAN HEREDITY, 2013, 76 (02) : 102 - 102