Mapping complex traits using Random Forests

被引:0
|
作者
Alexandre Bureau
Josée Dupuis
Brooke Hayward
Kathleen Falls
Paul Van Eerdewegh
机构
[1] Genome Therapeutics Corporation,School of Health Sciences
[2] University of Lethbridge,Department of Biostatistics
[3] Boston University,Department of Psychiatry
[4] Harvard Medical School,undefined
来源
关键词
Candidate Gene Approach; Genetic Analysis Workshop; Predictive Error; Candidate Gene Analysis; Importance Index;
D O I
暂无
中图分类号
学科分类号
摘要
Random Forest is a prediction technique based on growing trees on bootstrap samples of data, in conjunction with a random selection of explanatory variables to define the best split at each node. In the case of a quantitative outcome, the tree predictor takes on a numerical value. We applied Random Forest to the first replicate of the Genetic Analysis Workshop 13 simulated data set, with the sibling pairs as our units of analysis and identity by descent (IBD) at selected loci as our explanatory variables. With the knowledge of the true model, we performed two sets of analyses on three phenotypes: HDL, triglycerides, and glucose. The goal was to approach the mapping of complex traits from a multivariate perspective. The first set of analyses mimics a candidate gene approach with a high proportion of true genes among the predictors while the second set represents a genome scan analysis using microsatellite markers. Random Forest was able to identify a few of the major genes influencing the phenotypes, such as baseline HDL and triglycerides, but failed to identify the major genes regulating baseline glucose levels.
引用
收藏
相关论文
共 50 条
  • [1] Mapping complex traits using Random Forests
    Bureau, A
    Dupuis, J
    Hayward, B
    Falls, K
    Van Eerdewegh, P
    [J]. BMC GENETICS, 2003, 4 (Suppl 1)
  • [2] Random forests, SNP importance and complex traits.
    Van Eerdewegh, P
    Bureau, A
    Lunetta, K
    Hayward, B
    Falls, K
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (05) : 194 - 194
  • [3] Object-oriented mapping of landslides using Random Forests
    Stumpf, Andre
    Kerle, Norman
    [J]. REMOTE SENSING OF ENVIRONMENT, 2011, 115 (10) : 2564 - 2577
  • [4] Geological Mapping in Western Tasmania Using Radar and Random Forests
    Radford, Declan D. G.
    Cracknell, Matthew J.
    Roach, Michael J.
    Cumming, Grace V.
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2018, 11 (09) : 3075 - 3087
  • [5] Association mapping, using a mixture model for complex traits
    Zhu, XF
    Zhang, SL
    Zhao, HY
    Cooper, RS
    [J]. GENETIC EPIDEMIOLOGY, 2002, 23 (02) : 181 - 196
  • [6] QTL mapping for growth traits of pigs using random regression models
    Pinheiro, Valeria Rosado
    Fonseca e Silva, Fabyano
    Facioni Guimaraes, Simone Eliza
    Vilela de Resende, Marcos Deon
    Lopes, Paulo Savio
    Cruz, Cosme Damiao
    Azevedo, Camila Ferreira
    [J]. PESQUISA AGROPECUARIA BRASILEIRA, 2013, 48 (02) : 190 - 196
  • [7] Mapping canopy nitrogen in European forests using remote sensing and environmental variables with the random forests method
    Loozen, Yasmina
    Rebel, Karin T.
    de Jong, Steven M.
    Lu, Meng
    Ollinger, Scott, V
    Wassen, Martin J.
    Karssenberg, Derek
    [J]. REMOTE SENSING OF ENVIRONMENT, 2020, 247
  • [8] Diversity Forests: Using Split Sampling to Enable Innovative Complex Split Procedures in Random Forests
    Roman Hornung
    [J]. SN Computer Science, 2022, 3 (1)
  • [9] The utility of Random Forests for wildfire severity mapping
    Collins, L.
    Griffioen, P.
    Newell, G.
    Mellor, A.
    [J]. REMOTE SENSING OF ENVIRONMENT, 2018, 216 : 374 - 384
  • [10] Random Forests for mapping and analysis of microkinetics models
    Partopour, Behnam
    Paffenroth, Randy C.
    Dixon, Anthony G.
    [J]. COMPUTERS & CHEMICAL ENGINEERING, 2018, 115 : 286 - 294