Mapping complex traits using Random Forests

被引:0
|
作者
Alexandre Bureau
Josée Dupuis
Brooke Hayward
Kathleen Falls
Paul Van Eerdewegh
机构
[1] Genome Therapeutics Corporation,School of Health Sciences
[2] University of Lethbridge,Department of Biostatistics
[3] Boston University,Department of Psychiatry
[4] Harvard Medical School,undefined
来源
关键词
Candidate Gene Approach; Genetic Analysis Workshop; Predictive Error; Candidate Gene Analysis; Importance Index;
D O I
暂无
中图分类号
学科分类号
摘要
Random Forest is a prediction technique based on growing trees on bootstrap samples of data, in conjunction with a random selection of explanatory variables to define the best split at each node. In the case of a quantitative outcome, the tree predictor takes on a numerical value. We applied Random Forest to the first replicate of the Genetic Analysis Workshop 13 simulated data set, with the sibling pairs as our units of analysis and identity by descent (IBD) at selected loci as our explanatory variables. With the knowledge of the true model, we performed two sets of analyses on three phenotypes: HDL, triglycerides, and glucose. The goal was to approach the mapping of complex traits from a multivariate perspective. The first set of analyses mimics a candidate gene approach with a high proportion of true genes among the predictors while the second set represents a genome scan analysis using microsatellite markers. Random Forest was able to identify a few of the major genes influencing the phenotypes, such as baseline HDL and triglycerides, but failed to identify the major genes regulating baseline glucose levels.
引用
收藏
相关论文
共 50 条
  • [41] Memory Mapping and Parallelizing Random Forests for Speed and Cache Efficiency
    Romero, Eduardo
    Li, Angela
    Stewart, Christopher
    Hale, Kyle
    Morris, Nathaniel
    [J]. 50TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOP PROCEEDINGS - ICPP WORKSHOPS '21, 2021,
  • [42] Influence of outliers on QTL mapping for complex traits
    Yousaf HAYAT
    [J]. Journal of Zhejiang University-Science B(Biomedicine & Biotechnology), 2008, (12) : 931 - 937
  • [43] Epistasis: Obstacle or Advantage for Mapping Complex Traits?
    Verhoeven, Koen J. F.
    Casella, George
    McIntyre, Lauren M.
    [J]. PLOS ONE, 2010, 5 (08):
  • [44] Use of population isolates for mapping complex traits
    Peltonen, L
    Palotie, A
    Lange, K
    [J]. NATURE REVIEWS GENETICS, 2000, 1 (03) : 182 - 190
  • [45] Approaches to mapping genetically correlated complex traits
    George, AW
    Basu, S
    Li, N
    Rothstein, JH
    Sieberts, SK
    Stewart, W
    Wijsman, EM
    Thompson, EA
    [J]. BMC GENETICS, 2003, 4 (Suppl 1)
  • [46] Mapping complex traits in heterogeneous stock mice
    Flint, J.
    [J]. CALCIFIED TISSUE INTERNATIONAL, 2008, 82 : S17 - S17
  • [47] Mapping genes for complex traits in founder populations
    Ober, C
    Cox, NJ
    [J]. CLINICAL AND EXPERIMENTAL ALLERGY, 1998, 28 : 101 - 105
  • [48] A fast algorithm for functional mapping of complex traits
    Zhao, W
    Wu, RL
    Ma, CX
    Casella, G
    [J]. GENETICS, 2004, 167 (04) : 2133 - 2137
  • [49] Tutorial in biostatistics genetic mapping of complex traits
    Olson, JM
    Witte, JS
    Elston, RC
    [J]. STATISTICS IN MEDICINE, 1999, 18 (21) : 2961 - 2981
  • [50] Mapping complex traits with single nucleotide polymorphisms
    Lue Ping Zhao
    Corinne Aragaki
    Li Hsu
    Filemon Quiaoit
    [J]. Nature Genetics, 1999, 23 (Suppl 3) : 84 - 84