Sensible Initialization Using Expert Knowledge for Genome-Wide Analysis of Epistasis Using Genetic Programming

Cited by: 6
Authors
Greene, Casey S. [1]
White, Bill C. [1]
Moore, Jason H. [1]
Affiliations
[1] Dartmouth Medical School, Department of Genetics, Lebanon, NH, USA
Keywords
ASSOCIATION; SUSCEPTIBILITY; RELIEFF; CANCER
DOI
10.1109/CEC.2009.4983093
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Biomedical researchers can now measure large numbers of DNA sequence variations across the human genome. Measuring hundreds of thousands of variations is routine, but single variations that consistently predict an individual's risk of common human disease have proven elusive. Rather than single variants determining the risk of common human diseases, it seems more likely that disease risk is best modeled by interactions between biological components. The evolutionary computing challenge is to explore interactions in these large datasets effectively and to identify combinations of variations that are robust predictors of common human diseases such as bladder cancer. One promising approach to this problem is genetic programming (GP). A GP approach uses Darwinian-inspired evolution to evolve programs that find and model the attribute interactions that predict an individual's risk of common human diseases. The goal of this study is to develop and evaluate two initializers for this domain. We develop a probabilistic initializer, which uses expert knowledge to select attributes, and an enumerative initializer, which maximizes attribute diversity in the generated population. We compare these initializers to a random initializer, which displays no preference for attributes. We show that the expert-knowledge-aware probabilistic initializer significantly outperforms both the random initializer and the enumerative initializer. We discuss the implications of these results for the design of GP strategies that can detect and characterize predictors of common human diseases.
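The abstract names three initialization strategies but does not describe their mechanics. The following is a minimal Python sketch of how the three could differ, under stated assumptions: an individual is simplified to a flat list of attribute indices rather than a GP tree, expert knowledge is assumed to arrive as one non-negative weight per attribute, and all function names are hypothetical. It is not the authors' implementation.

```python
import random
from itertools import cycle, islice


def random_init(n_attrs, pop_size, k, rng):
    """Random initializer: k distinct attributes per individual, chosen uniformly."""
    return [rng.sample(range(n_attrs), k) for _ in range(pop_size)]


def enumerative_init(n_attrs, pop_size, k, rng):
    """Enumerative initializer (assumed form): deal attributes out in a shuffled
    round-robin so that usage counts across the population differ by at most one.
    Requires k <= n_attrs so that no individual repeats an attribute."""
    order = list(range(n_attrs))
    rng.shuffle(order)
    stream = cycle(order)
    return [list(islice(stream, k)) for _ in range(pop_size)]


def probabilistic_init(weights, pop_size, k, rng):
    """Probabilistic initializer (assumed form): draw k distinct attributes per
    individual with probability proportional to an expert-knowledge weight.
    At least k attributes must have a positive weight."""
    population = []
    for _ in range(pop_size):
        candidates = list(range(len(weights)))
        remaining = list(weights)
        chosen = []
        for _ in range(k):
            # Weighted draw over the attributes not yet used by this individual.
            i = rng.choices(range(len(candidates)), weights=remaining, k=1)[0]
            chosen.append(candidates.pop(i))
            remaining.pop(i)
        population.append(chosen)
    return population


if __name__ == "__main__":
    rng = random.Random(0)
    scores = [0.1, 0.1, 5.0, 0.1, 5.0, 0.1]  # hypothetical expert-knowledge weights
    print(random_init(len(scores), 4, 2, rng))
    print(enumerative_init(len(scores), 3, 2, rng))
    print(probabilistic_init(scores, 4, 2, rng))
```

In this sketch the enumerative variant guarantees that every attribute appears in the initial population when `pop_size * k >= n_attrs`, while the probabilistic variant concentrates the population on attributes that the weights favor.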
Pages: 1289-1296
Page count: 8