Sensible Initialization Using Expert Knowledge for Genome-Wide Analysis of Epistasis Using Genetic Programming

被引:6
|
作者
Greene, Casey S. [1 ]
White, Bill C. [1 ]
Moore, Jason H. [1 ]
机构
[1] Dartmouth Med Sch, Dept Genet, Lebanon, NH USA
关键词
ASSOCIATION; SUSCEPTIBILITY; RELIEFF; CANCER;
D O I
10.1109/CEC.2009.4983093
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For biomedical researchers it is now possible to measure large numbers of DNA sequence variations across the human genome. Measuring hundreds of thousands of variations is now routine, but single variations which consistently predict an individual's risk of common human disease have proven elusive. Instead of single variants determining the risk of common human diseases, it seems more likely that disease risk is best modeled by interactions between biological components. The evolutionary computing challenge now is to effectively explore interactions in these large datasets and identify combinations of variations which are robust predictors of common human diseases such as bladder cancer. One promising approach to this problem is genetic programming (GP). A GP approach for this problem will use darwinian inspired evolution to evolve programs which find and model attribute interactions which predict an individual's risk of common human diseases. The goal of this study is to develop and evaluate two initializers for this domain. We develop a probabilistic initializer which uses expert knowledge to select attributes and an enumerative initializer which maximizes attribute diversity in the generated population. We compare these initializers to a random initializer which displays no preference for attributes. We show that the expert-knowledge-aware probabilistic initializer significantly outperforms both the random initializer and the enumerative initializer. We discuss implications of these results for the design of GP strategies which are able to detect and characterize predictors of common human diseases.
引用
收藏
页码:1289 / 1296
页数:8
相关论文
共 50 条
  • [1] Genome-wide genetic analysis using genetic programming: The critical need for expert knowledge
    Moore, Jason H.
    White, Bill C.
    GENETIC PROGRAMMING THEORY AND PRACTICE IV, 2007, 4 : 11 - +
  • [2] Exploiting expert knowledge in genetic programming for genome-wide genetic analysis
    Moore, Jason H.
    White, Bill C.
    PARALLEL PROBLEM SOLVING FROM NATURE - PPSN IX, PROCEEDINGS, 2006, 4193 : 969 - 977
  • [3] An expert knowledge-guided mutation operator for genome-wide genetic analysis using genetic programming
    Greene, Casey S.
    White, Bill C.
    Moore, Jason H.
    PATTERN RECOGNITION IN BIOINFORMATICS, PROCEEDINGS, 2007, 4774 : 30 - 40
  • [4] GP-Pi: Using Genetic Programming with Penalization and Initialization on Genome-Wide Association Study
    Sze-To, Ho-Yin
    Lee, Kwan-Yeung
    Tso, Kai-Yuen
    Wong, Man-Hon
    Lee, Kin-Hong
    Tang, Nelson L. S.
    Leung, Kwong-Sak
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT II, 2013, 7895 : 330 - +
  • [5] GENOME-WIDE GENETIC INTERACTION ANALYSIS OF GLAUCOMA USING EXPERT KNOWLEDGE DERIVED FROM HUMAN PHENOTYPE NETWORKS
    Hu, Ting
    Darabos, Christian
    Cricco, Maria E.
    Kong, Emily
    Moore, Jason H.
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2015 (PSB), 2015, : 207 - 218
  • [6] EpICS: A System for Genome-wide Epistasis and Genetic Variation Analysis using Protein-Protein Interactions
    Sultana, Kazi Zakia
    Bhattacharjee, Anupam
    Jamil, Hasan
    BIBMW: 2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOP, 2009, : 256 - 261
  • [7] Using Biological Knowledge to Uncover the Mystery in the Search for Epistasis in Genome-Wide Association Studies
    Ritchie, Marylyn D.
    ANNALS OF HUMAN GENETICS, 2011, 75 : 172 - 182
  • [8] Epistasis Analysis Goes Genome-Wide
    Zhang, Jianzhi
    PLOS GENETICS, 2017, 13 (02):
  • [9] SuperDCA for genome-wide epistasis analysis
    Puranen, Santeri
    Pesonen, Maiju
    Pensar, Johan
    Xu, Ying Ying
    Lees, John A.
    Bentley, Stephen D.
    Croucher, Nicholas J.
    Corander, Jukka
    MICROBIAL GENOMICS, 2018, 4 (06):
  • [10] Toward the automated analysis of complex diseases in genome-wide association studies using genetic programming
    Sohn, Andrew
    Olson, Randal S.
    Moore, Jason H.
    PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'17), 2017, : 489 - 496