Efficient learning of microbial genotype-phenotype association rules

被引:22
|
作者
MacDonald, Norman J. [1 ]
Beiko, Robert G. [1 ]
机构
[1] Dalhousie Univ, Fac Comp Sci, Halifax, NS, Canada
基金
加拿大自然科学与工程研究理事会; 加拿大创新基金会;
关键词
GENOME ANALYSIS; GENE; CLASSIFICATION; SELECTION; TRAITS; SYSTEM;
D O I
10.1093/bioinformatics/btq305
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Finding biologically causative genotype-phenotype associations from whole-genome data is difficult due to the large gene feature space to mine, the potential for interactions among genes and phylogenetic correlations between genomes. Associations within phylogentically distinct organisms with unusual molecular mechanisms underlying their phenotype may be particularly difficult to assess. Results: We have developed a new genotype-phenotype association approach that uses Classification based on Predictive Association Rules (CPAR), and compare it with NETCAR, a recently published association algorithm. Our implementation of CPAR gave on average slightly higher classification accuracy, with approximately 100 time faster running times. Given the influence of phylogenetic correlations in the extraction of genotype-phenotype association rules, we furthermore propose a novel measure for downweighting the dependence among samples by modeling shared ancestry using conditional mutual information, and demonstrate its complementary nature to traditional mining approaches.
引用
收藏
页码:1834 / 1840
页数:7
相关论文
共 50 条
  • [1] Microbial genotype-phenotype mapping by class association rule mining
    Tamura, Makio
    D'haeseleer, Patrik
    BIOINFORMATICS, 2008, 24 (13) : 1523 - 1529
  • [2] The dynamics of the genotype-phenotype association
    Kuhnlein, U
    Parsanejad, R
    Zadworny, D
    Aggrey, SE
    POULTRY SCIENCE, 2003, 82 (06) : 876 - 881
  • [3] Multivariate Analysis of Genotype-Phenotype Association
    Mitteroecker, Philipp
    Cheverud, James M.
    Pavlicev, Mihaela
    GENETICS, 2016, 202 (04) : 1345 - +
  • [4] An efficient approach to large-scale genotype-phenotype association analyses
    Yang, Runqing
    Li, Hongwang
    Fu, Lina
    Liu, Yongxin
    BRIEFINGS IN BIOINFORMATICS, 2014, 15 (05) : 814 - 822
  • [5] Learning an Evolvable Genotype-Phenotype Mapping
    Moreno, Matthew Andres
    Banzhaf, Wolfgang
    Ofria, Charles
    GECCO'18: PROCEEDINGS OF THE 2018 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2018, : 983 - 990
  • [6] Learning Monotonic Genotype-Phenotype Maps
    Beerenwinkel, Niko
    Knupfer, Patrick
    Tresch, Achim
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2011, 10 (01):
  • [7] Assumptions in studies of heritability and genotype-phenotype association
    Miller, Michael B.
    DeYoung, Colin G.
    McGue, Matt
    BEHAVIORAL AND BRAIN SCIENCES, 2012, 35 (05) : 372 - 373
  • [8] Pyruvate kinase deficiency: The genotype-phenotype association
    Zanella, Alberto
    Fermo, Elisa
    Bianchi, Paola
    Chiarelli, Laurent Roberto
    Valentini, Giovanna
    BLOOD REVIEWS, 2007, 21 (04) : 217 - 231
  • [9] Efficient techniques for genotype-phenotype correlational analysis
    Saha, Subrata
    Rajasekaran, Sanguthevar
    Bi, Jinbo
    Pathak, Sudipta
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2013, 13
  • [10] Machine learning, the kidney, and genotype-phenotype analysis
    Sealfon, Rachel S. G.
    Mariani, Laura H.
    Kretzler, Matthias
    Troyanskaya, Olga G.
    KIDNEY INTERNATIONAL, 2020, 97 (06) : 1141 - 1149