Efficient learning of microbial genotype-phenotype association rules

被引:22
|
作者
MacDonald, Norman J. [1 ]
Beiko, Robert G. [1 ]
机构
[1] Dalhousie Univ, Fac Comp Sci, Halifax, NS, Canada
基金
加拿大自然科学与工程研究理事会; 加拿大创新基金会;
关键词
GENOME ANALYSIS; GENE; CLASSIFICATION; SELECTION; TRAITS; SYSTEM;
D O I
10.1093/bioinformatics/btq305
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Finding biologically causative genotype-phenotype associations from whole-genome data is difficult due to the large gene feature space to mine, the potential for interactions among genes and phylogenetic correlations between genomes. Associations within phylogentically distinct organisms with unusual molecular mechanisms underlying their phenotype may be particularly difficult to assess. Results: We have developed a new genotype-phenotype association approach that uses Classification based on Predictive Association Rules (CPAR), and compare it with NETCAR, a recently published association algorithm. Our implementation of CPAR gave on average slightly higher classification accuracy, with approximately 100 time faster running times. Given the influence of phylogenetic correlations in the extraction of genotype-phenotype association rules, we furthermore propose a novel measure for downweighting the dependence among samples by modeling shared ancestry using conditional mutual information, and demonstrate its complementary nature to traditional mining approaches.
引用
收藏
页码:1834 / 1840
页数:7
相关论文
共 50 条
  • [41] GENERALIZING GENOTYPE-PHENOTYPE RELATIONSHIPS
    Graham, Dustin M.
    LAB ANIMAL, 2016, 45 (11) : 411 - 411
  • [42] Generalizing genotype-phenotype relationships
    Dustin M. Graham
    Lab Animal, 2016, 45 : 411 - 411
  • [43] Expanding the phenotype half of the genotype-phenotype space
    Bucan, Maja
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (06) : 1477 - 1479
  • [44] Replicating genotype-phenotype associations
    Chanock, Stephen J.
    Manolio, Teri
    Boehnke, Michael
    Boerwinkle, Eric
    Hunter, David J.
    Thomas, Gilles
    Hirschhorn, Joel N.
    Abecasis, Goncalo
    Altshuler, David
    Bailey-Wilson, Joan E.
    Brooks, Lisa D.
    Cardon, Lon R.
    Daly, Mark
    Donnelly, Peter
    Fraumeni, Joseph F., Jr.
    Freimer, Nelson B.
    Gerhard, Daniela S.
    Gunter, Chris
    Guttmacher, Alan E.
    Guyer, Mark S.
    Harris, Emily L.
    Hoh, Josephine
    Hoover, Robert
    Kong, C. Augustine
    Merikangas, Kathleen R.
    Morton, Cynthia C.
    Palmer, Lyle J.
    Phimister, Elizabeth G.
    Rice, John P.
    Roberts, Jerry
    Rotimi, Charles
    Tucker, Margaret A.
    Vogan, Kyle J.
    Wacholder, Sholom
    Wijsman, Ellen M.
    Winn, Deborah M.
    Collins, Francis S.
    NATURE, 2007, 447 (7145) : 655 - 660
  • [45] Longitudinal Genotype-Phenotype Association Study through Temporal Structure Auto-Learning Predictive Model
    Wang, Xiaoqian
    Yan, Jingwen
    Yao, Xiaohui
    Kim, Sungeun
    Nho, Kwangsik
    Risacher, Shannon L.
    Saykin, Andrew J.
    Shen, Li
    Huang, Heng
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2018, 25 (07) : 809 - 824
  • [46] Brain imaging indicates genotype-phenotype association in Duchenne muscular dystrophy
    Doorenweerd, N.
    Bettolo, C.
    Hollingsworth, K.
    Hendriksen, J.
    Niks, E.
    Straub, V.
    Kan, H.
    NEUROMUSCULAR DISORDERS, 2017, 27 : S249 - S249
  • [47] Brain imaging indicates genotype-phenotype association in Duchenne muscular dystrophy
    Doorenweerd, N.
    Bettolo, C. M.
    Hollingsworth, K. G.
    Hendriksen, J. G. M.
    Niks, E. H.
    Straub, V.
    Kan, H. E.
    NEUROMUSCULAR DISORDERS, 2018, 28 : S10 - S10
  • [48] A genotype-phenotype association approach to reveal thermal adaptation in Daphnia galeata
    Herrmann, Maike
    Henning-Lucass, Nicole
    Cordellier, Mathilde
    Schwenk, Klaus
    JOURNAL OF EXPERIMENTAL ZOOLOGY PART A-ECOLOGICAL AND INTEGRATIVE PHYSIOLOGY, 2017, 327 (01) : 53 - 65
  • [49] Molecular Markers EST-SSRs for Genotype-Phenotype Association in Sugarcane
    Diola, Valdir
    Barbosa, M. H. P.
    Veiga, C. F. M.
    Fernandes, E. C.
    SUGAR TECH, 2014, 16 (03) : 241 - 249
  • [50] Molecular Markers EST-SSRs for Genotype-Phenotype Association in Sugarcane
    Valdir Diola
    M. H. P. Barbosa
    C. F. M. Veiga
    E. C. Fernandes
    Sugar Tech, 2014, 16 : 241 - 249