Microbial genotype-phenotype mapping by class association rule mining

Cited by: 34
Authors
Tamura, Makio [1]
D'haeseleer, Patrik [1]
Affiliations
[1] Lawrence Livermore Natl Lab, Comp Applicat & Res Dept, Chem Mat Earth & Life Sci Dept, Microbial Syst Biol Grp, Livermore, CA 94550 USA
DOI
10.1093/bioinformatics/btn210
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Motivation: Microbial phenotypes are typically due to the concerted action of multiple gene functions, yet the presence of each gene may have only a weak correlation with the observed phenotype. Hence, it may be more appropriate to examine co-occurrence between sets of genes and a phenotype (multiple-to-one) instead of pairwise relations between a single gene and the phenotype. Here, we propose an efficient class association rule mining algorithm, NETCAR, to extract sets of COGs (clusters of orthologous groups of proteins) associated with a phenotype from COG phylogenetic profiles and a phenotype profile. NETCAR takes into account the phylogenetic co-occurrence graph between COGs to restrict the hypothesis space, and uses mutual information to evaluate the biconditional relation.
Results: We examined the mining capability of pairwise and multiple-to-one association by using NETCAR to extract COGs relevant to six microbial phenotypes (aerobic, anaerobic, facultative, endospore, motility and Gram negative) from 11 969 unique COG profiles across 155 prokaryotic organisms. At the same false discovery rate, multiple-to-one association can extract about 10 times more relevant COGs than one-to-one association. We also reveal various topologies of association networks among COGs (modules) from the extracted multiple-to-one correlation rules relevant to the six phenotypes, including a well-connected network for motility, a star-shaped network for aerobic and intermediate topologies for the other phenotypes. NETCAR outperforms a standard CAR mining algorithm, CARAPRIORI, while requiring several orders of magnitude less computational time for extracting 3-COG sets.
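The multiple-to-one idea in the abstract can be illustrated with a small sketch. This is not the NETCAR algorithm: it enumerates COG sets exhaustively and does not use the phylogenetic co-occurrence graph to prune candidates. It only shows how mutual information between the joint presence (logical AND) of a COG set and a phenotype profile can exceed that of any single COG. The function names, the COG labels and the eight-organism toy profiles are all hypothetical.

```python
from itertools import combinations
from math import log2


def mutual_information(x, y):
    """Mutual information (in bits) between two equal-length binary sequences."""
    n = len(x)
    mi = 0.0
    for a in (0, 1):
        for b in (0, 1):
            p_ab = sum(1 for xi, yi in zip(x, y) if xi == a and yi == b) / n
            p_a = sum(1 for xi in x if xi == a) / n
            p_b = sum(1 for yi in y if yi == b) / n
            if p_ab > 0:  # zero-probability cells contribute nothing
                mi += p_ab * log2(p_ab / (p_a * p_b))
    return mi


def conjunction(profiles, names):
    """Per-organism indicator: 1 where every COG in `names` is present."""
    columns = zip(*(profiles[name] for name in names))
    return [int(all(col)) for col in columns]


def rank_cog_sets(profiles, phenotype, max_size=2):
    """Score every COG set up to `max_size` by MI with the phenotype.

    Exhaustive enumeration (an assumption made for clarity); the paper's
    method restricts the search space instead of trying all combinations.
    """
    results = []
    for size in range(1, max_size + 1):
        for names in combinations(sorted(profiles), size):
            score = mutual_information(conjunction(profiles, names), phenotype)
            results.append((names, score))
    return sorted(results, key=lambda r: r[1], reverse=True)


# Hypothetical toy data: presence/absence across 8 organisms.
profiles = {
    "COG_A": [1, 1, 1, 1, 1, 0, 0, 0],
    "COG_B": [1, 1, 1, 0, 0, 1, 1, 0],
    "COG_C": [0, 1, 0, 1, 0, 1, 0, 1],
}
phenotype = [1, 1, 1, 0, 0, 0, 0, 0]

ranked = rank_cog_sets(profiles, phenotype, max_size=2)
for names, score in ranked:
    print(names, round(score, 3))
```

In this toy data neither COG_A nor COG_B alone determines the phenotype, but their joint presence matches it exactly, so the pair scores highest. Exhaustive enumeration grows combinatorially with set size, which is the cost the abstract says NETCAR reduces.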
Pages: 1523-1529
Number of pages: 7
Related papers
50 items in total
  • [41] Linking microbial communities to ecosystem functions: what we can learn from genotype-phenotype mapping in organisms
    Morriss, Andrew
    Meyer, Kyle
    Bohannan, Brendan
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2020, 375 (1798)
  • [42] Genotype-phenotype association research in ethnically-complex samples
    Miller, Michael
    BEHAVIOR GENETICS, 2014, 44 (06) : 673 - 673
  • [43] Genotype-phenotype association in infants with cystic fibrosis at the time of diagnosis
    Kraemer, R
    Birrer, P
    Liechti-Gallati, S
    PEDIATRIC RESEARCH, 1998, 44 (06) : 920 - 926
  • [44] Strategies for performing genotype-phenotype association studies in nonhuman primates
    Barr, Christina S.
    METHODS, 2009, 49 (01) : 56 - 62
  • [45] The DU Map: A Visualization to Gain Insights into Genotype-Phenotype Mapping and Diversity
    Medvet, Eric
    Tusar, Tea
    PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCO'17 COMPANION), 2017, : 1705 - 1712
  • [46] Mapping phyllosphere microbiota interactions in planta to establish genotype-phenotype relationships
    Schaefer, Martin
    Vogel, Christine M.
    Bortfeld-Miller, Miriam
    Mittelviefhaus, Maximilian
    Vorholt, Julia A.
    NATURE MICROBIOLOGY, 2022, 7 (06) : 856 - +
  • [47] Multivariate Genotype-Phenotype Mapping for Craniofacial Shape in the Diversity Outbred Population
    Aponte, J. D.
    Katz, D. C.
    Percival, C. J.
    Roseman, C. C.
    Cheverud, J. M.
    Marcucio, R. S.
    Hallgrimsson, B.
    JOURNAL OF MORPHOLOGY, 2019, 280 : S76 - S77
  • [48] Establishment of neuroendocrine neoplasms organoid biobank enables genotype-phenotype mapping
    Kawasaki, K.
    Fujii, M.
    Kudo, A.
    Kanai, T.
    Nakagawa, H.
    Sato, T.
    JOURNAL OF NEUROENDOCRINOLOGY, 2021, 33 : 29 - 29
  • [49] Genotype-phenotype mapping with polyominos made from DNA origami tiles
    Dreher, Yannik
    Fichtler, Julius
    Karfusehr, Christoph
    Jahnke, Kevin
    Xin, Yang
    Keller, Adrian
    Goepfrich, Kerstin
    BIOPHYSICAL JOURNAL, 2022, 121 (24) : 4840 - 4848
  • [50] GENETIC EVOLUTION OF 'SORTING' PROGRAMS THROUGH A NOVEL GENOTYPE-PHENOTYPE MAPPING
    Xhemali, Daniela
    Hinde, Christopher J.
    Stone, Roger G.
    ICEC 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON EVOLUTIONARY COMPUTATION, 2010, : 190 - 198