Microbial genotype-phenotype mapping by class association rule mining

被引:34
|
作者
Tamura, Makio [1 ]
D'haeseleer, Patrik [1 ]
机构
[1] Lawrence Livermore Natl Lab, Comp Applicat & Res Dept, Chem Mat Earth & Life Sci Dept, Microbial Syst Biol Grp, Livermore, CA 94550 USA
关键词
D O I
10.1093/bioinformatics/btn210
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Microbial phenotypes are typically due to the concerted action of multiple gene functions, yet the presence of each gene may have only a weak correlation with the observed phenotype. Hence, it may be more appropriate to examine co-occurrence between sets of genes and a phenotype (multiple-to-one) instead of pairwise relations between a single gene and the phenotype. Here, we propose an efficient class association rule mining algorithm, NETCAR, in order to extract sets of COGs (clusters of orthologous groups of proteins) associated with a phenotype from COG phylogenetic profiles and a phenotype profile. NETCAR takes into account the phylogenetic co-occurrence graph between COGs to restrict hypothesis space, and uses mutual information to evaluate the biconditional relation. Results: We examined the mining capability of pairwise and multiple-to-one association by using NETCAR to extract COGs relevant to six microbial phenotypes (aerobic, anaerobic, facultative, endospore, motility and Gram negative) from 11 969 unique COG profiles across 155 prokaryotic organisms. With the same level of false discovery rate, multiple-to-one association can extract about 10 times more relevant COGs than one-to-one association. We also reveal various topologies of association networks among COGs (modules) from extracted multiple-to-one correlation rules relevant with the six phenotypes; including a well-connected network for motility, a star-shaped network for aerobic and intermediate topologies for the other phenotypes. NETCAR outperforms a standard CAR mining algorithm, CARAPRIORI, while requiring several orders of magnitude less computational time for extracting 3-COG sets.
引用
收藏
页码:1523 / 1529
页数:7
相关论文
共 50 条
  • [31] Genotype-phenotype mapping of human gastrointestinal cancers using organoids
    Sato, Toshiro
    CANCER SCIENCE, 2023, 114 : 248 - 248
  • [32] Genotype-Phenotype Mapping of Skull Development and Adaptation in Squamate Reptiles
    Ollonen, J.
    Da Silva, F. O.
    Di-Poi, N.
    JOURNAL OF MORPHOLOGY, 2019, 280 : S61 - S62
  • [33] Evolution of genetic redundancy: the relevance of complexity in genotype-phenotype mapping
    Saito, Nen
    Ishihara, Shuji
    Kaneko, Kunihiko
    NEW JOURNAL OF PHYSICS, 2014, 16
  • [34] Genotype-Phenotype Maps
    Stadler P.F.
    Stadler B.M.R.
    Biological Theory, 2006, 1 (3) : 268 - 279
  • [35] A GENETIC ALGORITHM WITH A MULTI-LAYERED GENOTYPE-PHENOTYPE MAPPING
    Hill, Seamus
    O'Riordan, Colm
    ICEC 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON EVOLUTIONARY COMPUTATION, 2010, : 369 - 372
  • [36] Mining for genotype-phenotype relations in Saccharomyces using partial least squares
    Tahir Mehmood
    Harald Martens
    Solve Sæbø
    Jonas Warringer
    Lars Snipen
    BMC Bioinformatics, 12
  • [37] G6PD deficiency: the genotype-phenotype association
    Mason, Philip J.
    Bautista, Jose M.
    Gilsanz, Florinda
    BLOOD REVIEWS, 2007, 21 (05) : 267 - 283
  • [38] Mining for genotype-phenotype relations in Saccharomyces using partial least squares
    Mehmood, Tahir
    Martens, Harald
    Saebo, Solve
    Warringer, Jonas
    Snipen, Lars
    BMC BIOINFORMATICS, 2011, 12
  • [39] Genotype-Phenotype Association in Infants with Cystic Fibrosis at the Time of Diagnosis
    Richard Kraemer
    Peter Birrer
    Sabina Liechti-Gallati
    Pediatric Research, 1998, 44 : 920 - 926
  • [40] InGene: a multimodal approach to the genotype-phenotype association in neuromuscular diseases
    Conte, Raffaele
    Sansone, Francesco
    Tonacci, Alessandro
    Roccella, Stefano
    Spezzaneve, Andrea
    Rateni, Giovanni
    Tesconi, Mario
    Calderisi, Marco
    Fantacci, Maria Evelina
    Astrea, Guja
    Santorelli, Filippo Maria
    Diodato, Gianluca
    Grande, Andrea
    Landi, Patrizia
    Pala, Anna Paola
    Raciti, Mauro
    Sansone, Francesco
    Scudellari, Maria Cristina
    Frosini, Silvia
    Trovato, Rosanna
    Zupone, Giuseooe
    Cecchi, Antonio
    Kull, Silvia
    Turchi, Fabrizia
    Sodini, Filippo
    Tacchi, Paolo
    Ceppa, Ilaria
    Giorgolo, Francesca
    Palmas, Federica
    Betti, Cinzia
    Tupler, Rosella
    Zhu, Song
    2018 IEEE 8TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - BERLIN (ICCE-BERLIN), 2018,