Naïve Bayes Classifiers and accompanying dataset for Pseudomonas syringae isolate characterization

被引:0
|
作者
Fautt, Chad [1 ,2 ,3 ]
Couradeau, Estelle [2 ,3 ]
Hockett, Kevin L. [1 ,3 ]
机构
[1] Penn State Univ, Dept Plant Pathol & Environm Microbiol, University Pk, PA 16802 USA
[2] Penn State Univ, Dept Ecosyst Sci & Management, University Pk, PA 16802 USA
[3] Penn State Univ, Intercoll Grad Degree Program Ecol, University Pk, PA 16802 USA
基金
美国食品与农业研究所;
关键词
BACTERIAL CANKER; CAUSAL AGENT; LIFE-HISTORY; PV; TOMATO; PLANT; PATHOGEN; EPIDEMIOLOGY; VIRULENCE; EVOLUTION; OUTBREAK;
D O I
10.1038/s41597-024-03003-x
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The Pseudomonas syringae species complex (PSSC) is a diverse group of plant pathogens with a collective host range encompassing almost every food crop grown today. As a threat to global food security, rapid detection and characterization of epidemic and emerging pathogenic lineages is essential. However, phylogenetic identification is often complicated by an unclarified and ever-changing taxonomy, making practical use of available databases and the proper training of classifiers difficult. As such, while amplicon sequencing is a common method for routine identification of PSSC isolates, there is no efficient method for accurate classification based on this data. Here we present a suite of five Naive bayes classifiers for PCR primer sets widely used for PSSC identification, trained on in-silico amplicon data from 2,161 published PSSC genomes using the life identification number (LIN) hierarchical clustering algorithm in place of traditional Linnaean taxonomy. Additionally, we include a dataset for translating classification results back into traditional taxonomic nomenclature (i.e. species, phylogroup, pathovar), and for predicting virulence factor repertoires.
引用
收藏
页数:8
相关论文
共 15 条
  • [1] Naïve Bayes Classifiers and accompanying dataset for Pseudomonas syringae isolate characterization
    Chad Fautt
    Estelle Couradeau
    Kevin L. Hockett
    Scientific Data, 11
  • [2] Landscapes of Naïve Bayes classifiers
    Zoë Hoare
    Pattern Analysis and Applications, 2008, 11 : 59 - 72
  • [3] Heterodimeric protein complex identification by naïve Bayes classifiers
    Osamu Maruyama
    BMC Bioinformatics, 14
  • [4] Tool wear monitoring using naïve Bayes classifiers
    Jaydeep Karandikar
    Tom McLeay
    Sam Turner
    Tony Schmitz
    The International Journal of Advanced Manufacturing Technology, 2015, 77 : 1613 - 1626
  • [5] Classification optimization for training a large dataset with Naïve Bayes
    Thi Thanh Sang Nguyen
    Pham Minh Thu Do
    Journal of Combinatorial Optimization, 2020, 40 : 141 - 169
  • [6] Human activity classification using Decision Tree and Naïve Bayes classifiers
    Kholoud Maswadi
    Norjihan Abdul Ghani
    Suraya Hamid
    Muhammads Babar Rasheed
    Multimedia Tools and Applications, 2021, 80 : 21709 - 21726
  • [7] Quantile-distribution functions and their use for classification, with application to naïve Bayes classifiers
    Edoardo Redivo
    Cinzia Viroli
    Alessio Farcomeni
    Statistics and Computing, 2023, 33
  • [8] Corruption risk analysis using semi-supervised naïve Bayes classifiers
    Balaniuk, R. (remis@ucb.br), 1600, Inderscience Enterprises Ltd., 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (05):
  • [9] Learning accurate and concise naïve Bayes classifiers from attribute value taxonomies and data
    J. Zhang
    D.-K. Kang
    A. Silvescu
    V. Honavar
    Knowledge and Information Systems, 2006, 9 : 157 - 179
  • [10] Segmentation of white blood cells and comparison of cell morphology by linear and naïve Bayes classifiers
    Jaroonrut Prinyakupt
    Charnchai Pluempitiwiriyawej
    BioMedical Engineering OnLine, 14