A machine-learning approach to detecting unknown bacterial serovars

被引:12
|
作者
Akova F. [1 ]
Dundar M. [1 ]
Davisson V.J. [2 ,3 ]
Hirleman E.D. [4 ]
Bhunia A.K. [5 ]
Robinson J.P. [3 ]
Rajwa B. [3 ]
机构
[1] Department of Computer and Information Science, Indiana University-Purdue University, Indianapolis
[2] Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, W. Lafayette
[3] Bindley Bioscience Center, Purdue University, W. Lafayette
[4] School of Mechanical Engineering, Purdue University, W. Lafayette
[5] Department of Food Science, Purdue University, W. Lafayette
来源
关键词
Anomaly detection; Bayesian classifier; Nonexhaustive training data; Novelty detection;
D O I
10.1002/sam.10085
中图分类号
学科分类号
摘要
Technologies for rapid detection of bacterial pathogens are crucial for securing the food supply. A light-scattering sensor recently developed for real-time identification of multiple colonies has shown great promise for distinguishing bacteria cultures. The classification approach currently used with this system relies on supervised learning. For accurate classification of bacterial pathogens, the training library should be exhaustive, i.e., should consist of samples of all possible pathogens. Yet, the sheer number of existing bacterial serovars and more importantly the effect of their high mutation rate would not allow for a practical and manageable training. In this study, we propose a Bayesian approach to learning with a nonexhaustive training dataset for automated detection of unknown bacterial serovars, i.e., serovars for which no samples exist in the training library. The main contribution of our work is the Wishart conjugate priors defined over class distributions. This allows us to employ the prior information obtained from known classes to make inferences about unknown classes as well. By this means, we identify new classes of informational value and dynamically update the training dataset with these classes to make it increasingly more representative of the sample population. This results in a classifier with improved predictive performance for future samples. We evaluated our approach on a 28-class bacteria dataset and also on the benchmark 26-class letter recognition dataset for further validation. The proposed approach is compared against state-of-the-art involving density-based approaches and support vector domain description, as well as a recently introduced Bayesian approach based on simulated classes. © 2010 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 3: 289-301, 2010 Copyright © 2010.
引用
下载
收藏
页码:289 / 301
页数:12
相关论文
共 50 条
  • [41] Automated coding of implicit motives: A machine-learning approach
    Pang, Joyce S.
    Ring, Hiram
    MOTIVATION AND EMOTION, 2020, 44 (04) : 549 - 566
  • [42] A Machine-Learning Approach to Predicting Need for Hospitalization for Pediatric
    Patel, Shilpa J.
    Chamberlain, Daniel
    Chamberlain, James M.
    PEDIATRICS, 2018, 142
  • [43] Machine-learning approach to the design of OSDAs for zeolite beta
    Daeyaert, Frits
    Ye, Fengdan
    Deem, Michael W.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (09) : 3413 - 3418
  • [44] Overachieving Municipalities in Public Health: A Machine-learning Approach
    Porto Chiavegatto Filho, Alexandre Dias
    dos Santos, Hellen Geremias
    do Nascimento, Carla Ferreira
    Massa, Kaio
    Kawachi, Ichiro
    EPIDEMIOLOGY, 2018, 29 (06) : 836 - 840
  • [45] Parametrization of Sunspot Groups Based on Machine-Learning Approach
    Egor Illarionov
    Andrey Tlatov
    Solar Physics, 2022, 297
  • [46] A machine-learning approach for nonalcoholic steatohepatitis susceptibility estimation
    Ghadiri, Fatemeh
    Husseini, Abbas Ali
    Oztas, Oguzhan
    INDIAN JOURNAL OF GASTROENTEROLOGY, 2022, 41 (05) : 475 - 482
  • [47] Performance Prediction of NUMA Placement: a Machine-Learning Approach
    Arapidis, Fanourios
    Karakostas, Vasileios
    Papadopoulou, Nikela
    Nikas, Konstantinos
    Goumas, Georgios
    Koziris, Nectarios
    2018 16TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM 2018), 2018, : 296 - 301
  • [48] Optimizing Count Responses in Surveys: A Machine-learning Approach
    Fu, Qiang
    Guo, Xin
    Land, Kenneth C.
    SOCIOLOGICAL METHODS & RESEARCH, 2020, 49 (03) : 637 - 671
  • [49] A Machine-Learning Approach to Application of Intelligent Artificial Reverberation
    Chourdakis, Emmanouil T.
    Reiss, Joshua D.
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2017, 65 (1-2): : 56 - 65
  • [50] A new approach of clustering based machine-learning algorithm
    Al-Omary, Alauddin Yousif
    Jamil, Mohammad Shahid
    KNOWLEDGE-BASED SYSTEMS, 2006, 19 (04) : 248 - 258