A machine-learning approach to detecting unknown bacterial serovars

Cited by: 12
Authors
Akova F. [1]
Dundar M. [1]
Davisson V.J. [2, 3]
Hirleman E.D. [4]
Bhunia A.K. [5]
Robinson J.P. [3]
Rajwa B. [3]
Affiliations
[1] Department of Computer and Information Science, Indiana University-Purdue University, Indianapolis
[2] Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, W. Lafayette
[3] Bindley Bioscience Center, Purdue University, W. Lafayette
[4] School of Mechanical Engineering, Purdue University, W. Lafayette
[5] Department of Food Science, Purdue University, W. Lafayette
Source
Statistical Analysis and Data Mining, 2010, 3: 289-301
Keywords
Anomaly detection; Bayesian classifier; Nonexhaustive training data; Novelty detection
DOI
10.1002/sam.10085
Abstract
Technologies for rapid detection of bacterial pathogens are crucial for securing the food supply. A light-scattering sensor recently developed for real-time identification of multiple colonies has shown great promise for distinguishing bacterial cultures. The classification approach currently used with this system relies on supervised learning. For accurate classification of bacterial pathogens, the training library should be exhaustive, i.e., it should contain samples of all possible pathogens. Yet the sheer number of existing bacterial serovars and, more importantly, the effect of their high mutation rate make a practical and manageable training library infeasible. In this study, we propose a Bayesian approach to learning with a nonexhaustive training dataset for automated detection of unknown bacterial serovars, i.e., serovars for which no samples exist in the training library. The main contribution of our work is the use of Wishart conjugate priors defined over class distributions. This allows us to employ the prior information obtained from known classes to make inferences about unknown classes as well. By this means, we identify new classes of informational value and dynamically update the training dataset with these classes to make it increasingly representative of the sample population. This results in a classifier with improved predictive performance for future samples. We evaluated our approach on a 28-class bacteria dataset and, for further validation, on the benchmark 26-class letter recognition dataset. The proposed approach is compared against state-of-the-art methods, including density-based approaches and support vector domain description, as well as a recently introduced Bayesian approach based on simulated classes. © 2010 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 3: 289-301, 2010.
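The abstract's core idea, using a prior learned from known classes to decide whether a sample belongs to an unseen class, can be illustrated with a rough sketch. This is not the authors' algorithm: it assumes Gaussian class models, treats each class mean as fixed at its sample mean, sets an inverse-Wishart prior scale from the pooled covariance of the known classes, and flags a sample as unknown when its squared Mahalanobis distance to every known class exceeds a user-chosen threshold. The names `fit_classes`, `classify_or_flag`, `nu`, and `threshold` are hypothetical.

```python
import numpy as np

def fit_classes(X, y, nu=None):
    """Fit per-class Gaussians with covariances shrunk toward a shared prior.

    Assumption: the covariance estimate is the inverse-Wishart posterior mean
    (Psi + S_k) / (nu + n_k - d - 1), with Psi chosen so that the prior mean
    equals the pooled within-class covariance of the known classes.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    d = X.shape[1]
    if nu is None:
        nu = d + 2.0  # smallest convenient value with a finite prior mean
    labels = np.unique(y)
    means, scatters, counts = {}, {}, {}
    for k in labels:
        Xk = X[y == k]
        mu = Xk.mean(axis=0)
        diff = Xk - mu
        means[k] = mu
        scatters[k] = diff.T @ diff  # within-class scatter matrix S_k
        counts[k] = len(Xk)
    pooled = sum(scatters.values()) / (len(X) - len(labels))
    psi = (nu - d - 1) * pooled  # prior scale matrix
    model = {}
    for k in labels:
        cov = (psi + scatters[k]) / (nu + counts[k] - d - 1)
        model[k] = (means[k], np.linalg.inv(cov))
    return model

def classify_or_flag(model, x, threshold):
    """Return (label, d2) for the nearest class, or ("unknown", d2) if too far."""
    x = np.asarray(x, dtype=float)
    best_label, best_d2 = None, np.inf
    for k, (mu, precision) in model.items():
        diff = x - mu
        d2 = float(diff @ precision @ diff)  # squared Mahalanobis distance
        if d2 < best_d2:
            best_label, best_d2 = k, d2
    if best_d2 > threshold:
        return "unknown", best_d2
    return best_label, best_d2

# Synthetic demo data (illustrative only, not from the paper).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 1.0, size=(50, 2)),
               rng.normal(5.0, 1.0, size=(50, 2))])
y = np.array([0] * 50 + [1] * 50)
model = fit_classes(X, y)

# 13.8 is roughly the 0.999 quantile of a chi-square with 2 degrees of freedom.
near_label, _ = classify_or_flag(model, [0.1, -0.2], threshold=13.8)
far_label, _ = classify_or_flag(model, [20.0, -20.0], threshold=13.8)
```

In this sketch a point near a known class center keeps that class label, while a point far from every known class is reported as `"unknown"`. The method described in the abstract goes further by creating new classes from such samples and updating the training set.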
Pages: 289-301
Page count: 12
Related papers
50 in total
  • [21] A machine-learning approach to optimal bid pricing
    Lawrence, RD
    COMPUTATIONAL MODELING AND PROBLEM SOLVING IN THE NETWORKED WORLD: INTERFACES IN COMPUTER SCIENCE AND OPERATIONS RESEARCH, 2002, 21 : 97 - 118
  • [22] A Machine-Learning Approach to Autonomous Music Composition
    Lichtenwalter, Ryan
    Lichtenwalter, Katerina
    Chawla, Nitesh
    JOURNAL OF INTELLIGENT SYSTEMS, 2010, 19 (02) : 95 - 123
  • [23] Machine-learning Approach to Microbial Colony Localisation
    Michal, Cicatka
    Radim, Burget
    Jan, Karasek
    2022 45TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING, TSP, 2022, : 206 - 211
  • [24] Machine-learning approach identifies wolfcamp reservoirs
    Carpenter C.
    JPT, Journal of Petroleum Technology, 2019, 71 (03) : 87 - 89
  • [25] Machine-learning approach to holographic particle characterization
    1600, OSA - The Optical Society (22):
  • [26] A machine-learning approach to predict postprandial hypoglycemia
    Wonju Seo
    You-Bin Lee
    Seunghyun Lee
    Sang-Man Jin
    Sung-Min Park
    BMC Medical Informatics and Decision Making, 19
  • [27] XFinder: Detecting Unknown Anomalies in Distributed Machine Learning Scenario
    Du, Haizhou
    Wang, Shiwei
    Huo, Huan
    FRONTIERS IN COMPUTER SCIENCE, 2021, 3
  • [28] Toward Detecting Illegal Transactions on Bitcoin Using Machine-Learning Methods
    Lee, Chaehyeon
    Maharjan, Sajan
    Ko, Kyungchan
    Hong, James Won-Ki
    BLOCKCHAIN AND TRUSTWORTHY SYSTEMS, BLOCKSYS 2019, 2020, 1156 : 520 - 533
  • [29] Automotive Feature Coordination based on a Machine-Learning Approach
    Dominka, Sven
    Tabrizi, Sarah
    Mandl, Michael
    Duebner, Michael
    2021 IEEE 11TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2021, : 726 - 731
  • [30] A Machine-learning based Unbiased Phishing Detection Approach
    Shirazi, Hossein
    Zweigle, Landon
    Ray, Indrakshi
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON E-BUSINESS AND TELECOMMUNICATIONS (SECRYPT), VOL 1, 2020, : 423 - 430