A machine-learning approach to detecting unknown bacterial serovars

被引:12
|
作者
Akova F. [1 ]
Dundar M. [1 ]
Davisson V.J. [2 ,3 ]
Hirleman E.D. [4 ]
Bhunia A.K. [5 ]
Robinson J.P. [3 ]
Rajwa B. [3 ]
机构
[1] Department of Computer and Information Science, Indiana University-Purdue University, Indianapolis
[2] Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, W. Lafayette
[3] Bindley Bioscience Center, Purdue University, W. Lafayette
[4] School of Mechanical Engineering, Purdue University, W. Lafayette
[5] Department of Food Science, Purdue University, W. Lafayette
来源
关键词
Anomaly detection; Bayesian classifier; Nonexhaustive training data; Novelty detection;
D O I
10.1002/sam.10085
中图分类号
学科分类号
摘要
Technologies for rapid detection of bacterial pathogens are crucial for securing the food supply. A light-scattering sensor recently developed for real-time identification of multiple colonies has shown great promise for distinguishing bacteria cultures. The classification approach currently used with this system relies on supervised learning. For accurate classification of bacterial pathogens, the training library should be exhaustive, i.e., should consist of samples of all possible pathogens. Yet, the sheer number of existing bacterial serovars and more importantly the effect of their high mutation rate would not allow for a practical and manageable training. In this study, we propose a Bayesian approach to learning with a nonexhaustive training dataset for automated detection of unknown bacterial serovars, i.e., serovars for which no samples exist in the training library. The main contribution of our work is the Wishart conjugate priors defined over class distributions. This allows us to employ the prior information obtained from known classes to make inferences about unknown classes as well. By this means, we identify new classes of informational value and dynamically update the training dataset with these classes to make it increasingly more representative of the sample population. This results in a classifier with improved predictive performance for future samples. We evaluated our approach on a 28-class bacteria dataset and also on the benchmark 26-class letter recognition dataset for further validation. The proposed approach is compared against state-of-the-art involving density-based approaches and support vector domain description, as well as a recently introduced Bayesian approach based on simulated classes. © 2010 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 3: 289-301, 2010 Copyright © 2010.
引用
收藏
页码:289 / 301
页数:12
相关论文
共 50 条
  • [1] Detecting neurodevelopmental trajectories in congenital heart diseases with a machine-learning approach
    Cainelli, Elisa
    Bisiacchi, Patrizia S.
    Cogo, Paola
    Padalino, Massimo
    Simonato, Manuela
    Vergine, Michela
    Lanera, Corrado
    Vedovelli, Luca
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [2] Detecting neurodevelopmental trajectories in congenital heart diseases with a machine-learning approach
    Elisa Cainelli
    Patrizia S. Bisiacchi
    Paola Cogo
    Massimo Padalino
    Manuela Simonato
    Michela Vergine
    Corrado Lanera
    Luca Vedovelli
    [J]. Scientific Reports, 11
  • [3] Prediction of bacterial associations with plants using a supervised machine-learning approach
    Manuel Martinez-Garcia, Pedro
    Lopez-Solanilla, Emilia
    Ramos, Cayo
    Rodriguez-Palenzuela, Pablo
    [J]. ENVIRONMENTAL MICROBIOLOGY, 2016, 18 (12) : 4847 - 4861
  • [4] Machine-Learning Techniques for Detecting Attacks in SDN
    Elsayed, Mahmoud Said
    Nhien-An Le-Khac
    Dev, Soumyabrata
    Jurcut, Anca Delia
    [J]. PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2019), 2019, : 277 - 281
  • [5] A Machine-Learning Approach to Time Discrimination
    Hansen, Peter
    [J]. 2010 IEEE NUCLEAR SCIENCE SYMPOSIUM CONFERENCE RECORD (NSS/MIC), 2010, : 2132 - 2133
  • [6] IMPROVEMENTS IN A MACHINE-LEARNING ALGORITHM FOR DETECTING STATUS EPILEPTICUS
    Kamousi, Baharan
    Gupta, Archit
    Karunakaran, Suganya
    Marjaninejad, Ali
    Woo, Raymond
    Parvizi, Josef
    [J]. CRITICAL CARE MEDICINE, 2024, 52
  • [7] Theory Identity: A Machine-Learning Approach
    Larsen, Kai R.
    Hovorka, Dirk
    West, Jevin
    Birt, James
    Pfaff, James R.
    Chambers, Trevor W.
    Sampedro, Zebula R.
    Zager, Nick
    Vanstone, Bruce
    [J]. 2014 47TH HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS), 2014, : 4639 - 4648
  • [8] Evaluating Machine-Learning Techniques for Detecting Smart Ponzi Schemes
    Ibba, Giacomo
    Pierro, Giuseppe Antonio
    Di Francesco, Marco
    [J]. 2021 IEEE/ACM 4TH INTERNATIONAL WORKSHOP ON EMERGING TRENDS IN SOFTWARE ENGINEERING FOR BLOCKCHAIN (WETSEB 2021), 2021, : 34 - 40
  • [9] A machine-learning algorithm for detecting seizure termination in scalp EEG
    Shoeb, Ali
    Kharbouch, Alaa
    Soegaard, Jacqueline
    Schachter, Steven
    Guttag, John
    [J]. EPILEPSY & BEHAVIOR, 2011, 22 : S36 - S43
  • [10] How can machine-learning methods assist in virtual screening for hyperuricemia? A healthcare machine-learning approach
    Ichikawa, Daisuke
    Saito, Toki
    Ujita, Waka
    Oyama, Hiroshi
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 64 : 20 - 24