Machine learning and feature extraction for rapid antimicrobial resistance prediction of Acinetobacter baumannii from whole-genome sequencing data

被引:3
|
作者
Gao, Yue [1 ,2 ]
Li, Henan [2 ]
Zhao, Chunjiang [2 ]
Li, Shuguang [2 ]
Yin, Guankun [2 ]
Wang, Hui [1 ,2 ]
机构
[1] Peking Univ, Inst Med Technol, Hlth Sci Ctr, Beijing, Peoples R China
[2] Peking Univ, Peoples Hosp, Dept Clin Lab, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Acinetobacter baumannii; whole-genome sequencing; antimicrobial resistance prediction; machine learning; k-mer; ESCHERICHIA-COLI;
D O I
10.3389/fmicb.2023.1320312
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Background Whole-genome sequencing (WGS) has contributed significantly to advancements in machine learning methods for predicting antimicrobial resistance (AMR). However, the comparisons of different methods for AMR prediction without requiring prior knowledge of resistance remains to be conducted. Methods We aimed to predict the minimum inhibitory concentrations (MICs) of 13 antimicrobial agents against Acinetobacter baumannii using three machine learning algorithms (random forest, support vector machine, and XGBoost) combined with k-mer features extracted from WGS data. Results A cohort of 339 isolates was used for model construction. The average essential agreement and category agreement of the best models exceeded 90.90% (95%CI, 89.03-92.77%) and 95.29% (95%CI, 94.91-95.67%), respectively; the exceptions being levofloxacin, minocycline and imipenem. The very major error rates ranged from 0.0 to 5.71%. We applied feature selection pipelines to extract the top-ranked 11-mers to optimise training time and computing resources. This approach slightly improved the prediction performance and enabled us to obtain prediction results within 10 min. Notably, when employing these top-ranked 11-mers in an independent test dataset (120 isolates), we achieved an average accuracy of 0.96. Conclusion Our study is the first to demonstrate that AMR prediction for A. baumannii using machine learning methods based on k-mer features has competitive performance over traditional workflows; hence, sequence-based AMR prediction and its application could be further promoted. The k-mer-based workflow developed in this study demonstrated high recall/sensitivity and specificity, making it a dependable tool for MIC prediction in clinical settings.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Prediction of antimicrobial resistance based on whole-genome sequencing and machine learning
    Ren, Yunxiao
    Chakraborty, Trinad
    Doijad, Swapnil
    Falgenhauer, Linda
    Falgenhauer, Jane
    Goesmann, Alexander
    Hauschild, Anne-Christin
    Schwengers, Oliver
    Heider, Dominik
    [J]. BIOINFORMATICS, 2022, 38 (02) : 325 - 334
  • [2] Whole-genome sequencing in the prediction of antimicrobial resistance
    Chan, Kok-Gan
    [J]. EXPERT REVIEW OF ANTI-INFECTIVE THERAPY, 2016, 14 (07) : 617 - 619
  • [3] Prediction of antimicrobial resistance in clinicalCampylobacter jejuniisolates from whole-genome sequencing data
    Dahl, Louise Gade
    Joensen, Katrine Grimstrup
    Osterlund, Mark Thomas
    Kiil, Kristoffer
    Nielsen, Eva Moller
    [J]. EUROPEAN JOURNAL OF CLINICAL MICROBIOLOGY & INFECTIOUS DISEASES, 2021, 40 (04) : 673 - 682
  • [4] Prediction of Antimicrobial Resistance in Gram-Negative Bacteria From Whole-Genome Sequencing Data
    Van Camp, Pieter-Jan
    Haslam, David B.
    Porollo, Aleksey
    [J]. FRONTIERS IN MICROBIOLOGY, 2020, 11
  • [5] Prediction of antimicrobial resistance in clinical Campylobacter jejuni isolates from whole-genome sequencing data
    Louise Gade Dahl
    Katrine Grimstrup Joensen
    Mark Thomas Østerlund
    Kristoffer Kiil
    Eva Møller Nielsen
    [J]. European Journal of Clinical Microbiology & Infectious Diseases, 2021, 40 : 673 - 682
  • [6] Prediction of Staphylococcus aureus Antimicrobial Resistance by Whole-Genome Sequencing
    Gordon, N. C.
    Price, J. R.
    Cole, K.
    Everitt, R.
    Morgan, M.
    Finney, J.
    Kearns, A. M.
    Pichon, B.
    Young, B.
    Wilson, D. J.
    Llewelyn, M. J.
    Paul, J.
    Peto, T. E. A.
    Crook, D. W.
    Walker, A. S.
    Golubchik, T.
    [J]. JOURNAL OF CLINICAL MICROBIOLOGY, 2014, 52 (04) : 1182 - 1191
  • [7] Whole-Genome Sequencing Elucidates Epidemiology of Nosocomial Clusters of Acinetobacter baumannii
    Willems, Stefanie
    Kampmeier, Stefanie
    Bletz, Stefan
    Kossow, Annelene
    Koeck, Robin
    Kipp, Frank
    Mellmann, Alexander
    [J]. JOURNAL OF CLINICAL MICROBIOLOGY, 2016, 54 (09) : 2391 - 2394
  • [8] Whole-genome sequencing for the characterization of resistance mechanisms and epidemiology of colistin-resistant Acinetobacter baumannii
    Hahm, Chorong
    Chung, Hae-Sun
    Lee, Miae
    [J]. PLOS ONE, 2022, 17 (03):
  • [9] Detection of Antimicrobial Resistance Genes Associated with Carbapenem Resistance from the Whole-Genome Sequence of Acinetobacter baumannii Isolates from Malaysia
    Rao, Mohan
    Rashid, Fairuz A.
    Shukor, Surianti
    Hashim, Rohaidah
    Ahmad, Norazah
    [J]. CANADIAN JOURNAL OF INFECTIOUS DISEASES & MEDICAL MICROBIOLOGY, 2020, 2020
  • [10] Whole-genome sequencing to control antimicrobial resistance
    Koeser, Claudio U.
    Ellington, Matthew J.
    Peacock, Sharon J.
    [J]. TRENDS IN GENETICS, 2014, 30 (09) : 401 - 407