Using genomic data and machine learning to predict antibiotic resistance: A tutorial paper

被引:0
|
作者
Orcales, Faye [1 ,2 ]
Tan, Lucy Moctezuma [1 ,3 ]
Johnson-Hagler, Meris [1 ]
Suntay, John Matthew [1 ,2 ]
Ali, Jameel [1 ]
Recto, Kristiene [1 ]
Glenn, Phelan [1 ,4 ]
Pennings, Pleuni [1 ]
机构
[1] San Francisco State Univ, Dept Biol, San Francisco, CA 94132 USA
[2] Univ Calif San Francisco, San Francisco, CA 94143 USA
[3] Calif State Univ East Bay, Dept Stat, Hayward, CA USA
[4] Univ Calif Los Angeles, David Geffen Sch Med, Los Angeles, CA USA
关键词
All Open Access; Gold;
D O I
10.1371/journal.pcbi.1012579
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Antibiotic resistance is a global public health concern. Bacteria have evolved resistance to most antibiotics, which means that for any given bacterial infection, the bacteria may be resistant to one or several antibiotics. It has been suggested that genomic sequencing and machine learning (ML) could make resistance testing more accurate and cost-effective. Given that ML is likely to become an ever more important tool in medicine, we believe that it is important for pre-health students and others in the life sciences to learn to use ML tools. This paper provides a step-by-step tutorial to train 4 different ML models (logistic regression, random forests, extreme gradient-boosted trees, and neural networks) to predict drug resistance for Escherichia coli isolates and to evaluate their performance using different metrics and cross-validation techniques. We also guide the user in how to load and prepare the data used for the ML models. The tutorial is accessible to beginners and does not require any software to be installed as it is based on Google Colab notebooks and provides a basic understanding of the different ML models. The tutorial can be used in undergraduate and graduate classes for students in Biology, Public Health, Computer Science, or related fields.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Understanding Antibiotic Resistance Using Different Machine Learning Approaches
    Pradhan, Tanaya Priyadarshini
    Debata, N. K.
    Swarnkar, Tripti
    ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, 2020, 1082 : 63 - 73
  • [2] Using Machine Learning to Predict Antimicrobial Resistance?A Literature Review
    Sakagianni, Aikaterini
    Koufopoulou, Christina
    Feretzakis, Georgios
    Kalles, Dimitris
    Verykios, Vassilios S.
    Myrianthefs, Pavlos
    Fildisis, Georgios
    ANTIBIOTICS-BASEL, 2023, 12 (03):
  • [3] Phenotypic antibiotic resistance prediction using antibiotic resistance genes and machine learning models in Mannheimia haemolytica
    Wickware, Carmen L.
    Ellis, Audrey C.
    Verma, Mohit
    Johnson, Timothy A.
    VETERINARY MICROBIOLOGY, 2025, 302
  • [4] Using Visualization to Illustrate Machine Learning Models for Genomic Data
    Qu, Zhonglin
    Zhou, Yi
    Quang Vinh Nguyen
    Catchpoole, Daniel R.
    PROCEEDINGS OF THE AUSTRALASIAN COMPUTER SCIENCE WEEK MULTICONFERENCE (ACSW 2019), 2019,
  • [5] Determination of Antibiotic Resistance Level in Klebsiella using Machine Learning Models
    Gupta, Snehal
    Chatterjee, Sreemoyee
    Sharma, Amita
    Popolizio, Marina
    Di Lecce, Vincenzo
    Succi, Mariantonietta
    Tremonte, Patrizio
    Dario, Rita
    Rathore, Vijay Singh
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2022, VOL 1, 2023, 447 : 913 - 924
  • [6] Development of Machine Learning Models to Predict the Weld Defect Using Resistance Spot Welding Experimental Data
    Mathi, Santhosh
    Bamberg, Pedro
    Schiebahn, Alexander
    Reisgen, Uwe
    SOLDAGEM & INSPECAO, 2023, 28
  • [7] Applications of Machine Learning on Electronic Health Record Data to Combat Antibiotic Resistance
    Blechman, Samuel E.
    Wright, Erik S.
    JOURNAL OF INFECTIOUS DISEASES, 2024, 230 (05): : 1073 - 1082
  • [8] Using machine learning to predict student difficulties from learning session data
    Mushtaq Hussain
    Wenhao Zhu
    Wu Zhang
    Syed Muhammad Raza Abidi
    Sadaqat Ali
    Artificial Intelligence Review, 2019, 52 : 381 - 407
  • [9] Using machine learning to predict student difficulties from learning session data
    Hussain, Mushtaq
    Zhu, Wenhao
    Zhang, Wu
    Abidi, Syed Muhammad Raza
    Ali, Sadaqat
    ARTIFICIAL INTELLIGENCE REVIEW, 2019, 52 (01) : 381 - 407
  • [10] Ensemble Feature Learning of Genomic Data Using Support Vector Machine
    Anaissi, Ali
    Goyal, Madhu
    Catchpoole, Daniel R.
    Braytee, Ali
    Kennedy, Paul J.
    PLOS ONE, 2016, 11 (06):