MarkerML - Marker Feature Identification in Metagenomic Datasets Using Interpretable Machine Learning

被引:6
|
作者
Nagpal, Sunil [1 ,2 ,3 ]
Singh, Rohan [1 ]
Taneja, Bhupesh [2 ,3 ]
Mande, Sharmila S. [1 ]
机构
[1] Tata Consultancy Serv Ltd, TCS Res, Pune 411013, India
[2] CSIR, Inst Genom & Integrat Biol GIB, New Delhi 110025, India
[3] Acad Sci & Innovat Res AcSIR, Ghaziabad 201002, India
关键词
metagenomic biomarkers; interpretable machine learning; SHAP; microbiome; marker features; DATABASE;
D O I
10.1016/j.jmb.2022.167589
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Identification of environment specific marker-features is one of the key objectives of many metagenomic studies. It aims to identify such features in microbiome datasets that may serve as markers of the contrasting or comparable states. Hypothesis testing and black-box machine learnt models which are conventionally used for identification of these features are generally not exhaustive, especially because they generally do-not provide any quantifiable relevance (context) of/between the identified features. We present MarkerML web-server, that seeks to leverage the emergence of interpretable machine learning for facilitating the contextual discovery of metagenomic features of interest. It does so through a comprehensive and automated application of the concept of Shapley Additive Explanations in companionship to the compositionality accounted hypothesis testing for the multi-variate microbiome datasets. MarkerML not only helps in identification of marker-features, but also enables insights into the role and interdependence of the identified features in driving the decision making of the supervised machine learnt model. Generation of high quality and intuitive visualizations spanning prediction effect plots, model performance reports, feature dependency plots, Shapley and abundance informed cladograms (Sungrams), hypothesis tested violin plots along-with necessary provisions for excluding the participant bias and ensuring reproducibility of results, further seek to make the platform a useful asset for the scientists in the field of microbiome (and even beyond). The MarkerML web-server is freely available for the academic community at https://microbiome.igib.res.in/markerml/.(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Data-driven identification of predictive risk biomarkers for subgroups of osteoarthritis using interpretable machine learning
    Nielsen, Rikke Linnemann
    Monfeuga, Thomas
    Kitchen, Robert R.
    Egerod, Line
    Leal, Luis G.
    Schreyer, August Thomas Hjortshoj
    Gade, Frederik Steensgaard
    Sun, Carol
    Helenius, Marianne
    Simonsen, Lotte
    Willert, Marianne
    Tahrani, Abd A.
    McVey, Zahra
    Gupta, Ramneek
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [42] Data oversampling and imbalanced datasets: an investigation of performance for machine learning and feature engineering
    Mujahid, Muhammad
    Kina, Erol
    Rustam, Furqan
    Villar, Monica Gracia
    Alvarado, Eduardo Silva
    Diez, Isabel De La Torre
    Ashraf, Imran
    JOURNAL OF BIG DATA, 2024, 11 (01)
  • [43] Feature Extraction and Classification from Planetary Science Datasets enabled by Machine Learning
    Nixon, Conor A.
    Yahn, Zachary
    Duncan, Ethan
    Neidel, Ian
    Mills, Alyssa C.
    Seignovert, Benoit
    Larsen, Andrew
    Gansler, Kathryn
    Liles, Charles
    Walker, Catherine C.
    Trent, Douglas M.
    Santerre, John
    2023 IEEE AEROSPACE CONFERENCE, 2023,
  • [44] Epileptic seizure detection by using interpretable machine learning models
    Zhao, Xuyang
    Yoshida, Noboru
    Ueda, Tetsuya
    Sugano, Hidenori
    Tanaka, Toshihisa
    JOURNAL OF NEURAL ENGINEERING, 2023, 20 (01)
  • [45] Prediction of Diabetes at Early Stage using Interpretable Machine Learning
    Islam, Mohammad Sajidul
    Alam, Md Minul
    Ahamed, Afsana
    Meerza, Syed Imran Ali
    SOUTHEASTCON 2023, 2023, : 261 - 265
  • [46] Interpretable Catalysis Models Using Machine Learning with Spectroscopic Descriptors
    Wang, Song
    Jiang, Jun
    ACS CATALYSIS, 2023, 13 (11) : 7428 - 7436
  • [47] Toward Interpretable Machine Learning: Constructing Polynomial Models Based on Feature Interaction Trees
    Jang, Jisoo
    Kim, Mina
    Bui, Tien-Cuong
    Li, Wen-Syan
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT II, 2023, 13936 : 159 - 170
  • [48] Interpretable Stroke Risk Prediction Using Machine Learning Algorithms
    Zafeiropoulos, Nikolaos
    Mavrogiorgou, Argyro
    Kleftakis, Spyridon
    Mavrogiorgos, Konstantinos
    Kiourtis, Athanasios
    Kyriazis, Dimosthenis
    INTELLIGENT SUSTAINABLE SYSTEMS, WORLDS4 2022, VOL 2, 2023, 579 : 647 - 656
  • [49] Prediction of wave runup on beaches using interpretable machine learning
    Kim, Taeyoon
    Lee, Woo-Dong
    OCEAN ENGINEERING, 2024, 297
  • [50] SeizFt: Interpretable Machine Learning for Seizure Detection Using Wearables
    Al-Hussaini, Irfan
    Mitchell, Cassie S.
    BIOENGINEERING-BASEL, 2023, 10 (08):