MarkerML - Marker Feature Identification in Metagenomic Datasets Using Interpretable Machine Learning

Cited by: 6
Authors
Nagpal, Sunil [1, 2, 3]
Singh, Rohan [1]
Taneja, Bhupesh [2, 3]
Mande, Sharmila S. [1]
Affiliations
[1] Tata Consultancy Serv Ltd, TCS Res, Pune 411013, India
[2] CSIR, Inst Genom & Integrat Biol GIB, New Delhi 110025, India
[3] Acad Sci & Innovat Res AcSIR, Ghaziabad 201002, India
Keywords
metagenomic biomarkers; interpretable machine learning; SHAP; microbiome; marker features; DATABASE;
DOI
10.1016/j.jmb.2022.167589
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Identification of environment-specific marker features is one of the key objectives of many metagenomic studies. It aims to identify features in microbiome datasets that may serve as markers of the contrasting or comparable states. Hypothesis testing and black-box machine-learnt models, which are conventionally used for identification of these features, are generally not exhaustive, especially because they generally do not provide any quantifiable relevance (context) of or between the identified features. We present the MarkerML web server, which seeks to leverage the emergence of interpretable machine learning for facilitating the contextual discovery of metagenomic features of interest. It does so through a comprehensive and automated application of the concept of Shapley Additive Explanations alongside compositionality-aware hypothesis testing for multivariate microbiome datasets. MarkerML not only helps in identification of marker features, but also enables insights into the role and interdependence of the identified features in driving the decision making of the supervised machine-learnt model. Generation of high-quality and intuitive visualizations spanning prediction effect plots, model performance reports, feature dependency plots, Shapley- and abundance-informed cladograms (Sungrams), and hypothesis-tested violin plots, along with necessary provisions for excluding participant bias and ensuring reproducibility of results, further seeks to make the platform a useful asset for scientists in the field of microbiome research (and even beyond). The MarkerML web server is freely available for the academic community at https://microbiome.igib.res.in/markerml/. (c) 2022 Elsevier Ltd. All rights reserved.
Pages: 12
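The abstract pairs Shapley Additive Explanations with compositionality-aware handling of microbiome data. The sketch below illustrates that general idea only; it is not MarkerML's implementation, which this record does not describe. It assumes a centered log-ratio (CLR) transform as the compositional step, a hypothetical `toy_model` standing in for a trained classifier, and a single reference sample as the baseline (SHAP implementations typically average over a background dataset instead). All names and numbers are illustrative.

```python
from itertools import combinations
from math import factorial, log


def clr(counts, pseudocount=1.0):
    """Centered log-ratio transform of one sample's taxon counts."""
    logs = [log(c + pseudocount) for c in counts]
    mean_log = sum(logs) / len(logs)
    return [v - mean_log for v in logs]


def shapley_values(model, x, baseline):
    """Exact Shapley values of model(x) relative to model(baseline).

    Enumerates every feature subset, so it is only practical for a
    handful of features.
    """
    n = len(x)

    def value(subset):
        # Features in `subset` take their value from x, the rest from baseline.
        z = [x[i] if i in subset else baseline[i] for i in range(n)]
        return model(z)

    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for k in range(n):
            # Shapley weight for coalitions of size k that exclude feature i.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for coalition in combinations(others, k):
                s = set(coalition)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


def toy_model(z):
    # Hypothetical score: taxon 0 raises it, taxon 1 lowers it,
    # and taxa 0 and 2 interact.
    return 2.0 * z[0] - 1.0 * z[1] + 0.5 * z[0] * z[2]


sample = clr([120, 30, 50])      # made-up counts for three taxa
reference = clr([40, 80, 60])    # made-up baseline sample
phi = shapley_values(toy_model, sample, reference)
```

Each entry of `phi` is the contribution of one taxon to the difference between the model's score on `sample` and on `reference`, and the entries sum to that difference. The interaction term is split between taxa 0 and 2, which is the kind of interdependence the abstract refers to.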