Machine learning models with distinct Shapley and interpretation for chemical compound predictions

被引:0
|
作者
Roth, Jannik P. [1 ]
Bajorath, Juergen [1 ,2 ,3 ]
机构
[1] Univ Bonn, Dept Life Sci Informat & Data Sci, B IT, Friedrich Hirzebruch Allee 5-6, D-53115 Bonn, Germany
[2] Univ Bonn, Lamarr Inst Machine Learning & Artificial Intellig, Friedrich Hirzebruch Allee 5-6, D-53115 Bonn, Germany
[3] Univ Bonn, Limes Inst, Program Unit Chem Biol & Med Chem, Friedrich Hirzebruch Allee 5-6, D-53115 Bonn, Germany
来源
CELL REPORTS PHYSICAL SCIENCE | 2024年 / 5卷 / 08期
关键词
EXPLAINABLE AI;
D O I
10.1016/j.xcrp.2024.102110
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Explaining black box predictions of machine learning (ML) models is a topical issue in artificial intelligence (AI) research. For the identification of features determining predictions, the Shapley value formalism originally developed in game theory is widely used in different fields. Typically, Shapley values quantifying feature contributions to predictions need to be approximated in machine learning. We introduce a framework for the calculation of exact Shapley values for 4 kernel functions used in support vector machine (SVM) models and analyze consistently accurate compound activity predictions based on exact Shapley values. Dramatic changes in feature contributions are detected depending on the kernel function, leading to mostly distinct explanations of predictions of the same test compounds. Very different feature contributions yield comparable predictions, which complicate numerical and graphical model explanation and decouple feature attribution and human interpretability.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Impact of data bias on machine learning for crystal compound synthesizability predictions
    Davariashtiyani, Ali
    Wang, Busheng
    Hajinazar, Samad
    Zurek, Eva
    Kadkhodaei, Sara
    Machine Learning: Science and Technology, 2024, 5 (04):
  • [22] Reliable prediction of software defects using Shapley interpretable machine learning models
    Al-Smadi, Yazan
    Eshtay, Mohammed
    Al-Qerem, Ahmad
    Nashwan, Shadi
    Ouda, Osama
    Abd El-Aziz, A. A.
    EGYPTIAN INFORMATICS JOURNAL, 2023, 24 (03)
  • [23] Ab Initio Machine Learning in Chemical Compound Space
    Huang, Bing
    von Lilienfeld, O. Anatole
    CHEMICAL REVIEWS, 2021, 121 (16) : 10001 - 10036
  • [24] Machine learning architectures to enhance chemical functional group predictions
    Fine, Jonathan
    Rajasekar, Anand
    Chopra, Gaurav
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 257
  • [25] Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values
    Lamens, Alec
    Bajorath, Juergen
    MOLECULES, 2023, 28 (14):
  • [26] Machine Learning Based Spectral Interpretation in Chemical Detection
    Riley, Patrick C.
    Deshpande, Samir V.
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS, 2019, 11006
  • [27] Analysis of Conditions for Reliable Predictions by Moodle Machine Learning Models
    Bognar, Laszlo
    Fauszt, Tibor
    Nagy, Gabor Zsolt
    INTERNATIONAL JOURNAL OF EMERGING TECHNOLOGIES IN LEARNING, 2021, 16 (06) : 106 - 121
  • [28] MolPredictX: Online Biological Activity Predictions by Machine Learning Models
    Scotti, Marcus Tullius
    Herrera-Acevedo, Chonny
    Barros de Menezes, Renata Priscila
    Martin, Holli-Joi
    Muratov, Eugene N.
    de Souza Silva, Avilla Italo
    Albuquerque, Emmanuella Faustino
    Calado, Lucas Ferreira
    Coy-Barrera, Ericsson
    Scotti, Luciana
    MOLECULAR INFORMATICS, 2022, 41 (12)
  • [29] An Empirical Study on Machine Learning Models for Wind Power Predictions
    Liu, Yiqian
    Zhang, Huajie
    2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 758 - 763
  • [30] Toward interpretable machine learning: evaluating models of heterogeneous predictions
    Zhang, Ruixun
    ANNALS OF OPERATIONS RESEARCH, 2024,