Perspective Uncovering and tackling fundamental limitations of compound potency predictions using machine learning models

Authors
Janela, Tiago [1,2]
Bajorath, Juergen [1,2,3]
Affiliations
[1] Univ Bonn, Dept Life Sci Informat & Data Sci, B IT, Friedrich Hirzebruch Allee 5-6, D-53115 Bonn, Germany
[2] Univ Bonn, Lamarr Inst Machine Learning & Artificial Intellig, Friedrich Hirzebruch Allee 5-6, D-53115 Bonn, Germany
[3] Univ Bonn, Limes Inst, Program Unit Chem Biol & Med Chem, B IT, Friedrich Hirzebruch Allee 5-6, D-53115 Bonn, Germany
Source
CELL REPORTS PHYSICAL SCIENCE | 2024, Vol. 5, Issue 06
Keywords
DRUG DISCOVERY; QSAR
DOI
10.1016/j.xcrp.2024.101988
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Molecular property predictions play a central role in computer-aided drug discovery. Although a variety of physicochemical (e.g., solubility or chemical reactivity) or physiological properties (e.g., metabolic stability or toxicity) can be predicted, biological activity is by far the most frequently investigated compound feature. Activity predictions are carried out in a qualitative (target-based activity, through compound classification) or quantitative (compound potency or [...]) manner. [...] studies have evaluated and compared different machine learning methods for activity and potency predictions, recently with a focus on deep learning. Regardless of the methods used, these studies generally rely on conventional benchmark settings. Recent work has shown that potency prediction benchmarks have severe general limitations that have long been unnoticed but prevent a reliable assessment of different methods and their relative performance. In this perspective, we outline general limitations of benchmark settings for compound potency predictions, introduce potential alternatives enabling a more realistic assessment of state-of-the-art predictive models, and discuss future directions for elucidating predictions and further increasing their impact.
Pages: 11