Performance Analysis of XGBoost Models with Ultrafast Shape Recognition Descriptors in Ligand-Based Virtual Screening

被引:2
|
作者
Robles, Jose [1 ]
Sotelo, Freedy [1 ]
Rojas, Carlos [1 ]
Hurtado, Jose [1 ]
Lopez, Jorge [2 ]
机构
[1] Univ Nacl Ingn, Fac Ingn Mecan, Lima, Peru
[2] Univ Ricardo Palma, Santiago De Surco, Peru
来源
2021 8TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS RESEARCH AND APPLICATIONS, ICBRA 2021 | 2021年
关键词
Drug discovery; machine learning; extreme gradient boosting; virtual screening; ELECTROSHAPE;
D O I
10.1145/3487027.3487029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ligand-Based Virtual Screening (LBVS) is a powerful computational approach to develop drug discovery studies when at least an active ligand from a target molecule is known. In these studies, different metrics are used to measure the similarity of physicochemical properties of the actives and query molecules. However, as the metric selected directly affects the performance of the study, machine learning techniques have shown to be a good alternative to replace and compute these similarity measurements. In this context, we developed an XGBoost Model to perform similarity measurements as a new alternative in LBVS studies. For this purpose, we used a diverse dataset from the Directory of Useful Decoys-Enhanced (DUD-E) and applied the Ultrafast Shape Recognition (USR) and USR with CREDO atom types (USRCAT) methods as a baseline to compare the performance of the XGBoost model in terms of the Enrichment Factor (EF) and Area Under the Curve (AUC) metrics. Moreover, to compare the performance of XGBoost over other machine learning techniques, an Artificial Neural Network (ANN) model was implemented. Results from both machine learning models shows a good improvement achieving more than twice the EF of traditional USR methods. However, the XGBoost Model has the highest performance over ANN's with both USR and USRCAT descriptors.
引用
收藏
页码:8 / 14
页数:7
相关论文
共 50 条
  • [1] Applying Machine Learning to Ultrafast Shape Recognition in Ligand-Based Virtual Screening
    Bonanno, Etienne
    Ebejer, Jean-Paul
    FRONTIERS IN PHARMACOLOGY, 2020, 10
  • [2] Ultrafast shape recognition: Evaluating a new ligand-based virtual screening technology
    Ballester, Pedro J.
    Finn, Paul W.
    Richards, W. Graham
    JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2009, 27 (07): : 836 - 845
  • [3] Deep Sequence Models for Ligand-Based Virtual Screening
    Nair, Viswajit Vinod
    Pradeep, Sonaal Pathlai
    Nair, Vaishnavi Sudheer
    Pournami, P. N.
    Gopakumar, G.
    Jayaraj, P. B.
    JOURNAL OF COMPUTATIONAL BIOPHYSICS AND CHEMISTRY, 2022, 21 (02): : 207 - 217
  • [4] Ligand-based virtual screening under partial shape constraints
    von Behren, Mathias M.
    Rarey, Matthias
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2017, 31 (04) : 335 - 347
  • [5] Ligand-based virtual screening under partial shape constraints
    Mathias M. von Behren
    Matthias Rarey
    Journal of Computer-Aided Molecular Design, 2017, 31 : 335 - 347
  • [6] Performance of Machine Learning Methods for Ligand-Based Virtual Screening
    Plewczynski, Dariusz
    Spieser, Stephane A. H.
    Koch, Uwe
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2009, 12 (04) : 358 - 368
  • [7] Ligand-based approaches in virtual screening
    Douguet, Dominique
    CURRENT COMPUTER-AIDED DRUG DESIGN, 2008, 4 (03) : 180 - 190
  • [8] Benchmarking Platform for Ligand-Based Virtual Screening
    Skoda, Petr
    Hoksza, David
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 1220 - 1227
  • [9] Ligand-based structural hypotheses for virtual screening
    Jain, AN
    JOURNAL OF MEDICINAL CHEMISTRY, 2004, 47 (04) : 947 - 961
  • [10] New methodologies for ligand-based virtual screening
    Stahura, FL
    Bajorath, M
    CURRENT PHARMACEUTICAL DESIGN, 2005, 11 (09) : 1189 - 1202