Using Machine Learning Methods to Predict Experimental High Throughput Screening Data

被引:7
|
作者
Mballo, Cherif [1 ]
Makarenkov, Vladimir [1 ]
机构
[1] Univ Quebec, Dept Informat, Montreal, PQ H3C 3P8, Canada
关键词
CART; decision trees; drug target; hit; k-nearest neighbors (kNN); linear discriminant analysis (LDA); neural networks (NN); partial least squares (PLS); ROC curve; sampling; support vector machines (SVM); virtual high throughput screening; SELECTION; LIKENESS;
D O I
10.2174/138620710791292958
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
High throughput screening (HTS) remains a very costly process notwithstanding many recent technological advances in the field of biotechnology. In this study we consider the application of machine learning methods for predicting experimental HTS measurements. Such a virtual HTS analysis can be based on the results of real HTS campaigns carried out with similar compounds libraries and similar drug targets. In this way, we analyzed Test assay from McMaster University Data Mining and Docking Competition [1] using binary decision trees, neural networks, support vector machines (SVM), linear discriminant analysis, k-nearest neighbors and partial least squares. First, we studied separately the sets of molecular and atomic descriptors in order to establish which of them provides a better prediction. Then, the comparison of the six considered machine learning methods was made in terms of false positives and false negatives, method's sensitivity and enrichment factor. Finally, a variable selection procedure allowing one to improve the method's sensitivity was implemented and applied in the framework of polynomial SVM.
引用
收藏
页码:430 / 441
页数:12
相关论文
共 50 条
  • [21] High throughput screening of new piezoelectric materials using graph machine learning and knowledge graph approach
    Anand, Archit
    Kumari, Priyanka
    Kalyani, Ajay Kumar
    COMPUTATIONAL MATERIALS SCIENCE, 2025, 246
  • [22] High-Throughput Screening and Accurate Prediction of Ionic Liquid Viscosities Using Interpretable Machine Learning
    Mohan, Mood
    Jetti, Karuna Devi
    Guggilam, Sreelekha
    Smith, Micholas Dean
    Kidder, Michelle K.
    Smith, Jeremy C.
    ACS SUSTAINABLE CHEMISTRY & ENGINEERING, 2024, 12 (18): : 7040 - 7054
  • [23] High-throughput screening of tribological properties of monolayer films using molecular dynamics and machine learning
    Quach, Co D.
    Gilmer, Justin B.
    Pert, Daniel
    Mason-Hogans, Akanke
    Iacovella, Christopher R.
    Cummings, Peter T.
    McCabe, Clare
    JOURNAL OF CHEMICAL PHYSICS, 2022, 156 (15):
  • [24] Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires
    Perez-Porras, Fernando-Juan
    Trivino-Tarradas, Paula
    Cima-Rodriguez, Carmen
    Merono-de-Larriva, Jose-Emilio
    Garcia-Ferrer, Alfonso
    Mesas-Carrascosa, Francisco-Javier
    SENSORS, 2021, 21 (11)
  • [25] Data-driven discovery of innate immunomodulators via machine learning-guided high throughput screening
    Tang, Yifeng
    Kim, Jeremiah Y.
    Ip, Carman K. M.
    Bahmani, Azadeh
    Chen, Qing
    Rosenberger, Matthew G.
    Esser-Kahn, Aaron P.
    Ferguson, Andrew L.
    CHEMICAL SCIENCE, 2023, 14 (44) : 12747 - 12766
  • [26] A high-throughput architecture for anomaly detection in streaming data using machine learning algorithms
    Surianarayanan C.
    Kunasekaran S.
    Chelliah P.R.
    International Journal of Information Technology, 2024, 16 (1) : 493 - 506
  • [27] Machine Learning Assisted Hit Prioritization for High Throughput Screening in Drug Discovery
    Boldini, Davide
    Friedrich, Lukas
    Kuhn, Daniel
    Sieber, Stephan A.
    ACS CENTRAL SCIENCE, 2024, 10 (04) : 823 - 832
  • [28] Multidimensional high-throughput screening for mixed perovskite materials with machine learning
    Chen, Chengbing
    Xiao, Jianrong
    Wang, Zhiyong
    JOURNAL OF CHEMICAL PHYSICS, 2025, 162 (11):
  • [29] A multi-fidelity machine learning approach to high throughput materials screening
    Fare, Clyde
    Fenner, Peter
    Benatan, Matthew
    Varsi, Alessandro
    Pyzer-Knapp, Edward O. O.
    NPJ COMPUTATIONAL MATERIALS, 2022, 8 (01)
  • [30] A multi-fidelity machine learning approach to high throughput materials screening
    Clyde Fare
    Peter Fenner
    Matthew Benatan
    Alessandro Varsi
    Edward O. Pyzer-Knapp
    npj Computational Materials, 8