Using Machine Learning Methods to Predict Experimental High Throughput Screening Data

被引:7
|
作者
Mballo, Cherif [1 ]
Makarenkov, Vladimir [1 ]
机构
[1] Univ Quebec, Dept Informat, Montreal, PQ H3C 3P8, Canada
关键词
CART; decision trees; drug target; hit; k-nearest neighbors (kNN); linear discriminant analysis (LDA); neural networks (NN); partial least squares (PLS); ROC curve; sampling; support vector machines (SVM); virtual high throughput screening; SELECTION; LIKENESS;
D O I
10.2174/138620710791292958
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
High throughput screening (HTS) remains a very costly process notwithstanding many recent technological advances in the field of biotechnology. In this study we consider the application of machine learning methods for predicting experimental HTS measurements. Such a virtual HTS analysis can be based on the results of real HTS campaigns carried out with similar compounds libraries and similar drug targets. In this way, we analyzed Test assay from McMaster University Data Mining and Docking Competition [1] using binary decision trees, neural networks, support vector machines (SVM), linear discriminant analysis, k-nearest neighbors and partial least squares. First, we studied separately the sets of molecular and atomic descriptors in order to establish which of them provides a better prediction. Then, the comparison of the six considered machine learning methods was made in terms of false positives and false negatives, method's sensitivity and enrichment factor. Finally, a variable selection procedure allowing one to improve the method's sensitivity was implemented and applied in the framework of polynomial SVM.
引用
收藏
页码:430 / 441
页数:12
相关论文
共 50 条
  • [1] Virtual High Throughput Screening Using Machine Learning Methods
    Mballo, Cherif
    Makarenkov, Vladimir
    [J]. CLASSIFICATION AS A TOOL FOR RESEARCH, 2010, : 517 - 524
  • [2] BRAIN CANCER PREDICTION USING MACHINE LEARNING METHODS AND HIGH-THROUGHPUT MOLECULAR DATA
    Ma, B. S.
    Chang, Q.
    Geng, Y.
    Liu, G. H.
    Dong, H.
    Sun, Y. Q.
    [J]. JOURNAL OF INVESTIGATIVE MEDICINE, 2017, 65 (07) : A1 - A1
  • [3] Two effective methods for correcting experimental high-throughput screening data
    Dragiev, Plamen
    Nadon, Robert
    Makarenkov, Vladimir
    [J]. BIOINFORMATICS, 2012, 28 (13) : 1775 - 1782
  • [4] Novel machine learning models to predict endocrine disruption activity for high-throughput chemical screening
    Collins, Sean P.
    Barton-Maclaren, Tara S.
    [J]. FRONTIERS IN TOXICOLOGY, 2022, 4
  • [5] A machine learning approach to predict radioxenon isotopes concentrations using experimental data
    Azimi, Sepideh Alsadat
    Afarideh, Hossein
    Chai, Jong-Seo
    Kalinowski, Martin
    [J]. RADIATION PHYSICS AND CHEMISTRY, 2023, 213
  • [6] High-Throughput Screening and Prediction of High Modulus of Resilience Polymers Using Explainable Machine Learning
    Yue, Tianle
    He, Jinlong
    Tao, Lei
    Li, Ying
    [J]. JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2023, 19 (14) : 4641 - 4653
  • [7] High-throughput screening, next generation sequencing and machine learning: advanced methods in enzyme engineering
    Vanella, Rosario
    Kovacevic, Gordana
    Doffini, Vanni
    de Santaella, Jaime Fernandez
    Nash, Michael A.
    [J]. CHEMICAL COMMUNICATIONS, 2022, 58 (15) : 2455 - 2467
  • [8] Using machine learning methods to predict hepatic encephalopathy in cirrhotic patients with unbalanced data
    Yang, Hong
    Li, Xinxin
    Cao, Hongyan
    Cui, Yuehua
    Luo, Yanhong
    Liu, Jinchun
    Zhang, Yanbo
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 211
  • [9] Using Machine Learning Methods to Predict Autism Syndrome
    Alhakami, Hosam
    Alajlani, Fatimah
    Alghamdi, Alshymaa
    Baz, Abdullah
    Alsubait, Tahani
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2020, 20 (04): : 221 - 228
  • [10] Missing data analysis using machine learning methods to predict the performance of technical students
    Melo Junior, Gilberto de
    Alcala, Symone G. Soares
    Furriel, Geovanne Pereira
    Vieira, Silvio L.
    [J]. REVISTA BRASILEIRA DE COMPUTACAO APLICADA, 2020, 12 (02): : 134 - 143