Using Machine Learning Methods to Predict Experimental High Throughput Screening Data

被引:7
|
作者
Mballo, Cherif [1 ]
Makarenkov, Vladimir [1 ]
机构
[1] Univ Quebec, Dept Informat, Montreal, PQ H3C 3P8, Canada
关键词
CART; decision trees; drug target; hit; k-nearest neighbors (kNN); linear discriminant analysis (LDA); neural networks (NN); partial least squares (PLS); ROC curve; sampling; support vector machines (SVM); virtual high throughput screening; SELECTION; LIKENESS;
D O I
10.2174/138620710791292958
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
High throughput screening (HTS) remains a very costly process notwithstanding many recent technological advances in the field of biotechnology. In this study we consider the application of machine learning methods for predicting experimental HTS measurements. Such a virtual HTS analysis can be based on the results of real HTS campaigns carried out with similar compounds libraries and similar drug targets. In this way, we analyzed Test assay from McMaster University Data Mining and Docking Competition [1] using binary decision trees, neural networks, support vector machines (SVM), linear discriminant analysis, k-nearest neighbors and partial least squares. First, we studied separately the sets of molecular and atomic descriptors in order to establish which of them provides a better prediction. Then, the comparison of the six considered machine learning methods was made in terms of false positives and false negatives, method's sensitivity and enrichment factor. Finally, a variable selection procedure allowing one to improve the method's sensitivity was implemented and applied in the framework of polynomial SVM.
引用
收藏
页码:430 / 441
页数:12
相关论文
共 50 条
  • [31] High throughput screening methods for ω-transaminases
    Mathew, Sam
    Shin, Giyoung
    Shon, Minsu
    Yun, Hyungdon
    BIOTECHNOLOGY AND BIOPROCESS ENGINEERING, 2013, 18 (01) : 1 - 7
  • [32] High throughput screening methods for ω-transaminases
    Sam Mathew
    Giyoung Shin
    Minsu Shon
    Hyungdon Yun
    Biotechnology and Bioprocess Engineering, 2013, 18 : 1 - 7
  • [33] Screening of Enhanced Oil Recovery Methods Using Supervised Machine Learning Predicated on Range Data
    Harrison, Gbubemi H.
    Lamboi, Josephine A.
    APPLIED INTELLIGENCE AND INFORMATICS, AII 2023, 2024, 2065 : 430 - 441
  • [34] EXPERIMENTAL MODELS AND HIGH THROUGHPUT SCREENING
    Du, Yuhong
    Kasinski, Andrea
    Thomas, Shala
    Sun, Aiming
    Gewirtz, Andrew
    Snyder, James
    Liotta, Dennis
    Khuri, Fadlo R.
    Fu, Haian
    JOURNAL OF THORACIC ONCOLOGY, 2011, 6 (06) : S114 - S115
  • [35] Active Learning for High Throughput Screening
    De Grave, Kurt
    Ramon, Jan
    De Raedt, Luc
    DISCOVERY SCIENCE, PROCEEDINGS, 2008, 5255 : 185 - 196
  • [36] Machine Learning Interatomic Potential for High-Throughput Screening of High-Entropy Alloys
    Anup Pandey
    Jonathan Gigax
    Reeju Pokharel
    JOM, 2022, 74 : 2908 - 2920
  • [37] Machine Learning Interatomic Potential for High-Throughput Screening of High-Entropy Alloys
    Pandey, Anup
    Gigax, Jonathan
    Pokharel, Reeju
    JOM, 2022, 74 (08) : 2908 - 2920
  • [38] Machine Learning-Based High-Throughput Screening for High-Stability Polyimides
    Luo, Gaoyang
    Huan, Feicheng
    Sun, Yuwei
    Shi, Feng
    Deng, Shengwei
    Wang, Jian-guo
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2024, 63 (48) : 21110 - 21122
  • [39] ANALYSIS OF LArTPC DATA USING MACHINE LEARNING METHODS
    Falko, A.
    Gogota, O.
    Yermolenko, R.
    Kadenko, I.
    JOURNAL OF PHYSICAL STUDIES, 2024, 28 (01):
  • [40] Accelerated screening of functional atomic impurities in halide perovskites using high-throughput computations and machine learning
    Arun Mannodi-Kanakkithodi
    Maria K. Y. Chan
    Journal of Materials Science, 2022, 57 : 10736 - 10754