Machine Learning Assisted Hit Prioritization for High Throughput Screening in Drug Discovery

被引:7
|
作者
Boldini, Davide [1 ]
Friedrich, Lukas [2 ]
Kuhn, Daniel [2 ]
Sieber, Stephan A. [1 ]
机构
[1] Tech Univ Munich, Ctr Funct Prot Assemblies CPA, TUM Sch Nat Sci, Dept Biosci, D-85748 Garching, Germany
[2] Healthcare business Merck KGaA, Darmstadt, Germany
基金
欧洲研究理事会;
关键词
FALSE POSITIVES; ASSAY INTERFERENCE; PAINS;
D O I
10.1021/acscentsci.3c01517
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Efficient prioritization of bioactive compounds from high throughput screening campaigns is a fundamental challenge for accelerating drug development efforts. In this study, we present the first data-driven approach to simultaneously detect assay interferents and prioritize true bioactive compounds. By analyzing the learning dynamics during training of a gradient boosting model on noisy high throughput screening data using a novel formulation of sample influence, we are able to distinguish between compounds exhibiting the desired biological response and those producing assay artifacts. Therefore, our method enables false positive and true positive detection without relying on prior screens or assay interference mechanisms, making it applicable to any high throughput screening campaign. We demonstrate that our approach consistently excludes assay interferents with different mechanisms and prioritizes biologically relevant compounds more efficiently than all tested baselines, including a retrospective case study simulating its use in a real drug discovery campaign. Finally, our tool is extremely computationally efficient, requiring less than 30 s per assay on low-resource hardware. As such, our findings show that our method is an ideal addition to existing false positive detection tools and can be used to guide further pharmacological optimization after high throughput screening campaigns.
引用
收藏
页码:823 / 832
页数:10
相关论文
共 50 条
  • [1] A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery
    Rahman, A. S. M. Zisanur
    Liu, Chengyou
    Sturm, Hunter
    Hogan, Andrew M.
    Davis, Rebecca
    Hu, Pingzhao
    Cardona, Silvia T.
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (10)
  • [2] High throughput screening in drug discovery
    Carnero A.
    Clinical and Translational Oncology, 2006, 8 (7) : 482 - 490
  • [3] MF-PCBA: Multifidelity High-Throughput Screening Benchmarks for Drug Discovery and Machine Learning
    Buterez, David
    Janet, Jon Paul
    Kiddle, Steven J.
    Lio, Pietro
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 63 (09) : 2667 - 2678
  • [4] Machine learning for hit discovery: Recent work in virtual screening and de novo drug design
    Amaro, Rommie
    Parks, Conor
    Gaieb, Zied
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 258
  • [5] High throughput screening for new drug discovery
    Lin, BB
    JOURNAL OF FOOD AND DRUG ANALYSIS, 1995, 3 (04) : 233 - 241
  • [6] High-throughput screening for drug discovery
    Broach, JR
    Thorner, J
    NATURE, 1996, 384 (6604) : 14 - 16
  • [7] High Throughput Screening for Drug Discovery of Autophagy Modulators
    Shu, Chih-Wen
    Liu, Pei-Feng
    Huang, Chun-Ming
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2012, 15 (09) : 721 - 729
  • [8] High-throughput screening, metabolomics and drug discovery
    Harrigan, GG
    Yates, LA
    IDRUGS, 2006, 9 (03) : 188 - 192
  • [9] High Throughput Screening for Drug Discovery and Virus Detection
    Oke, Adetola
    Sahin, Deniz
    Chen, Xin
    Shang, Ying
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2022, 25 (09) : 1518 - 1533
  • [10] High Throughput Screening in Drug Discovery: Problems and Solutions
    Hushpulian, D. M.
    Gaisina, I. N.
    Nikulin, S. V.
    Chubar, T. A.
    Savin, S. S.
    Gazaryan, I. G.
    Tishkov, V. I.
    MOSCOW UNIVERSITY CHEMISTRY BULLETIN, 2024, 79 (02) : 93 - 104