Reducing false positive rate of docking-based virtual screening by active learning

被引:7
|
作者
Wang, Lei [1 ]
Shi, Shao-Hua [2 ]
Li, Hui [1 ]
Zeng, Xiang-Xiang [1 ,3 ]
Liu, Su-You
Liu, Zhao-Qian [1 ]
Deng, Ya-Feng [4 ]
Lu, Ai-Ping [5 ]
Hou, Ting-Jun [6 ]
Cao, Dong-Sheng [1 ]
机构
[1] Cent South Univ, Xiangya Sch Pharmaceut Sci, Changsha, Peoples R China
[2] Hong Kong Baptist Univ, Sch Chinese Med, Hong Kong, Peoples R China
[3] Hunan Univ, Dept Comp Sci, Changsha, Peoples R China
[4] CarbonSilicon AI Technol, Hangzhou, Peoples R China
[5] Hong Kong Baptist Univ, Inst Adv Translat Med Bone & Joint Dis, Sch Chinese Med, Hong Kong, Peoples R China
[6] Zhejiang Univ, Coll Pharmaceut Sci, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
molecular docking; machine learning-based scoring function (MLSF); active learning; virtual screening (VS); false positive; SCORING FUNCTIONS;
D O I
10.1093/bib/bbac626
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Machine learning-based scoring functions (MLSFs) have become a very favorable alternative to classical scoring functions because of their potential superior screening performance. However, the information of negative data used to construct MLSFs was rarely reported in the literature, and meanwhile the putative inactive molecules recorded in existing databases usually have obvious bias from active molecules. Here we proposed an easy-to-use method named AMLSF that combines active learning using negative molecular selection strategies with MLSF, which can iteratively improve the quality of inactive sets and thus reduce the false positive rate of virtual screening. We chose energy auxiliary terms learning as the MLSF and validated our method on eight targets in the diverse subset of DUD-E. For each target, we screened the IterBioScreen database by AMLSF and compared the screening results with those of the four control models. The results illustrate that the number of active molecules in the top 1000 molecules identified by AMLSF was significantly higher than those identified by the control models. In addition, the free energy calculation results for the top 10 molecules screened out by the AMLSF, null model and control models based on DUD-E also proved that more active molecules can be identified, and the false positive rate can be reduced by AMLSF.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Boosting Docking-Based Virtual Screening with Deep Learning
    Pereira, Janaina Cruz
    Caffarena, Ernesto Raul
    dos Santos, Cicero Nogueira
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2016, 56 (12) : 2495 - 2506
  • [2] Docking-Based Virtual Screening: Recent Developments
    Tuccinardi, Tiziano
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2009, 12 (03) : 303 - 314
  • [3] Development and application of pfiscore: A filter function for reducing the false positive rate in docking and structure-based virtual screening.
    Cross, JB
    Jalaie, M
    Gantt, SL
    Joshi, NA
    Wild, DJ
    Snow, ME
    Narasimhan, LS
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2004, 227 : U1029 - U1029
  • [4] Enhancing Scoring Performance of Docking-Based Virtual Screening Through Machine Learning
    Silva, Candida G.
    Simoes, Carlos J. V.
    Carreiras, Pedro
    Brito, Rui M. M.
    CURRENT BIOINFORMATICS, 2016, 11 (04) : 408 - 420
  • [5] Preparation of Target CETP in Docking-based Virtual Screening
    Tao, Weiye
    Wang, Laiyou
    Huang, Guoquan
    Luo, Man
    APPLIED MECHANICS AND MATERIALS II, PTS 1 AND 2, 2014, 477-478 : 1495 - +
  • [6] How good are AlphaFold models for docking-based virtual screening?
    Scardino, Valeria
    Di Filippo, Juan I.
    Cavasotto, Claudio N.
    ISCIENCE, 2023, 26 (01)
  • [7] Docking-based inverse virtual screening: methods,applications, and challenges
    Xianjin Xu
    Marshal Huang
    Xiaoqin Zou
    BiophysicsReports, 2018, 4 (01) : 1 - 16
  • [8] Design of a docking-based virtual high throughput screening system
    Qadry, SS
    McDonald, N
    METMBS'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS AND ENGINEERING TECHNIQUES IN MEDICINE AND BIOLOGICAL SCIENCES, 2003, : 282 - 287
  • [9] Docking-based virtual screening of Chk1 inhibitors
    Li, Yan
    Kim, Dong Joon
    Bode, Ann M.
    Dong, Zigang
    CANCER RESEARCH, 2011, 71
  • [10] Improving docking-based virtual screening with convolutional neural networks
    Pereira, Janaina Cruz
    Santos, Cicero
    Caffarena, Ernesto
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256