Reducing false positive rate of docking-based virtual screening by active learning

被引:7
|
作者
Wang, Lei [1 ]
Shi, Shao-Hua [2 ]
Li, Hui [1 ]
Zeng, Xiang-Xiang [1 ,3 ]
Liu, Su-You
Liu, Zhao-Qian [1 ]
Deng, Ya-Feng [4 ]
Lu, Ai-Ping [5 ]
Hou, Ting-Jun [6 ]
Cao, Dong-Sheng [1 ]
机构
[1] Cent South Univ, Xiangya Sch Pharmaceut Sci, Changsha, Peoples R China
[2] Hong Kong Baptist Univ, Sch Chinese Med, Hong Kong, Peoples R China
[3] Hunan Univ, Dept Comp Sci, Changsha, Peoples R China
[4] CarbonSilicon AI Technol, Hangzhou, Peoples R China
[5] Hong Kong Baptist Univ, Inst Adv Translat Med Bone & Joint Dis, Sch Chinese Med, Hong Kong, Peoples R China
[6] Zhejiang Univ, Coll Pharmaceut Sci, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
molecular docking; machine learning-based scoring function (MLSF); active learning; virtual screening (VS); false positive; SCORING FUNCTIONS;
D O I
10.1093/bib/bbac626
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Machine learning-based scoring functions (MLSFs) have become a very favorable alternative to classical scoring functions because of their potential superior screening performance. However, the information of negative data used to construct MLSFs was rarely reported in the literature, and meanwhile the putative inactive molecules recorded in existing databases usually have obvious bias from active molecules. Here we proposed an easy-to-use method named AMLSF that combines active learning using negative molecular selection strategies with MLSF, which can iteratively improve the quality of inactive sets and thus reduce the false positive rate of virtual screening. We chose energy auxiliary terms learning as the MLSF and validated our method on eight targets in the diverse subset of DUD-E. For each target, we screened the IterBioScreen database by AMLSF and compared the screening results with those of the four control models. The results illustrate that the number of active molecules in the top 1000 molecules identified by AMLSF was significantly higher than those identified by the control models. In addition, the free energy calculation results for the top 10 molecules screened out by the AMLSF, null model and control models based on DUD-E also proved that more active molecules can be identified, and the false positive rate can be reduced by AMLSF.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Identification of PDZ ligands by docking-based virtual screening for the development of novel analgesic agents
    Bouzidi, Naoual
    Deokar, Hemantkumar
    Vogrig, Alexandre
    Boucherle, Benjamin
    Ripoche, Isabelle
    Abrunhosa-Thomas, Isabelle
    Dorr, Liam
    Wattiez, Anne-Sophie
    Lian, Lu-Yun
    Marin, Philippe
    Courteix, Christine
    Ducki, Sylvie
    BIOORGANIC & MEDICINAL CHEMISTRY LETTERS, 2013, 23 (09) : 2624 - 2627
  • [32] Docking-based inverse virtual screening strategy for identification of novel protein targets for triclosan
    Bhardwaj, Prashant
    Biswas, G. P.
    Bhunia, Biswanath
    CHEMOSPHERE, 2019, 235 : 976 - 984
  • [33] Docking-Based Virtual Screening of Covalently Binding Ligands: An Orthogonal Lead Discovery Approach
    Schroeder, Joerg
    Klinger, Anette
    Oellien, Frank
    Marhoefer, Richard J.
    Duszenko, Michael
    Selzer, Paul M.
    JOURNAL OF MEDICINAL CHEMISTRY, 2013, 56 (04) : 1478 - 1490
  • [34] Combination of pose and rank consensus in docking-based virtual screening: the best of both worlds
    Scardino, Valeria
    Bollini, Mariela
    Cavasotto, Claudio N.
    RSC ADVANCES, 2021, 11 (56) : 35383 - 35391
  • [35] Discovery of potential Zika virus RNA polymerase inhibitors by docking-based virtual screening
    Singh, Anjali
    Jana, Nandan Kumar
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2017, 71 : 144 - 151
  • [36] Discovery of novel reversible monoacylglycerol lipase inhibitors via docking-based virtual screening
    Xiong, Fengmin
    Ding, Xiaoyu
    Zhang, Hao
    Luo, Xiaomin
    Chen, Kaixian
    Jiang, Hualiang
    Luo, Cheng
    Xu, Heng
    BIOORGANIC & MEDICINAL CHEMISTRY LETTERS, 2021, 41
  • [37] Novel and selective DNA methyltransferase inhibitors: Docking-based virtual screening and experimental evaluation
    Kuck, Dirk
    Singh, Narender
    Lyko, Frank
    Medina-Franco, Jose L.
    BIOORGANIC & MEDICINAL CHEMISTRY, 2010, 18 (02) : 822 - 829
  • [38] Deep Learning with Geometry-Enhanced Molecular Representation for Augmentation of Large-Scale Docking-Based Virtual Screening
    Yu, Lan
    He, Xiao
    Fang, Xiaomin
    Liu, Lihang
    Liu, Jinfeng
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 63 (21) : 6501 - 6514
  • [39] Docking-based virtual screening of Brazilian natural compounds using the OOMT as the pharmacological target database
    Carregal, Ana Paula
    Maciel, Flavia V.
    Carregal, Juliano B.
    Santos, Bianca dos Reis
    da Silva, Alisson Marques
    Taranto, Alex G.
    JOURNAL OF MOLECULAR MODELING, 2017, 23 (04)
  • [40] Development and Validation of a Docking-Based Virtual Screening Platform for the Identification of New Lactate Dehydrogenase Inhibitors
    Granchi, Carlotta
    Capecchi, Alice
    Del Frate, Gianluca
    Martinelli, Adriano
    Macchia, Marco
    Minutolo, Filippo
    Tuccinardi, Tiziano
    MOLECULES, 2015, 20 (05): : 8772 - 8790