Reducing false positive rate of docking-based virtual screening by active learning

Cited by: 7
Authors
Wang, Lei [1]
Shi, Shao-Hua [2]
Li, Hui [1]
Zeng, Xiang-Xiang [1, 3]
Liu, Su-You
Liu, Zhao-Qian [1]
Deng, Ya-Feng [4]
Lu, Ai-Ping [5]
Hou, Ting-Jun [6]
Cao, Dong-Sheng [1]
Affiliations
[1] Cent South Univ, Xiangya Sch Pharmaceut Sci, Changsha, Peoples R China
[2] Hong Kong Baptist Univ, Sch Chinese Med, Hong Kong, Peoples R China
[3] Hunan Univ, Dept Comp Sci, Changsha, Peoples R China
[4] CarbonSilicon AI Technol, Hangzhou, Peoples R China
[5] Hong Kong Baptist Univ, Inst Adv Translat Med Bone & Joint Dis, Sch Chinese Med, Hong Kong, Peoples R China
[6] Zhejiang Univ, Coll Pharmaceut Sci, Hangzhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
molecular docking; machine learning-based scoring function (MLSF); active learning; virtual screening (VS); false positive; SCORING FUNCTIONS;
DOI
10.1093/bib/bbac626
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Machine learning-based scoring functions (MLSFs) have become a favorable alternative to classical scoring functions because of their potentially superior screening performance. However, the negative data used to construct MLSFs are rarely reported in the literature, and the putative inactive molecules recorded in existing databases are usually biased relative to the active molecules. Here we propose an easy-to-use method named AMLSF that combines MLSFs with active learning based on negative molecular selection strategies; it iteratively improves the quality of the inactive sets and thus reduces the false positive rate of virtual screening. We chose energy auxiliary terms learning as the MLSF and validated the method on eight targets in the diverse subset of DUD-E. For each target, we screened the IterBioScreen database with AMLSF and compared the results with those of four control models. The number of active molecules among the top 1000 molecules identified by AMLSF was significantly higher than the numbers identified by the control models. In addition, free energy calculations for the top 10 molecules selected by AMLSF, the null model and the control models based on DUD-E also showed that AMLSF identifies more active molecules and reduces the false positive rate.
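The abstract does not specify the negative molecular selection strategies, so the following is only a minimal sketch of the general idea of iteratively refining an inactive set. It assumes a hard-negative rule (add the pool molecules that the current model scores as most active-like) and a toy nearest-centroid scorer in place of a real MLSF. The names `refine_negatives`, `active_score`, `n_init`, `n_add` and `rounds` are hypothetical and are not taken from the paper.

```python
import math
import random


def centroid(rows):
    """Component-wise mean of a list of equal-length feature vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def active_score(x, act_c, neg_c):
    """Toy stand-in for an MLSF: higher means x looks more like an active."""
    return math.dist(x, neg_c) - math.dist(x, act_c)


def refine_negatives(actives, pool, n_init=5, n_add=5, rounds=3, seed=0):
    """Iteratively grow a negative set from a pool of presumed inactives.

    Assumed strategy (not the paper's): start from a random sample, then in
    each round add the pool molecules the current model ranks as most
    active-like, i.e. the likeliest false positives.
    """
    rng = random.Random(seed)
    chosen = set(rng.sample(range(len(pool)), n_init))
    for _ in range(rounds):
        # "Retrain" the toy model on actives vs. the current negative set.
        act_c = centroid(actives)
        neg_c = centroid([pool[i] for i in chosen])
        remaining = [i for i in range(len(pool)) if i not in chosen]
        if not remaining:
            break
        # Rank unselected pool molecules from most to least active-like.
        remaining.sort(
            key=lambda i: active_score(pool[i], act_c, neg_c), reverse=True
        )
        chosen.update(remaining[:n_add])
    return sorted(chosen)
```

In a real workflow the feature vectors would be docking-derived descriptors and the scorer a trained MLSF; the loop structure (train, score the pool, select new negatives, retrain) is the only part this sketch is meant to illustrate.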
Pages: 11