Exploiting cheminformatic and machine learning to navigate the available chemical space of potential small molecule inhibitors of SARS-CoV-2

被引:25
|
作者
Kumar, Abhinit [1 ]
Loharch, Saurabh [1 ]
Kumar, Sunil [1 ]
Ringe, P. Rajesh [1 ]
Parkesh, Raman [1 ,2 ]
机构
[1] Inst Microbial Technol, CSIR, GNRPC, Chandigarh 160036, India
[2] Acad Sci & Innovat Res AcSIR, Ghaziabad 201002, India
关键词
COVID-19; SARS-CoV-2; Repurpose drugs; Chemical space; Gini coefficient; RESPIRATORY SYNDROME CORONAVIRUS; DRUG DISCOVERY; DIVERSITY; CHEMISTRY; POLYPHARMACOLOGY; LIBRARY;
D O I
10.1016/j.csbj.2020.12.028
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The current life-threatening and tenacious pandemic eruption of coronavirus disease in 2019 (COVID-19) has posed a significant global hazard concerning high mortality rate, economic meltdown, and everyday life distress. The rapid spread of COVID-19 demands countermeasures to combat this deadly virus. Currently, there are no drugs approved by the FDA to treat COVID-19. Therefore, discovering small molecule therapeutics for treating COVID-19 infection is essential. So far, only a few small molecule inhibitors are reported for coronaviruses. There is a need to expand the small chemical space of coronaviruses inhibitors by adding potent and selective scaffolds with anti-COVID activity. In this context, the huge antiviral chemical space already available can be analysed using cheminformatic and machine learning to unearth new scaffolds. We created three specific datasets called "antiviral dataset" (N = 38,428) "drug-like antiviral dataset" (N = 20,963) and "anticorona dataset" (N = 433) for this purpose. We analyzed the 433 molecules of "anticorona dataset" for their scaffold diversity, physicochemical distributions, principal component analysis, activity cliffs, R-group decomposition, and scaffold mapping. The scaffold diversity of the "anticorona dataset" in terms of Murcko scaffold analysis demonstrates a thorough representation of diverse chemical scaffolds. However, physicochemical descriptor analysis and principal component analysis demonstrated negligible drug-like features for the "anticorona dataset" molecules. The "antiviral dataset" and "drug-like antiviral dataset" showed low scaffold diversity as measured by the Gini coefficient. The hierarchical clustering of the "antiviral dataset" against the "anticorona dataset" demonstrated little molecular similarity. We generated a library of frequent fragments and polypharmacological ligands targeting various essential viral proteins such as main protease, helicase, papain-like protease, and replicase polyprotein 1ab. Further structural and chemical features of the "anticorona dataset" were compared with SARS-CoV-2 repurposed drugs, FDA-approved drugs, natural products, and drugs currently in clinical trials. Using machine learning tool DCA (DMax Chemistry Assistant), we converted the "anticorona dataset" into an elegant hypothesis with significant functional biological relevance. Machine learning analysis uncovered that FDA approved drugs, Tizanidine HCl, Cefazolin, Raltegravir, Azilsartan, Acalabrutinib, Luliconazole, Sitagliptin, Meloxicam (Mobic), Succinyl sulfathiazole, Fluconazole, and Pranlukast could be repurposed as effective drugs for COVID-19. Fragment-based scaffold analysis and R-group decomposition uncovered pyrrolidine and the indole molecular scaffolds as the potent fragments for designing and synthesizing the novel drug-like molecules for targeting SARS-CoV-2. This comprehensive and systematic assessment of small-molecule viral therapeutics' entire chemical space realised critical insights to potentially privileged scaffolds that could aid in enrichment and rapid discovery of efficacious antiviral drugs for COVID-19. (C) 2020 The Author(s). Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology.
引用
收藏
页码:424 / 438
页数:15
相关论文
共 50 条
  • [1] Exploiting cheminformatic and machine learning to navigate the available chemical space of potential small molecule inhibitors of SARS-CoV-2 (vol 19, pg, 424, 2021)
    Kumar, Abhinit
    Loharch, Saurabh
    Kumar, Sunil
    Ringe, Rajesh P.
    Parkesh, Raman
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2023, 21 : 4408 - 4408
  • [2] SARS-CoV-2 Resistance to Small Molecule Inhibitors
    Lopez, Uxua Modrego
    Hasan, Md. Mehedi
    Havranek, Brandon
    Islam, Shahidul M.
    CURRENT CLINICAL MICROBIOLOGY REPORTS, 2024, 11 (03): : 127 - 139
  • [3] Identification of novel small-molecule inhibitors of SARS-CoV-2 by chemical genetics
    Chan, Chris Chun-Yiu
    Guo, Qian
    Chan, Jasper Fuk-Woo
    Tang, Kaiming
    Cai, Jian-Piao
    Chik, Ka-Heng
    Chik, Kenn Ka-Heng
    Huang, Yixin
    Dai, Mei
    Qin, Bo
    Ong, Chon Phin
    Chu, Allen Wing-Ho
    Chan, Wan-Mui
    Ip, Jonathan Daniel
    Wen, Lei
    Tsang, Jessica Oi-Ling
    Wang, Tong-Yun
    Xie, Yubin
    Qin, Zhenzhi
    Cao, Jianli
    Ye, Zi-Wei
    Chu, Hin
    To, Kelvin Kai-Wang
    Ge, Xing-Yi
    Ni, Tao
    Jin, Dong-Yan
    Cui, Sheng
    Yuen, Kwok-Yung
    Yuan, Shuofeng
    ACTA PHARMACEUTICA SINICA B, 2024, 14 (09) : 4028 - 4044
  • [4] SARS-CoV-2 Mpro: A Potential Target for Peptidomimetics and Small-Molecule Inhibitors
    Citarella, Andrea
    Scala, Angela
    Piperno, Anna
    Micale, Nicola
    BIOMOLECULES, 2021, 11 (04)
  • [5] Exploring the chemical space of BRAF Inhibitors: A cheminformatic and Machine learning analysis
    Aouidate, Adnane
    JOURNAL OF MOLECULAR LIQUIDS, 2024, 401
  • [6] Chemical Proteomics Approaches for Screening Small Molecule Inhibitors Covalently Binding to SARS-Cov-2
    Zheng, Liuhai
    Zhang, Qian
    Luo, Piao
    Shi, Fei
    Zhang, Ying
    He, Xiaoxue
    An, Yehai
    Cheng, Guangqing
    Pan, Xiaoyan
    Li, Zhijie
    Zhou, Boping
    Wang, Jigang
    ADVANCED BIOLOGY, 2024, 8 (11):
  • [7] Machine Learning Models Identify Inhibitors of SARS-CoV-2
    Gawriljuk, Victor O.
    Zin, Phyo Phyo Kyaw
    Puhl, Ana C.
    Zorn, Kimberley M.
    Foil, Daniel H.
    Lane, Thomas R.
    Hurst, Brett
    Tavella, Tatyana Almeida
    Maranhao Costa, Fabio Trindade
    Lakshmanane, Premkumar
    Bernatchez, Jean
    Godoy, Andre S.
    Oliva, Glaucius
    Siqueira-Neto, Jair L.
    Madrid, Peter B.
    Ekins, Sean
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (09) : 4224 - 4235
  • [8] Identification of potent small molecule inhibitors of SARS-CoV-2 entry
    Mediouni, Sonia
    Mou, Huihui
    Otsuka, Yuka
    Jablonski, Joseph Anthony
    Adcock, Robert Scott
    Batra, Lalit
    Chung, Dong-Hoon
    Rood, Christopher
    de Vera, Ian Mitchelle S.
    Rahaim, Ronald, Jr.
    Ullah, Sultan
    Yu, Xuerong
    Getmanenko, Yulia A.
    Kennedy, Nicole M.
    Wang, Chao
    Nguyen, Tu-Trinh
    Hull, Mitchell
    Chen, Emily
    Bannister, Thomas D.
    Baillargeon, Pierre
    Scampavia, Louis
    Farzan, Michael
    Valente, Susana T.
    Spicer, Timothy P.
    SLAS DISCOVERY, 2022, 27 (01) : 8 - 19
  • [9] A machine learning regression model for the screening and design of potential SARS-CoV-2 protease inhibitors
    Janairo, Gabriela Ilona B.
    Yu, Derrick Ethelbhert C.
    Janairo, Jose Isagani B.
    Network Modeling Analysis in Health Informatics and Bioinformatics, 2021, 10 (01)
  • [10] A machine learning regression model for the screening and design of potential SARS-CoV-2 protease inhibitors
    Janairo, Gabriela Ilona B.
    Yu, Derrick Ethelbhert C.
    Janairo, Jose Isagani B.
    NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2021, 10 (01):