共 50 条
QSAR studies of the bioactivity of hepatitis C virus (HCV) NS3/4A protease inhibitors by multiple linear regression (MLR) and support vector machine (SVM)
被引:17
|作者:
Qin, Zijian
[1
]
Wang, Maolin
[1
]
Yan, Aixia
[1
]
机构:
[1] Beijing Univ Chem Technol, Dept Pharmaceut Engn, State Key Lab Chem Resource Engn, POB 53,15 BeiSanHuan East Rd, Beijing 100029, Peoples R China
基金:
中国国家自然科学基金;
关键词:
Hepatitis C virus (HCV) NS3/4A protease inhibitors;
Quantitative structure-activity relationship (QSAR);
Kohonen's self-organizing map (SOM);
Multiple linear regression (MLR);
Support vector machine (SVM);
Sub-dataset analysis;
PEPTIDE-BASED INHIBITORS;
ALPHA-KETOAMIDES;
SERINE-PROTEASE;
ANTIVIRAL ACTIVITY;
POTENT INHIBITORS;
CRYSTAL-STRUCTURE;
DISCOVERY;
DESIGN;
PROTEINASE;
BINDING;
D O I:
10.1016/j.bmcl.2017.05.001
中图分类号:
R914 [药物化学];
学科分类号:
100701 ;
摘要:
In this study, quantitative structure-activity relationship (QSAR) models using various descriptor sets and training/test set selection methods were explored to predict the bioactivity of hepatitis C virus (HCV) NS3/4A protease inhibitors by using a multiple linear regression (MLR) and a support vector machine (SVM) method. 512 HCV NS3/4A protease inhibitors and their IC50 values which were determined by the same FRET assay were collected from the reported literature to build a dataset. All the inhibitors were represented with selected nine global and 12 2D property-weighted autocorrelation descriptors calculated from the program CORINA Symphony. The dataset was divided into a training set and a test set by a random and a Kohonen's self-organizing map (SOM) method. The correlation coefficients (r(2)) of training sets and test sets were 0.75 and 0.72 for the best MLR model, 0.87 and 0.85 for the best SVM model, respectively. In addition, a series of sub-dataset models were also developed. The performances of all the best sub-dataset models were better than those of the whole dataset models. We believe that the combination of the best sub-and whole dataset SVM models can be used as reliable lead designing tools for new NS3/4A protease inhibitors scaffolds in a drug discovery pipeline. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2931 / 2938
页数:8
相关论文