Classification Models for Predicting Cytochrome P450 Enzyme-Substrate Selectivity
Cited by: 17
Authors:
Zhang, Tao [1,2,3]
Dai, Hao [1,2]
Liu, Limin Angela [4]
Lewis, David F. V. [5]
Wei, Dongqing [1,2]
Affiliations:
[1] Shanghai Jiao Tong Univ, State Key Lab Microbial Metab, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Coll Life Sci & Biotechnol, Shanghai 200240, Peoples R China
[3] Tian Jin Med Univ, Sch Biomed Engn, Tianjin 300070, Peoples R China
[4] Fred Hutchinson Canc Res Ctr, Seattle, WA 98109 USA
[5] Univ Surrey, Fac Hlth & Med Sci, Guildford GU2 7XH, Surrey, England
Keywords:
Bioinformatics;
Decision tree;
Enzymes;
Genetic algorithm;
Neural network;
P450;
PHARMACOPHORE;
3A4;
INHIBITORS;
MECHANISM;
DOI:
10.1002/minf.201100052
Chinese Library Classification number:
R914 [Medicinal chemistry];
Subject classification code:
100701;
Abstract:
Cytochrome P450 (CYP) is an important drug-metabolizing enzyme family. Different CYPs often have different substrate preferences. In addition, one drug molecule may be preferentially metabolized by one or more CYP enzymes. Therefore, the classification and prediction of substrate specificity of CYP enzymes are of importance to the understanding of drug metabolism and may help guide the development of new drugs. In this study, we used three different machine learning methods to classify CYP substrates for predicting CYP-substrate specificity based solely on structural and physicochemical properties of the substrates. We first built a simple decision tree model to classify substrates of four CYP enzymes, 1A2, 2C9, 2D6 and 3A4, with more than 78% classification accuracy. We then built a single-label eight-class model and a multilabel five-class model to classify substrates of eight CYP enzymes and to classify substrates that can be metabolized by more than one CYP enzyme, respectively. Prediction accuracies above 90% and above 80% were achieved for the single-label and multilabel models, respectively. The main improvement of our models over existing ones is the automated and unbiased selection of descriptors by genetic algorithms, which makes our methods applicable to larger data sets and an increased number of CYP enzymes.
Pages: 53-62
Page count: 10