Carcinogenicity Prediction of Noncongeneric Chemicals by a Support Vector Machine

Cited by: 25

Authors:

Zhong, Min [1]

Nie, Xianglei [1]

Yan, Aixia [1]

Yuan, Qipeng [1]

Affiliation:

[1] Beijing Univ Chem Technol, Dept Pharmaceut Engn, State Key Lab Chem Resource Engn, Beijing 100029, Peoples R China

Source:

CHEMICAL RESEARCH IN TOXICOLOGY | 2013 / Vol. 26 / Issue 05

Funding:

National Natural Science Foundation of China;

Keywords:

ORBITAL ELECTRONEGATIVITY; QSAR;

DOI:

10.1021/tx4000182

Chinese Library Classification (CLC) number:

R914 [Medicinal Chemistry];

Discipline classification code:

100701;

Abstract:

The ability to identify carcinogenic compounds is of fundamental importance to the safe application of chemicals. In this study, we generated an array of in silico models allowing the classification of compounds into carcinogenic and noncarcinogenic agents based on a data set of 852 noncongeneric chemicals collected from the Carcinogenic Potency Database (CPDBAS). Twenty-four molecular descriptors were selected by Pearson correlation, F-score, and stepwise regression analysis. These descriptors cover a range of physicochemical properties, including electrophilicity, geometry, molecular weight, size, and solubility. The descriptor "mutagenic" showed the highest correlation coefficient with carcinogenicity. On the basis of these descriptors, a support vector machine (SVM)-based classification model was developed and fine-tuned by a 10-fold cross-validation approach. Both the SVM model (Model A1) and the best model from the 10-fold cross-validation runs (Model B3) gave good results on the test set, with prediction accuracy over 80%, sensitivity over 76%, and specificity over 82%. In addition, extended connectivity fingerprints (ECFPs) and the Toxtree software were used to analyze the functional groups and substructures linked to carcinogenicity. It was found that the results of both methods are in good agreement.
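The three test-set figures quoted in the abstract (accuracy, sensitivity, specificity) follow from the confusion matrix of a binary classifier. The sketch below shows one way they could be computed. It is an illustration only, not the authors' code: the function name `classification_metrics`, the label convention (1 = carcinogenic, 0 = noncarcinogenic), and the toy labels are all assumptions introduced here and are not taken from the paper.

```python
import math


def classification_metrics(y_true, y_pred):
    """Accuracy, sensitivity and specificity for binary labels.

    Assumed convention (not from the paper): 1 = carcinogenic, 0 = noncarcinogenic.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    # Count the four cells of the confusion matrix.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    def ratio(num, den):
        # Undefined ratios (empty class or empty input) are reported as NaN.
        return num / den if den else math.nan

    return {
        "accuracy": ratio(tp + tn, tp + tn + fp + fn),
        "sensitivity": ratio(tp, tp + fn),  # fraction of true carcinogens recovered
        "specificity": ratio(tn, tn + fp),  # fraction of noncarcinogens recovered
    }


# Hypothetical toy labels, used only to exercise the function.
toy_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
toy_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
metrics = classification_metrics(toy_true, toy_pred)
print(metrics)
```

Reporting sensitivity and specificity alongside accuracy matters when the two classes are unequal in size, since a model can reach high accuracy while missing most members of the smaller class.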


Pages: 741-749

Page count: 9
