Carcinogenicity Prediction of Noncongeneric Chemicals by a Support Vector Machine

Cited by: 25
Authors
Zhong, Min [1]
Nie, Xianglei [1]
Yan, Aixia [1]
Yuan, Qipeng [1]
Affiliations
[1] Beijing Univ Chem Technol, Dept Pharmaceut Engn, State Key Lab Chem Resource Engn, Beijing 100029, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
ORBITAL ELECTRONEGATIVITY; QSAR;
DOI
10.1021/tx4000182
Chinese Library Classification
R914 [Medicinal Chemistry];
Discipline classification code
100701;
Abstract
The ability to identify carcinogenic compounds is of fundamental importance to the safe application of chemicals. In this study, we generated an array of in silico models allowing the classification of compounds into carcinogenic and noncarcinogenic agents based on a data set of 852 noncongeneric chemicals collected from the Carcinogenic Potency Database (CPDBAS). Twenty-four molecular descriptors were selected by Pearson correlation, F-score, and stepwise regression analysis. These descriptors cover a range of physicochemical properties, including electrophilicity, geometry, molecular weight, size, and solubility. The descriptor "mutagenic" showed the highest correlation coefficient with carcinogenicity. On the basis of these descriptors, a support vector machine (SVM)-based classification model was developed and fine-tuned by a 10-fold cross-validation approach. Both the SVM model (Model A1) and the best model from the 10-fold cross-validation runs (Model B3) gave good results on the test set, with prediction accuracy over 80%, sensitivity over 76%, and specificity over 82%. In addition, extended connectivity fingerprints (ECFPs) and the Toxtree software were used to analyze the functional groups and substructures linked to carcinogenicity. The results of both methods were found to be in good agreement.
Pages: 741-749
Page count: 9
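The abstract reports test-set accuracy, sensitivity, and specificity for a binary carcinogenic/noncarcinogenic classifier. As a minimal sketch of how those three figures relate to a confusion matrix, the snippet below computes them from true and predicted labels. The function name `classification_metrics`, the label encoding (1 = carcinogenic, 0 = noncarcinogenic), and the toy labels are assumptions made for illustration only; they are not taken from the paper and do not reproduce its data or results.

```python
from typing import Dict, Sequence


def classification_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Return accuracy, sensitivity, and specificity for binary labels.

    Assumed encoding (hypothetical): 1 = carcinogenic, 0 = noncarcinogenic.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    # Count the four cells of the confusion matrix.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in y_true")

    return {
        "accuracy": (tp + tn) / len(y_true),   # fraction of all compounds classified correctly
        "sensitivity": tp / (tp + fn),         # fraction of carcinogens correctly flagged
        "specificity": tn / (tn + fp),         # fraction of noncarcinogens correctly cleared
    }


# Toy labels, invented for illustration (not data from the study).
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

Reporting sensitivity and specificity alongside accuracy matters for a data set like this one because a model can reach a high overall accuracy while still missing many carcinogens if the two classes are unevenly represented.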