Carcinogenicity Prediction of Noncongeneric Chemicals by a Support Vector Machine

被引:25
|
作者
Zhong, Min [1 ]
Nie, Xianglei [1 ]
Yan, Aixia [1 ]
Yuan, Qipeng [1 ]
机构
[1] Beijing Univ Chem Technol, Dept Pharmaceut Engn, State Key Lab Chem Resource Engn, Beijing 100029, Peoples R China
基金
中国国家自然科学基金;
关键词
ORBITAL ELECTRONEGATIVITY; QSAR;
D O I
10.1021/tx4000182
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
The ability to identify carcinogenic compounds is of fundamental importance to the safe application of chemicals. In this study, we generated an array of in silk models allowing the classification of compounds into carcinogenic and noncarcinogenic agents based on a data set of 852 noncongeneric chemicals collected from the Carcinogenic Potency Database (CPDBAS). Twenty-four molecular descriptors were selected by Pearson correlation, F-score, and stepwise regression analysis. These descriptors cover a range of physicochemical properties, including electrophilicity, geometry, molecular weight, size, and solubility. The descriptor mutagenic showed the highest correlation coefficient with carcinogenicity. On the basis of these descriptors, a support vector machine-based (SVM) classification model was developed and fine-tuned by a 10-fold cross-validation approach. Both the SVM model (Model A1) and the best model from the 10-fold cross-validation (Model B3) runs gave good results on the test set with prediction accuracy over 80%, sensitivity over 76%, and specificity over 82%. In addition, extended connectivity fingerprints (ECFPs) and the Toxtree software were used to analyze the functional groups and substructures linked to carcinogenicity. It was found that the results of both methods are in good agreement.
引用
收藏
页码:741 / 749
页数:9
相关论文
共 50 条
  • [21] STRUCTURE-ACTIVITY PREDICTION OF THE CARCINOGENICITY OF CHEMICALS
    HELMES, CT
    SIGMAN, CC
    SULLIVAN, PA
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1982, 183 (MAR): : 2 - CHAS
  • [22] Prediction of antifungal activity by support vector machine approach
    Chen, SW
    Li, ZR
    Li, XY
    JOURNAL OF MOLECULAR STRUCTURE-THEOCHEM, 2005, 731 (1-3): : 73 - 81
  • [23] Prediction of mRNA polyadenylation sites by support vector machine
    Cheng, Yiming
    Miura, Robert M.
    Tian, Bin
    BIOINFORMATICS, 2006, 22 (19) : 2320 - 2325
  • [24] Marketing risk prediction based on the support vector machine
    Zhang, Yunqi
    Li, Jun
    PROCEEDINGS OF THE 2008 INTERNATIONAL CONFERENCE ON E-RISK MANAGEMENT (ICERM 2008), 2008, : 362 - 367
  • [25] SUPPORT VECTOR MACHINE FOR THE PREDICTION OF HEATING ENERGY USE
    Sretenovic, Aleksandra A.
    Jovanovic, Radisa Z.
    Novakovic, Vojislav M.
    Nord, Natasa M.
    Zivkovic, Branislav D.
    THERMAL SCIENCE, 2018, 22 : S1171 - S1181
  • [26] Prediction of heating parameters based on support vector machine
    Meiping, Wang
    Qi, Tian
    International Journal of Wireless and Mobile Computing, 2015, 8 (03) : 294 - 300
  • [27] Rockburst prediction using evolutionary support vector machine
    Zhao, HB
    PROGRESS IN SAFETY SCIENCE AND TECHNOLOGY, VOL V, PTS A AND B, 2005, 5 : 494 - 498
  • [28] Prediction of the β-hairpins in proteins using support vector machine
    Hu, Xiu Zhen
    Li, Qian Zhong
    PROTEIN JOURNAL, 2008, 27 (02): : 115 - 122
  • [29] Prediction of disulfide connectivity in proteins with support vector machine
    Hsuan-Liang Liu
    Shih-Chieh Chen
    JOURNAL OF THE CHINESE INSTITUTE OF CHEMICAL ENGINEERS, 2007, 38 (01): : 63 - 70
  • [30] Model of customer churn prediction on support vector machine
    Guangxi University of Finance and Economics, Nanning 530003, China
    不详
    Xitong Gongcheng Lilum yu Shijian, 2008, 1 (71-77):