FUZZY kNNMODEL APPLIED TO PREDICTIVE TOXICOLOGY DATA MINING

被引:4
|
作者
Guo, Gongde [1 ]
Neagu, Daniel [1 ]
机构
[1] Univ Bradford, Dept Comp, Bradford BD7 1DP, W Yorkshire, England
基金
英国工程与自然科学研究理事会;
关键词
Fuzzy kNNModel; classification; predictive toxicology;
D O I
10.1142/S1469026805001635
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A robust method, fuzzy kNNModel, for toxicity prediction of chemical compounds is proposed. The method is based on a supervised clustering method, called kNNModel, which employs fuzzy partitioning instead of crisp partitioning to group clusters. The merits of fuzzy kNNModel are two-fold: (1) it overcomes the problems of choosing the parameter e-allowed error rate in a cluster and the parameter N - minimal number of instances covered by a cluster, for each data set; (2) it better captures the characteristics of boundary data by assigning them with different degrees of membership between 0 and 1 to different clusters. The experimental results of fuzzy kNNModel conducted on thirteen public data sets from UCI machine learning repository and seven toxicity data sets from real-world applications, are compared with the results of fuzzy c-means clustering, k-means clustering, kNN, fuzzy kNN, and kNNModel in terms of classification performance. This application shows that fuzzy kNNModel is a promising method for the toxicity prediction of chemical compounds.
引用
收藏
页码:321 / 333
页数:13
相关论文
共 50 条
  • [1] A comparative study of machine learning algorithms applied to predictive toxicology data mining
    Neagu, Daniel C.
    Guo, Gongde
    Trundle, Paul R.
    Cronin, Mark T. D.
    ATLA-ALTERNATIVES TO LABORATORY ANIMALS, 2007, 35 (01): : 25 - 32
  • [3] The research on fuzzy data mining applied on browser records
    Chen, QZ
    Han, JH
    Lai, YG
    He, WX
    Mao, KJ
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 527 - 535
  • [4] Predictive toxicology of chemicals and database mining
    WANG Jiansuo
    Chinese Science Bulletin, 2000, (12) : 1093 - 1097
  • [5] Predictive toxicology of chemicals and database mining
    Wang, JS
    Lai, LH
    Tang, YQ
    CHINESE SCIENCE BULLETIN, 2000, 45 (12): : 1093 - 1097
  • [6] Data mining techniques applied to predictive modeling of the knurling process
    Feng, CXJ
    Wang, XFD
    IIE TRANSACTIONS, 2004, 36 (03) : 253 - 263
  • [7] Knowledge discovery and data mining in toxicology
    Helma, C
    Gottmann, E
    Kramer, S
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2000, 9 (04) : 329 - 358
  • [8] A Minimal Coverage-based Classification Method and Its Application in Predictive Toxicology Data Mining
    Guo, Gongde
    Huang, Yu
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 1242 - +
  • [9] Data governance in predictive toxicology: A review
    Xin Fu
    Anna Wojak
    Daniel Neagu
    Mick Ridley
    Kim Travis
    Journal of Cheminformatics, 3
  • [10] Data governance in predictive toxicology: A review
    Fu, Xin
    Wojak, Anna
    Neagu, Daniel
    Ridley, Mick
    Travis, Kim
    JOURNAL OF CHEMINFORMATICS, 2011, 3