Comparison of AIS and fuzzy c-means clustering methods on the classification of breast cancer and diabetes datasets

被引:7
|
作者
Ozsen, Seral [1 ]
Ceylan, Rahime [1 ]
机构
[1] Selcuk Univ, Dept Elect & Elect Engn, Konya, Turkey
关键词
Artificial immune systems; artificial neural networks; fuzzy c-means clustering; breast cancer dataset; diabetes dataset;
D O I
10.3906/elk-1210-62
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data reduction is an indispensable part of pattern classification processes in many cases. If the number of samples is excessive, sample reduction or data reduction algorithms can be used for an effective processing time and reliable successive results. Many methods have been used for data reduction. Fuzzy c-means is one of these methods and it is widely used in such applications as clustering algorithms. In this study, we applied a different clustering algorithm, an artificial immune system (AIS), for the data reduction process. We realized the performance evaluation experiments on the standard Chain link and Iris datasets, while the main application was conducted using the Wisconsin Breast Cancer and Pima Indian datasets, which were taken from the University of California, Irvine Machine Learning Repository. For these datasets, the performance of the AIS in the data reduction process was compared with the fuzzy c-means clustering algorithm, in which a multilayer perceptron artificial neural network was used as a classifier after the data reduction processes. The obtained results show that the maximum classification accuracies were obtained as 73.96% for the Pima Indian Diabetes dataset and 97.80% for the Wisconsin Breast Cancer dataset with the AIS and the compression rates were 80% and 40% for these results. For fuzzy c-means clustering, however, the aforementioned accuracies were obtained as 63% and 86.69% for the Pima Indian Diabetes and Wisconsin Breast Cancer datasets, respectively. Moreover, the compression rates for these results for fuzzy c-means were 90% and 70%. When the mean classification accuracy values over the experimented compression rates were taken into consideration, the AIS reached a mean classification accuracy of 70.07% for the Pima Indian Diabetes dataset, while 47.64% was obtained by fuzzy c-means for this dataset. For the Wisconsin Breast Cancer dataset, however, the mean classification accuracies of the AIS and fuzzy c-means methods were recorded as 94.90% and 75.43%, respectively.
引用
收藏
页码:1241 / 1254
页数:14
相关论文
共 50 条
  • [1] Comparison of Four Kinds of Fuzzy C-means Clustering Methods and Their Applications on Posture Classification
    Wang, Chuanxu
    Yan, Chunjuan
    2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2009, : 382 - 385
  • [2] Differentially Private Fuzzy C-Means Clustering Algorithms for Fuzzy Datasets
    Shakiba, Ali
    2018 6TH IRANIAN JOINT CONGRESS ON FUZZY AND INTELLIGENT SYSTEMS (CFIS), 2018, : 91 - 93
  • [3] Fuzzy C-Means Clustering: A Review of Applications in Breast Cancer Detection
    Krasnov, Daniel
    Davis, Dresya
    Malott, Keiran
    Chen, Yiting
    Shi, Xiaoping
    Wong, Augustine
    ENTROPY, 2023, 25 (07)
  • [4] COMPARISON OF CLUSTERING IN TUBERCULOSIS USING FUZZY C-MEANS AND K-MEANS METHODS
    Rochman, Eka Mala Sari
    Miswanto
    Suprajitno, Herry
    COMMUNICATIONS IN MATHEMATICAL BIOLOGY AND NEUROSCIENCE, 2022,
  • [5] Classification via Deep Fuzzy c-Means Clustering
    Yeganejou, Mojtaba
    Dick, Scott
    2018 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2018,
  • [6] Comparison of 3-point dixon imaging and fuzzy C-means clustering methods for breast density measurement
    Clendenen, Tess V.
    Zeleniuch-Jacquotte, Anne
    Moy, Linda
    Pike, Malcolm C.
    Rusinek, Henry
    Kim, Sungheon
    JOURNAL OF MAGNETIC RESONANCE IMAGING, 2013, 38 (02) : 474 - 481
  • [7] Fuzzy c-means clustering methods for symbolic interval data
    de Carvalho, Francisco de A. T.
    PATTERN RECOGNITION LETTERS, 2007, 28 (04) : 423 - 437
  • [8] FuzzyCSampling: A Hybrid fuzzy c-means clustering sampling strategy for imbalanced datasets
    Maras, Abdullah
    Selcukcan Erol, Cigdem
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2023, 31 (07) : 1223 - 1236
  • [9] Clustering large amounts of healthcare datasets using fuzzy c-means algorithm
    Reddy, B. Ramakantha
    Kumar, Y. Vijay
    Prabhakar, M.
    2019 5TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS (ICACCS), 2019, : 93 - 97
  • [10] Generalized Fuzzy c-Means Clustering and its Property of Fuzzy Classification Function
    Kanzawa, Yuchi
    Miyamoto, Sadaaki
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2021, 25 (01) : 73 - 82