Improving Clustering Method Performance Using K-Means, Mini Batch K-Means, BIRCH and Spectral

被引:3
|
作者
Wahyuningrum, Tenia [1 ]
Khomsah, Siti [2 ]
Suyanto, Suyanto [3 ]
Meliana, Selly [3 ]
Yunanto, Prasti Eko [3 ]
Al Maki, Wikky F. [3 ]
机构
[1] Inst Teknol Telkom Purwokerto, Dept Informat, Banyumas, Indonesia
[2] Inst Teknol Telkom Purwokerto, Dept Data Sci, Banyumas, Indonesia
[3] Telkom Univ, Sch Comp, Bandung, Indonesia
关键词
clustering; KNN; K-Means; BIRCH; Spectral;
D O I
10.1109/ISRITI54043.2021.9702823
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The most pressing problem of the k-Nearest Neighbor (KNN) classification method is voting technology, which will lead to poor accuracy of some randomly distributed complex data sets. To overcome the weakness of KNN, we added a step before the KNN classification phase. We developed a new schema for grouping data sets, making the number of clusters greater than the number of data classes. In addition, the committee selects each cluster so that it does not use voting techniques such as standard KNN methods. This study uses two sequential methods, namely the clustering method and the KNN method. Clustering methods can be used to group records into multiple clusters to select commissions from these clusters. Five clustering methods were tested: K-Means, K-Means with Principal Component Analysis (PCA), Mini Batch K-Means, Spectral and Balanced Iterative Reduction and Clustering using Hierarchies (BIRCH). All tested clustering methods are based on the cluster type of the center of gravity. According to the result, the BIRCH method has the lowest error rate among the five clustering methods (2.13), and K-Means has the largest clusters (156.63).
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Spectral Comparison Using k-Means Clustering
    Ramachandran, Vignesh R.
    Mitchell, Herbert J.
    Jacobs, Samantha K.
    Tzeng, Nigel H.
    Firpi, Alexer H.
    Rodriguez, Benjamin M.
    [J]. 2014 IEEE AEROSPACE CONFERENCE, 2014,
  • [2] Anomaly Detection by Using Streaming K-Means and Batch K-Means
    Wang, Zhuo
    Zhou, Yanghui
    Li, Gangmin
    [J]. 2020 5TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (IEEE ICBDA 2020), 2020, : 11 - 17
  • [3] Clustering of Image Data Using K-Means and Fuzzy K-Means
    Rahmani, Md. Khalid Imam
    Pal, Naina
    Arora, Kamiya
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (07) : 160 - 163
  • [4] Comparative Study of K-means and Mini Batch K-means Clustering Algorithms in Android Malware Detection Using Network Traffic Analysis
    Feizollah, Ali
    Anuar, Nor Badrul
    Salleh, Rosli
    Amalina, Fairuz
    [J]. 2014 INTERNATIONAL SYMPOSIUM ON BIOMETRICS AND SECURITY TECHNOLOGIES (ISBAST), 2014, : 193 - 197
  • [5] Spectral relaxation for K-means clustering
    Zha, HY
    He, XF
    Ding, C
    Simon, H
    Gu, M
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 1057 - 1064
  • [6] Nested Mini-Batch K-Means
    Newling, James
    Fleuret, Francois
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [7] Soil data clustering by using K-means and fuzzy K-means algorithm
    Hot, Elma
    Popovic-Bugarin, Vesna
    [J]. 2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 890 - 893
  • [8] K-Means Cloning: Adaptive Spherical K-Means Clustering
    Hedar, Abdel-Rahman
    Ibrahim, Abdel-Monem M.
    Abdel-Hakim, Alaa E.
    Sewisy, Adel A.
    [J]. ALGORITHMS, 2018, 11 (10):
  • [9] Performance Evaluation of a Novel Hybrid Clustering Algorithm using Birch and K-Means
    Kaur, Jaskaranjit
    Singh, Harpreet
    [J]. 2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
  • [10] Improving the Walktrap Algorithm Using K-Means Clustering
    Brusco, Michael
    Steinley, Douglas
    Watts, Ashley L.
    [J]. MULTIVARIATE BEHAVIORAL RESEARCH, 2024, 59 (02) : 266 - 288