COMPARISON OF CLUSTERING IN TUBERCULOSIS USING FUZZY C-MEANS AND K-MEANS METHODS

被引:2
|
作者
Rochman, Eka Mala Sari [1 ,2 ]
Miswanto [1 ]
Suprajitno, Herry [1 ]
机构
[1] Airlangga Univ, Fac Sci & Technol, Dept Math, Surabaya, Indonesia
[2] Univ Trunojoyo Madura, Fac Engn, Dept Informat, Bangkalan, Indonesia
关键词
tuberculosis; imputation; cluster; k-means; FCM; elbow; silhouette coefficient; DBI;
D O I
10.28919/cmbn/7335
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Tuberculosis (TB) is a health problem that has yet to be resolved in Indonesia. Based on WHO data, in 2021 Indonesia will still be in the third rank of the highest TB cases in the world. This study aims to determine how many groups of TB patients are based on age, gender, HIV status, history of diabetes mellitus, chest X-ray, and the results of the Molecular Rapid Test (TCM). The data used in this study were 985 from 2017 to 2020. The method used in this research is K-Nearest Neighbor (KNN) in carrying out the imputation process, as well as comparing the k-means and Fuzzy C-Means (FCM) methods in classifying TB data. Before doing the grouping, the data cleaning process is carried out by an imputation process which is useful for filling in the missing data in this case, using the KNN method. To produce maximum results of data grouping or clustering, it is necessary to determine the right number of clusters. For this reason, this study tries to compare the elbow, silhouette coefficient, and Davies Bouldin Index (DBI) methods. The application of the KNN method in the data imputation process in this study is to use k=5. The application of the K-Means algorithm is to form groups of TB patients based on six features. Determination of the optimal number of clusters using the K-means and FCM methods shows the optimal number of clusters, namely K = 2 but with different values. The results of the clustering test using the elbow method with the K-means and FCM methods are 93288.49. The DBI value for the K-means and FCM methods is 0.4937. Meanwhile, the clustering trial with the silhouette coefficient on K-means yields a value of 0.6318 which is better than the FCM which produces a value of 0.6321. This shows that the results of clustering k-means with silhouette coefficients produce better cluster quality because they have a lower silhouette coefficient value than FCM.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] A Fast Fingerprint Image Alignment Algorithms Using K-Means and Fuzzy C-Means Clustering Based Image Rotation Technique
    Jaganathan, P.
    Rajinikannan, M.
    [J]. CONTROL, COMPUTATION AND INFORMATION SYSTEMS, 2011, 140 : 281 - 288
  • [42] Improved Fuzzy C-means and K-means Algorithms for Texture and Boundary Segmentation
    Koc, Yunus
    Olmez, Tamer
    [J]. 2018 6TH INTERNATIONAL CONFERENCE ON CONTROL ENGINEERING & INFORMATION TECHNOLOGY (CEIT), 2018,
  • [43] Performance of the K-means and fuzzy C-means algorithms in big data analytics
    Salman Z.
    Alomary A.
    [J]. International Journal of Information Technology, 2024, 16 (1) : 465 - 470
  • [44] Possibilistic fuzzy c-means clustering model using kernel methods
    Wu, Xiao-Hong
    Zhou, Jian-Jiang
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION JOINTLY WITH INTERNATIONAL CONFERENCE ON INTELLIGENT AGENTS, WEB TECHNOLOGIES & INTERNET COMMERCE, VOL 2, PROCEEDINGS, 2006, : 465 - +
  • [45] Disaggregating and updating a legacy soil map using DSMART, fuzzy c-means and k-means clustering algorithms in Central Iran
    Zeraatpisheh, Mojtaba
    Ayoubi, Shamsollah
    Brungard, Colby W.
    Finke, Peter
    [J]. GEODERMA, 2019, 340 : 249 - 258
  • [46] Comparison of K-Means and Fuzzy c-Means Algorithm Performance for Automated Determination of the Arterial Input Function
    Yin, Jiandong
    Sun, Hongzan
    Yang, Jiawen
    Guo, Qiyong
    [J]. PLOS ONE, 2014, 9 (02):
  • [47] A performance comparison of using principal component analysis and ant clustering with fuzzy c-means and k-harmonic means
    Julrode, Phichete
    Supratid, Siriporn
    Suksawatchon, Ureerat
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND CYBERNETICS (CYBERNETICSCOM), 2012, : 123 - 128
  • [48] Spectral Comparison Using k-Means Clustering
    Ramachandran, Vignesh R.
    Mitchell, Herbert J.
    Jacobs, Samantha K.
    Tzeng, Nigel H.
    Firpi, Alexer H.
    Rodriguez, Benjamin M.
    [J]. 2014 IEEE AEROSPACE CONFERENCE, 2014,
  • [49] Analysis of Poor Population in DKI Jakarta Regions using Fuzzy C-Means and K-Means Algorithms
    Sidqi, Syafirina Arsyidin
    Akso, Rafian Tri
    Nurdianto, Imam Bayu
    Novendri, Alia
    Tahyudin, Imam
    Sholikhatin, Siti Alvi
    [J]. APICS 2022 - 2022 1st International Conference on Smart Technology, Applied Informatics, and Engineering, Proceedings, 2022, : 27 - 30
  • [50] Modified fuzzy C-means and K-means clustering based spectrum sensing using cooperative spectrum for cognitive radio networks applications
    Parimala, V
    Devarajan, K.
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (03) : 3727 - 3740