COMPARISON OF CLUSTERING IN TUBERCULOSIS USING FUZZY C-MEANS AND K-MEANS METHODS

Cited by: 2

Authors:

Rochman, Eka Mala Sari [1,2]

Miswanto [1]

Suprajitno, Herry [1]

Affiliations:

[1] Airlangga Univ, Fac Sci & Technol, Dept Math, Surabaya, Indonesia

[2] Univ Trunojoyo Madura, Fac Engn, Dept Informat, Bangkalan, Indonesia

Source:

COMMUNICATIONS IN MATHEMATICAL BIOLOGY AND NEUROSCIENCE | 2022

Keywords:

tuberculosis; imputation; cluster; k-means; FCM; elbow; silhouette coefficient; DBI

DOI:

10.28919/cmbn/7335

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Subject classification code:

0812

Abstract:

Tuberculosis (TB) remains an unresolved health problem in Indonesia. According to WHO data, in 2021 Indonesia still ranked third in the world for the number of TB cases. This study aims to determine how many groups TB patients fall into based on age, gender, HIV status, history of diabetes mellitus, chest X-ray, and the results of the Molecular Rapid Test (TCM). The data comprise 985 records collected from 2017 to 2020. Before clustering, the data are cleaned by imputing missing values with the K-Nearest Neighbor (KNN) method, using k = 5. The k-means and Fuzzy C-Means (FCM) methods are then compared in clustering the TB data on these six features. Because a good clustering depends on choosing the right number of clusters, the study compares the elbow method, the silhouette coefficient, and the Davies-Bouldin Index (DBI). Both k-means and FCM give an optimal number of clusters of K = 2, but with different values. With the elbow method, the result for the k-means and FCM methods is 93288.49. The DBI value for the k-means and FCM methods is 0.4937. With the silhouette coefficient, k-means yields 0.6318 and FCM yields 0.6321. The authors conclude that k-means produces better cluster quality on the silhouette coefficient because its value is lower than that of FCM.
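The comparison described in the abstract can be illustrated with a minimal sketch, assuming Euclidean distance, a fuzzifier of m = 2 (the abstract does not state one), and a made-up two-dimensional toy dataset that is not the study's data. It runs k-means and FCM from the same initial centers, hardens the FCM memberships, and scores both with the silhouette coefficient and the DBI. All function names are hypothetical, and the KNN imputation and elbow steps are omitted. Under the standard definitions used here, the silhouette coefficient lies in [-1, 1] with larger values indicating better-separated clusters, while a smaller DBI indicates better-separated clusters.

```python
import math

def centroid(members):
    """Coordinate-wise mean of a non-empty list of points."""
    return tuple(sum(c) / len(members) for c in zip(*members))

def assign(points, centers):
    """Hard assignment: index of the nearest center for each point."""
    return [min(range(len(centers)), key=lambda j: math.dist(p, centers[j]))
            for p in points]

def kmeans(points, centers, iters=20):
    """Lloyd's algorithm from given initial centers; returns hard labels."""
    for _ in range(iters):
        labels = assign(points, centers)
        new = []
        for j, c in enumerate(centers):
            members = [p for p, l in zip(points, labels) if l == j]
            new.append(centroid(members) if members else c)  # keep empty clusters in place
        centers = new
    return assign(points, centers)

def memberships(points, centers, m):
    """FCM membership u_ij = 1 / sum_k (d_ij / d_ik)^(2 / (m - 1))."""
    power = 2.0 / (m - 1.0)
    rows = []
    for p in points:
        d = [max(math.dist(p, c), 1e-12) for c in centers]  # avoid division by zero
        rows.append([1.0 / sum((dj / dk) ** power for dk in d) for dj in d])
    return rows

def fcm(points, centers, m=2.0, iters=50):
    """Fuzzy c-means from given initial centers; returns the membership matrix."""
    for _ in range(iters):
        u = memberships(points, centers, m)
        centers = []
        for j in range(len(u[0])):
            w = [row[j] ** m for row in u]  # membership weights for cluster j
            centers.append(tuple(sum(wi * x for wi, x in zip(w, col)) / sum(w)
                                 for col in zip(*points)))
    return memberships(points, centers, m)

def silhouette(points, labels):
    """Mean of (b - a) / max(a, b) over all points; singletons score 0."""
    scores = []
    for i, p in enumerate(points):
        by_cluster = {}
        for j, q in enumerate(points):
            if i != j:
                by_cluster.setdefault(labels[j], []).append(math.dist(p, q))
        own = by_cluster.pop(labels[i], None)
        if not own or not by_cluster:
            scores.append(0.0)
            continue
        a = sum(own) / len(own)                                 # mean intra-cluster distance
        b = min(sum(v) / len(v) for v in by_cluster.values())   # nearest other cluster
        scores.append((b - a) / max(a, b))
    return sum(scores) / len(scores)

def davies_bouldin(points, labels):
    """Davies-Bouldin Index from hard labels (assumes no empty cluster)."""
    ids = sorted(set(labels))
    groups = [[p for p, l in zip(points, labels) if l == j] for j in ids]
    cents = [centroid(g) for g in groups]
    scatter = [sum(math.dist(p, c) for p in g) / len(g) for g, c in zip(groups, cents)]
    k = len(ids)
    return sum(max((scatter[i] + scatter[j]) / math.dist(cents[i], cents[j])
                   for j in range(k) if j != i) for i in range(k)) / k

# Toy data (illustrative only): two well-separated blobs.
points = [(1.0, 1.0), (1.2, 0.8), (0.8, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
init = [(0.0, 0.0), (6.0, 6.0)]

km_labels = kmeans(points, init)
u = fcm(points, init)
fcm_labels = [row.index(max(row)) for row in u]  # harden: pick the largest membership

for name, labels in (("k-means", km_labels), ("FCM", fcm_labels)):
    print(name, round(silhouette(points, labels), 4), round(davies_bouldin(points, labels), 4))
```

On data this cleanly separated, both methods produce the same hard partition, so the two indices coincide. Differences of the size reported in the abstract would only appear when some points sit near a cluster boundary and the hardened FCM labels differ from the k-means labels.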

Pages: 20
