Centroid Based Celestial Clustering Algorithm: A Novel Unsupervised Learning Method for Haemogram Data Clustering

被引:2
|
作者
Kumar, Shibu K. B. [1 ]
Samuel, Philip [2 ]
机构
[1] Coll Engn Trivandrum, Dept Comp Sci & Engn, Trivandrum 695016, Kerala, India
[2] Cochin Univ Sci & Technol, Dept Comp Sci, Cochin 682022, Kerala, India
关键词
Diseases; Clustering algorithms; Prediction algorithms; Blood; Optimization; Force; Unsupervised learning; Centroid based celestial clustering; clustering algorithms; disease prediction; haemogram data clustering; nature inspired methods; K-MEANS; GENERAL FRAMEWORK; OPTIMIZATION;
D O I
10.1109/TETCI.2022.3211004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accuracy of clustering is the most important parameter as far as automated disease identification is concerned. There have always been attempts to automate the process of disease prediction from haemogram data. However, there are several components in blood test results and very often we find that a variety of combinations of these component results are to be used to detect a disease. This makes identification of diseases really hard and necessitates the use of data analysis techniques. As new diseases are arising from time to time, a useful method for prediction is unsupervised learning and the corresponding data analysis technique is clustering. An easy, efficient and centroid based clustering algorithm that has been in practice widely is k-means. Its simplicity and efficiency make it a natural choice for most of the clustering applications. However, k-means is largely dependent on the selection of initial cluster centers and a bad choice can make it fall to local optima, thereby sacrificing accuracy. Besides, it is non-deterministic in nature. This paper proposes a novel, nature inspired, clustering method, named Centroid Based Celestial Clustering, which overcomes the above issues. Our method is deterministic and converges to global optima on spherical datasets. We experimentally evaluate our algorithm for speed of execution and cluster quality against well-known clustering algorithms using statistical evaluation metrics like silhouette width, adjusted rand index and Dunn index. We use the method to predict diseases identifiable from blood tests and our experiments show that the accuracy of prediction is very promising.
引用
收藏
页码:942 / 956
页数:15
相关论文
共 50 条
  • [21] A Fuzzy Threshold Based Unsupervised Clustering Algorithm for Natural Data Exploration
    Thomas, Binu
    Raju, G.
    2010 INTERNATIONAL CONFERENCE ON NETWORKING AND INFORMATION TECHNOLOGY (ICNIT 2010), 2010, : 473 - 477
  • [22] A Novel Method of Data Correlation Analysis of the Big Data Based on Network Clustering Algorithm
    Yang, Yue
    Wang, Chunting
    PROCEEDINGS OF 2015 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN), 2015, : 360 - 366
  • [23] Unsupervised Data Fusion With Deeper Perspective: A Novel Multisensor Deep Clustering Algorithm
    Shahi, Kasra Rafiezadeh
    Ghamisi, Pedram
    Rasti, Behnood
    Scheunders, Paul
    Gloaguen, Richard
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 284 - 296
  • [24] Unsupervised Meta-Learning for Clustering Algorithm Recommendation
    Pimentel, Bruno Almeida
    de Carvalho, Andre C. P. L. E.
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [25] A constructive unsupervised learning algorithm for clustering binary patterns
    Wang, D
    Chaudhari, NS
    Patra, JC
    2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 1381 - 1385
  • [26] Uncertain Centroid based Partitional Clustering of Uncertain Data
    Gullo, Francesco
    Tagarelli, Andrea
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (07): : 610 - 621
  • [27] A novel clustering algorithm based on data transformation approaches
    Azimi, Rasool
    Ghayekhloo, Mohadeseh
    Ghofrani, Mahmoud
    Sajedi, Hedieh
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 76 : 59 - 70
  • [28] Unsupervised Evolutionary Clustering Algorithm for Mixed Type Data
    Zheng, Zhi
    Gong, Maoguo
    Ma, Jingjing
    Jiao, Licheng
    Wu, Qiaodi
    2010 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2010,
  • [29] Unsupervised clustering algorithm for N-dimensional data
    Montgomery, EB
    Huang, H
    Assadi, A
    JOURNAL OF NEUROSCIENCE METHODS, 2005, 144 (01) : 19 - 24
  • [30] Intelligent Hybrid Algorithm for Unsupervised Data Clustering Problem
    Hamdi, Amira
    Monmarche, Nicolas
    Slimane, Mohamed
    Alimi, Adel M.
    PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS 2016), 2017, 552 : 442 - 455