Centroid Based Celestial Clustering Algorithm: A Novel Unsupervised Learning Method for Haemogram Data Clustering

被引:2
|
作者
Kumar, Shibu K. B. [1 ]
Samuel, Philip [2 ]
机构
[1] Coll Engn Trivandrum, Dept Comp Sci & Engn, Trivandrum 695016, Kerala, India
[2] Cochin Univ Sci & Technol, Dept Comp Sci, Cochin 682022, Kerala, India
关键词
Diseases; Clustering algorithms; Prediction algorithms; Blood; Optimization; Force; Unsupervised learning; Centroid based celestial clustering; clustering algorithms; disease prediction; haemogram data clustering; nature inspired methods; K-MEANS; GENERAL FRAMEWORK; OPTIMIZATION;
D O I
10.1109/TETCI.2022.3211004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accuracy of clustering is the most important parameter as far as automated disease identification is concerned. There have always been attempts to automate the process of disease prediction from haemogram data. However, there are several components in blood test results and very often we find that a variety of combinations of these component results are to be used to detect a disease. This makes identification of diseases really hard and necessitates the use of data analysis techniques. As new diseases are arising from time to time, a useful method for prediction is unsupervised learning and the corresponding data analysis technique is clustering. An easy, efficient and centroid based clustering algorithm that has been in practice widely is k-means. Its simplicity and efficiency make it a natural choice for most of the clustering applications. However, k-means is largely dependent on the selection of initial cluster centers and a bad choice can make it fall to local optima, thereby sacrificing accuracy. Besides, it is non-deterministic in nature. This paper proposes a novel, nature inspired, clustering method, named Centroid Based Celestial Clustering, which overcomes the above issues. Our method is deterministic and converges to global optima on spherical datasets. We experimentally evaluate our algorithm for speed of execution and cluster quality against well-known clustering algorithms using statistical evaluation metrics like silhouette width, adjusted rand index and Dunn index. We use the method to predict diseases identifiable from blood tests and our experiments show that the accuracy of prediction is very promising.
引用
收藏
页码:942 / 956
页数:15
相关论文
共 50 条
  • [11] Quantum spectral clustering algorithm for unsupervised learning
    Li, Qingyu
    Huang, Yuhan
    Jin, Shan
    Hou, Xiaokai
    Wang, Xiaoting
    SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (10)
  • [12] Quantum spectral clustering algorithm for unsupervised learning
    Qingyu Li
    Yuhan Huang
    Shan Jin
    Xiaokai Hou
    Xiaoting Wang
    Science China Information Sciences, 2022, 65
  • [13] An unsupervised clustering algorithm for data on the unit hypersphere
    Yang, Min-Shen
    Chang-Chien, Shou-Jen
    Hung, Wen-Liang
    APPLIED SOFT COMPUTING, 2016, 42 : 290 - 313
  • [14] Method for increasing the computation speed of an unsupervised learning approach for data clustering
    Yuwono, Mitchell
    Su, Steven W.
    Moulton, Bruce
    Hung Nguyen
    2012 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2012,
  • [15] Combining supervised and unsupervised learning for data clustering
    Paolo Corsini
    Beatrice Lazzerini
    Francesco Marcelloni
    Neural Computing & Applications, 2006, 15 : 289 - 297
  • [16] Combining supervised and unsupervised learning for data clustering
    Corsini, Paolo
    Lazzerini, Beatrice
    Marcelloni, Francesco
    NEURAL COMPUTING & APPLICATIONS, 2006, 15 (3-4): : 289 - 297
  • [17] A novel algorithm for data clustering
    Wong, CC
    Chen, CC
    Su, MC
    PATTERN RECOGNITION, 2001, 34 (02) : 425 - 442
  • [18] On unsupervised simultaneous kernel learning and data clustering
    Malhotra, Akshay
    Schizas, Ioannis D.
    PATTERN RECOGNITION, 2020, 108
  • [19] New Method for Power System Dynamic Stability Analysis Based on a Novel Unsupervised Clustering Algorithm
    Guan Lin
    Wang Tongwen
    Zhang Yao
    2009 ASIA-PACIFIC POWER AND ENERGY ENGINEERING CONFERENCE (APPEEC), VOLS 1-7, 2009, : 1726 - 1729
  • [20] A METHOD OF DATA CLUSTERING BASED ON DYNAMIC LEARNING
    Liu, Xiaohong
    DECISION MAKING AND SOFT COMPUTING, 2014, 9 : 282 - 287