Centroid Based Celestial Clustering Algorithm: A Novel Unsupervised Learning Method for Haemogram Data Clustering

被引:2
|
作者
Kumar, Shibu K. B. [1 ]
Samuel, Philip [2 ]
机构
[1] Coll Engn Trivandrum, Dept Comp Sci & Engn, Trivandrum 695016, Kerala, India
[2] Cochin Univ Sci & Technol, Dept Comp Sci, Cochin 682022, Kerala, India
关键词
Diseases; Clustering algorithms; Prediction algorithms; Blood; Optimization; Force; Unsupervised learning; Centroid based celestial clustering; clustering algorithms; disease prediction; haemogram data clustering; nature inspired methods; K-MEANS; GENERAL FRAMEWORK; OPTIMIZATION;
D O I
10.1109/TETCI.2022.3211004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accuracy of clustering is the most important parameter as far as automated disease identification is concerned. There have always been attempts to automate the process of disease prediction from haemogram data. However, there are several components in blood test results and very often we find that a variety of combinations of these component results are to be used to detect a disease. This makes identification of diseases really hard and necessitates the use of data analysis techniques. As new diseases are arising from time to time, a useful method for prediction is unsupervised learning and the corresponding data analysis technique is clustering. An easy, efficient and centroid based clustering algorithm that has been in practice widely is k-means. Its simplicity and efficiency make it a natural choice for most of the clustering applications. However, k-means is largely dependent on the selection of initial cluster centers and a bad choice can make it fall to local optima, thereby sacrificing accuracy. Besides, it is non-deterministic in nature. This paper proposes a novel, nature inspired, clustering method, named Centroid Based Celestial Clustering, which overcomes the above issues. Our method is deterministic and converges to global optima on spherical datasets. We experimentally evaluate our algorithm for speed of execution and cluster quality against well-known clustering algorithms using statistical evaluation metrics like silhouette width, adjusted rand index and Dunn index. We use the method to predict diseases identifiable from blood tests and our experiments show that the accuracy of prediction is very promising.
引用
收藏
页码:942 / 956
页数:15
相关论文
共 50 条
  • [41] Robust Federated Learning Based on Metrics Learning and Unsupervised Clustering for Malicious Data Detection
    Li, Jiaming
    Zhang, Xinyue
    Zhao, Liang
    ACMSE 2022: PROCEEDINGS OF THE 2022 ACM SOUTHEAST CONFERENCE, 2022, : 238 - 242
  • [42] A novel data clustering algorithm based on modified gravitational search algorithm
    Han, XiaoHong
    Quan, Long
    Xiong, XiaoYan
    Almeter, Matt
    Xiang, Jie
    Lan, Yuan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 61 : 1 - 7
  • [43] Heuristic Clustering Based on Centroid Learning and Cognitive Feature Capturing
    Li, Chunzhong
    Zhang, Yunong
    Chen, Xu
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2019, 2019
  • [44] Image Classification Algorithm Based on Proposal Region Clustering Learning-Unsupervised Deep Learning
    Li, Lei
    Yin, Xiao-li
    JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2023, 18 (02) : 1337 - 1349
  • [45] Image Classification Algorithm Based on Proposal Region Clustering Learning-Unsupervised Deep Learning
    Lei Li
    Xiao-li Yin
    Journal of Electrical Engineering & Technology, 2023, 18 : 1337 - 1349
  • [46] A Deep Unsupervised Learning Algorithm for Clustering of Wind Frequency Maps
    Pantula, Priyanka D.
    Miriyala, Srinivas S.
    Mitra, Kishalay
    2022 EIGHTH INDIAN CONTROL CONFERENCE, ICC, 2022, : 361 - 366
  • [47] A Self-learning Clustering Algorithm Based on Clustering Coefficient
    Zhong, MingJie
    Ding, ZhiJun
    Sun, HaiChun
    Wang, PengWei
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2014, PT I, 2014, 8786 : 79 - 94
  • [48] A Self-Learning Clustering Algorithm Based on Clustering Coefficient
    Zhong, Mingjie
    Ding, Zhijun
    Sun, Haichun
    Wang, Pengwei
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8786 : 79 - 94
  • [49] Centroid Location Technology Based on Fuzzy Clustering and Data Consistency
    Xue, Shanliang
    Li, Mengying
    Yang, Peiru
    CLOUD COMPUTING AND SECURITY, PT V, 2018, 11067 : 138 - 147
  • [50] A Novel Data Association Algorithm based on Intuitionistic Fuzzy Clustering
    Li Liang-qun
    Xie Wei-xin
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 2121 - 2124