Ensemble clustering using extended fuzzy k-means for cancer data analysis

Cited by: 24
Authors
Khan, Imran [1]
Luo, Zongwei [3]
Shaikh, Abdul Khalique [2]
Hedjam, Rachid [1]
Affiliations
[1] Sultan Qaboos Univ, Coll Sci, Dept Comp Sci, POB 31, Muscat 123, Oman
[2] Sultan Qaboos Univ, Dept Informat Syst, POB 31, Muscat 123, Oman
[3] Beijing Normal Univ Zhuhai, BNU UIC Inst Artificial Intelligence & Future Net, BNU HKBU United Int Coll Tangjiawan, Rd JinTong 2000, Zhuhai, Guangdong, Peoples R China
Keywords
Fuzzy k-means; Cluster analysis; Cancer data; Variable weights; GENE-EXPRESSION DATA; MICROARRAY DATA; CLASS DISCOVERY; VALIDATION; PREDICTION; PROFILE;
DOI
10.1016/j.eswa.2021.114622
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Cluster analysis of gene expression profiles is a significant research topic in cancer discovery and is important for successfully diagnosing and treating the disease. Many ensemble clustering methods have been developed to cluster tumor data. Only a few of them incorporate a significant number of input clusterings, the optimal number of clusters in each input clustering, and an appropriate ensemble method to combine the input clusterings into a final clustering. In this paper, we introduce two new steps in the standard fuzzy k-means algorithm to determine the optimal number of input clusterings and the optimal number of clusters in each clustering for ensemble clustering. The first step incorporates a penalty term that makes the algorithm insensitive to the initialization of cluster centroids. The second step automates the clustering process by iteratively updating the feature weights, which addresses noise values in the dataset. We propose an ensemble clustering method that combines a set of input clusterings into a final clustering with better overall quality. Experiments on real cancer gene expression profiles illustrate that the proposed algorithm outperforms well-known clustering algorithms.
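For orientation, the following is a minimal Python sketch of the two building blocks the abstract refers to: the standard fuzzy k-means algorithm and the combination of several input clusterings. It is not the authors' extended method. The penalty term and the feature-weight update are omitted because the abstract does not give their form, and the co-association matrix is used here only as one common way to summarize an ensemble, which is an assumption. The names `fuzzy_kmeans` and `coassociation`, the fuzzifier `m=2.0`, and the synthetic two-group data are all hypothetical.

```python
import numpy as np


def fuzzy_kmeans(X, k, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Standard fuzzy k-means (no penalty term, no feature weights)."""
    rng = np.random.default_rng(seed)
    # Random initial memberships; each row sums to 1.
    U = rng.random((X.shape[0], k))
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        W = U ** m
        # Centroids are membership-weighted means of the samples.
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        # Squared Euclidean distance from every sample to every centroid.
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        d2 = np.maximum(d2, 1e-12)  # guard against division by zero
        inv = d2 ** (-1.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return U, centers


def coassociation(labelings):
    """Fraction of input clusterings in which two samples share a cluster."""
    L = np.asarray(labelings)
    return (L[:, :, None] == L[:, None, :]).mean(axis=0)


# Synthetic stand-in for expression profiles: two well-separated groups.
rng = np.random.default_rng(42)
X = np.vstack([rng.normal(0.0, 0.3, (20, 5)), rng.normal(5.0, 0.3, (20, 5))])

# Build several input clusterings from different initializations.
labelings = []
for seed in range(5):
    U, _ = fuzzy_kmeans(X, k=2, seed=seed)
    labelings.append(U.argmax(axis=1))  # harden memberships to labels

C = coassociation(labelings)
```

A final clustering could then be derived from `C`, for example by hierarchical clustering on `1 - C`. That choice is illustrative and is not taken from the paper.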
Pages: 11