A Clustering Ensemble Method for Clustering Mixed Data

被引:0
|
作者
Al-Shaqsi, Jamil [1 ]
Wang, Wenjia [1 ]
机构
[1] Univ E Anglia, Sch Comp Sci, Norwich NR4 7TJ, Norfolk, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a clustering ensemble method based on our novel three-staged clustering algorithm. A clustering ensemble is a paradigm that seeks to best combine the outputs of several clustering algorithms with a decision fusion function to achieve a more accurate and stable final output. Our ensemble is constructed with our proposed clustering algorithm as a core modelling method that is used to generate a series of clustering results with different conditions for a given dataset. Then, a decision aggregation mechanism such as voting is employed to find a combined partition of the different clusters. The voting mechanism considered only experimental results that produce intra-similarity value higher than the average intra-similarity value for a particular interval. The aim of this procedure is to find a clustering result that minimizes the number of disagreements between different clustering results. Our ensemble method has been tested on 11 benchmark datasets and compared with some individual methods including TwoStep, k-means, squeezer, k-prototype and some ensemble based methods including k-ANMI, ccdByEnsemble, SIPR, and SICM. The experimental results showed its strengths over the compared clustering algorithms.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Incremental clustering algorithm of mixed numerical and categorical data based on clustering ensemble
    Li, Tao-Ying
    Chen, Yan
    Zhang, Jin-Song
    Qin, Sheng-Jun
    [J]. Kongzhi yu Juece/Control and Decision, 2012, 27 (04): : 603 - 608
  • [2] Approximate Clustering Ensemble Method for Big Data
    Mahmud, Mohammad Sultan
    Huang, Joshua Zhexue
    Ruby, Rukhsana
    Ngueilbaye, Alladoumbaye
    Wu, Kaishun
    [J]. IEEE TRANSACTIONS ON BIG DATA, 2023, 9 (04) : 1142 - 1155
  • [3] A semiparametric method for clustering mixed data
    Foss, Alex
    Markatou, Marianthi
    Ray, Bonnie
    Heching, Aliza
    [J]. MACHINE LEARNING, 2016, 105 (03) : 419 - 458
  • [4] A semiparametric method for clustering mixed data
    Alex Foss
    Marianthi Markatou
    Bonnie Ray
    Aliza Heching
    [J]. Machine Learning, 2016, 105 : 419 - 458
  • [5] Clustering ensemble method
    Alqurashi, Tahani
    Wang, Wenjia
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (06) : 1227 - 1246
  • [6] Clustering ensemble method
    Tahani Alqurashi
    Wenjia Wang
    [J]. International Journal of Machine Learning and Cybernetics, 2019, 10 : 1227 - 1246
  • [7] Clustering analysis of microarray gene expression data with new clustering ensemble method
    Luo, Fei
    Liu, Juan
    [J]. PROGRESS IN INTELLIGENCE COMPUTATION AND APPLICATIONS, PROCEEDINGS, 2007, : 500 - 504
  • [8] Incremental Clustering for Categorical Data Using Clustering Ensemble
    Li Taoying
    Chne Yan
    Qu Lili
    Mu Xiangwei
    [J]. PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 2519 - 2524
  • [9] Clustering mixed data
    Hunt, Lynette
    Jorgensen, Murray
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (04) : 352 - 361
  • [10] Clustering cancer gene expression data by projective clustering ensemble
    Yu, Xianxue
    Yu, Guoxian
    Wang, Jun
    [J]. PLOS ONE, 2017, 12 (02):