Analysis of diversity measures in clustering ensembles

被引:0
|
作者
Luo, Hui-Lan [1 ,2 ]
Kong, Fan-Sheng [1 ]
Li, Yi-Xiao [1 ]
机构
[1] Institute of Artificial Intelligence, Zhejiang University, Hangzhou 310027, China
[2] School of Information Engineering, Jiangxi University of Science and Technology, Ganzhou 341000, China
来源
Jisuanji Xuebao/Chinese Journal of Computers | 2007年 / 30卷 / 08期
关键词
Clustering algorithms - Learning systems;
D O I
暂无
中图分类号
学科分类号
摘要
The diversity of an ensemble is known to be an important factor in determining its performance. There are a number of ways to quantify diversity in ensembles of classifiers, while little research has been done in clustering ensembles. This paper compares seven diversity measures of clustering ensembles with regard to their possible use in ensemble design. Five experiments have been designed to examine the relationships between the accuracy of the clustering ensembles and the measures of diversity under conditions of difference ensemble methods, different ensemble size and different data distributions respectively. Experiments show the relationships between these diversity measures and ensemble performances are not monotonous. However, when constructing ensembles with moderate ensemble size by suitable clustering algorithms for a given data set with uniform cluster distribution, the correlation coefficients between the diversity measures and ensemble performances are relatively high. Finally, the authors give some useful suggestions about the usefulness of diversity measures in building clustering ensembles.
引用
收藏
页码:1315 / 1324
相关论文
共 50 条
  • [31] A survey: Clustering ensembles techniques
    Ghaemi, Reza
    Sulaiman, Nasir
    Ibrahim, Hamidah
    Mustapha, Norwati
    World Academy of Science, Engineering and Technology, 2009, 38 : 644 - 653
  • [32] Clustering Ensembles with Active Constraints
    Al-Razgan, Muna
    Domeniconi, Carlotta
    APPLICATIONS OF SUPERVISED AND UNSUPERVISED ENSEMBLE METHODS, 2009, 245 : 175 - 189
  • [33] A mixture model for clustering ensembles
    Topchy, A
    Jain, AK
    Punch, W
    PROCEEDINGS OF THE FOURTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2004, : 379 - 390
  • [34] Framework for Active Clustering With Ensembles
    Barr, Jeremiah R.
    Bowyer, Kevin W.
    Flynn, Patrick J.
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2014, 9 (11) : 1986 - 2001
  • [35] Clustering ensembles of social networks
    Sweet, Tracy M.
    Flynt, Abby
    Choi, David
    NETWORK SCIENCE, 2019, 7 (02) : 141 - 159
  • [36] The Performance Factors of Clustering Ensembles
    Amasyali, M. Fatih
    Ersoy, Okan
    2008 IEEE 16TH SIGNAL PROCESSING, COMMUNICATION AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2008, : 201 - 204
  • [37] Nonparametric Bayesian Clustering Ensembles
    Wang, Pu
    Domeniconi, Carlotta
    Laskey, Kathryn Blackmond
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III, 2010, 6323 : 435 - 450
  • [38] The Diversity/Accuracy Dilemma: An Empirical Analysis in the Context of Heterogeneous Ensembles
    de Oliveira, Diogo F.
    Canuto, Anne M. P.
    de Souto, Marcilio C. P.
    2009 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-5, 2009, : 939 - 946
  • [39] Similarity Measures for Protein Ensembles
    Lindorff-Larsen, Kresten
    Ferkinghoff-Borg, Jesper
    PLOS ONE, 2009, 4 (01):
  • [40] Comparative Analysis of Similarity Measures in Document Clustering
    Karun, Kavitha A.
    Philip, Mintu
    Lubna, K.
    2013 INTERNATIONAL CONFERENCE ON GREEN COMPUTING, COMMUNICATION AND CONSERVATION OF ENERGY (ICGCE), 2013, : 857 - 860