From cluster ensemble to structure ensemble

被引:31
|
作者
Yu, Zhiwen [1 ,2 ]
You, Jane [2 ]
Wong, Hau-San [3 ]
Han, Guoqiang [1 ]
机构
[1] S China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Guangdong, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
[3] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Cluster ensemble; Structure ensemble; CLASSIFIER ENSEMBLES; MICROARRAY DATA; RELIABILITY; STABILITY; CONSENSUS; CANCER;
D O I
10.1016/j.ins.2012.02.019
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates the problem of integrating multiple structures which are extracted from different sets of data points into a single unified structure. We first propose a new generalized concept called structure ensemble for the fusion of multiple structures. Unlike traditional cluster ensemble approaches the main objective of which is to align individual labels obtained from different clustering solutions, the structure ensemble approach focuses on how to unify the structures obtained from different data sources. Based on this framework, a new structure ensemble approach called the probabilistic bagging based structure ensemble approach (BSEA) is designed, which integrates the bagging technique, the force based self-organizing map (FBSOM) and the normalized cut algorithm into the proposed framework. BSEA views structures obtained from different datasets generated by the bagging technique as nodes in a graph, and adopts graph theory to find the most representative structure. In addition, the force based self-organizing map (FBSOM), which is a generalized form of SOM, is proposed to serve as the basic clustering algorithm in the structure ensemble framework. Finally, a new external index called correlation index (CI), which considers the correlation relationship of both the similarity and dissimilarity between the predicted solution and the true solution, is proposed to evaluate the performance of BSEA. The experiments show that (i) The performance of BSEA outperforms most of the state-of-the-art clustering approaches, and (ii) BSEA performs well on datasets from the UCI repository and real cancer gene expression profiles. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:81 / 99
页数:19
相关论文
共 50 条
  • [31] Latent variable model for cluster ensemble
    Wang, Hong-Jun
    Li, Zhi-Shu
    Cheng, Yang
    Zhou, Peng
    Zhou, Wei
    Ruan Jian Xue Bao/Journal of Software, 2009, 20 (04): : 825 - 833
  • [32] Experimental comparison of cluster ensemble methods
    Kuncheva, L. I.
    Hadjitodorov, S. T.
    Todorova, L. P.
    2006 9TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, VOLS 1-4, 2006, : 384 - 390
  • [33] CLUSTER ENSEMBLE SELECTION Using Average Cluster Consistency
    Duarte, F. Jorge F.
    Duarte, Joao M. M.
    Rodrigues, M. Fatima C.
    Fred, Ana L. N.
    KDIR 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2009, : 85 - +
  • [34] Ensemble learning from ensemble docking: revisiting the optimum ensemble size problem
    Sara Mohammadi
    Zahra Narimani
    Mitra Ashouri
    Rohoullah Firouzi
    Mohammad Hossein Karimi‐Jafari
    Scientific Reports, 12
  • [35] Ensemble learning from ensemble docking: revisiting the optimum ensemble size problem
    Mohammadi, Sara
    Narimani, Zahra
    Ashouri, Mitra
    Firouzi, Rohoullah
    Karimi-Jafari, Mohammad Hossein
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [36] Downscaling of ECMWF ensemble forecasts for cases of severe weather: Ensemble statistics and cluster analysis
    Brankovic, Cedo
    Matjacic, Blazenka
    Ivatek-Sahdan, Stjepan
    Buizza, Roberto
    MONTHLY WEATHER REVIEW, 2008, 136 (09) : 3323 - 3342
  • [37] Cluster-Oriented Ensemble Classifier: Impact of Multicluster Characterization on Ensemble Classifier Learning
    Verma, Brijesh
    Rahman, Ashfaqur
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (04) : 605 - 618
  • [38] An ensemble approach for generating partitional clusters from multiple cluster hierarchies
    Hossain, Mahmood
    Bridges, Susan M.
    Wang, Yong
    Hodges, Julia E.
    2006 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, 2006, : 666 - +
  • [39] INFERENCE OF CLUSTER PHASE FROM CONSIDERATIONS OF HOMOGENEOUS NUCLEATION IN AN EVAPORATIVE ENSEMBLE
    BARTELL, LS
    JOURNAL OF PHYSICAL CHEMISTRY, 1992, 96 (01): : 108 - 111
  • [40] Knowledge Based Cluster Ensemble for Cancer Discovery From Biomolecular Data
    Yu, Zhiwen
    Wongb, Hau-San
    You, Jane
    Yang, Qinmin
    Liao, Hongying
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2011, 10 (02) : 76 - 85