Distribution-Based Cluster Structure Selection

被引:44
|
作者
Yu, Zhiwen [1 ,2 ]
Zhu, Xianjun [1 ]
Wong, Hau-San [3 ]
You, Jane [4 ]
Zhang, Jun [5 ]
Han, Guoqiang [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R China
[2] Hong Kong Polytech Univ, Hong Kong, Hong Kong, Peoples R China
[3] City Univ Hong Kong, Dept Comp Sci, Hong Kong 852, Hong Kong, Peoples R China
[4] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
[5] Sun Yat Sen Univ, Sch Adv Comp, Guangzhou 510275, Guangdong, Peoples R China
关键词
Cluster ensemble; clustering analysis; expectation-maximization (EM); Gaussian mixture model (GMM); graph cut; hypergraph; ENSEMBLE FRAMEWORK; CONSENSUS; COMBINATION; SEARCH;
D O I
10.1109/TCYB.2016.2569529
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The objective of cluster structure ensemble is to find a unified cluster structure from multiple cluster structures obtained from different datasets. Unfortunately, not all the cluster structures contribute to the unified cluster structure. This paper investigates the problem of how to select the suitable cluster structures in the ensemble which will be summarized to a more representative cluster structure. Specifically, the cluster structure is first represented by a mixture of Gaussian distributions, the parameters of which are estimated using the expectation-maximization algorithm. Then, several distribution-based distance functions are designed to evaluate the similarity between two cluster structures. Based on the similarity comparison results, we propose a new approach, which is referred to as the distribution-based cluster structure ensemble (DCSE) framework, to find the most representative unified cluster structure. We then design a new technique, the distribution-based cluster structure selection strategy (DCSSS), to select a subset of cluster structures. Finally, we propose using a distribution-based normalized hypergraph cut algorithm to generate the final result. In our experiments, a nonparametric test is adopted to evaluate the difference between DCSE and its competitors. We adopt 20 real-world datasets obtained from the University of California, Irvine and knowledge extraction based on evolutionary learning repositories, and a number of cancer gene expression profiles to evaluate the performance of the proposed methods. The experimental results show that: 1) DCSE works well on the real-world datasets and 2) DCSE based on DCSSS can further improve the performance of the algorithm.
引用
收藏
页码:3554 / 3567
页数:14
相关论文
共 50 条
  • [31] Distribution-Based Global Sensitivity Analysis in Hydrology
    Ciriello, Valentina
    Lauriola, Ilaria
    Tartakovsky, Daniel M.
    WATER RESOURCES RESEARCH, 2019, 55 (11) : 8708 - 8720
  • [33] Distribution-based CFAR detection in SAR images
    Gan, RB
    Wang, JG
    IGARSS 2005: IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-8, PROCEEDINGS, 2005, : 1753 - 1756
  • [34] A DISTRIBUTION-BASED APPROACH TO DECISION RISK ANALYSIS
    IKERD, JE
    ANDERSON, KB
    AMERICAN JOURNAL OF AGRICULTURAL ECONOMICS, 1986, 68 (05) : 1387 - 1387
  • [35] Distribution-Based Bisimulation for Labelled Markov Processes
    Yang, Pengfei
    Jansen, David N.
    Zhang, Lijun
    FORMAL MODELING AND ANALYSIS OF TIMED SYSTEMS (FORMATS 2017), 2017, 10419 : 170 - 186
  • [36] Distribution-based selective classifiers for incomplete data
    Chen, Jingnian
    Huang, Houkuan
    Yang, Liping
    Tian, Fengzhan
    Beijing Jiaotong Daxue Xuebao/Journal of Beijing Jiaotong University, 2008, 32 (02): : 26 - 29
  • [37] RF Ultrasound Distribution-Based Confidence Maps
    Klein, Tassilo
    Wells, William M., III
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2015, PT II, 2015, 9350 : 595 - 602
  • [38] Uncertain Distribution-Based Similarity Measure of Concepts
    Li, Shuai
    Yang, Jie
    Qi, Zhipeng
    Zeng, Juanli
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020 (2020)
  • [39] Distribution-based CFAR detectors in SAR images
    Gan Rongbing1
    2.School of Electronic Engineering
    JournalofSystemsEngineeringandElectronics, 2006, (04) : 717 - 721
  • [40] Dynamic Data Distribution-based Curriculum Learning
    Chaudhry, Shonal
    Sharma, Anuraganand
    INFORMATION SCIENCES, 2025, 702