Distribution-Based Cluster Structure Selection

被引:44
|
作者
Yu, Zhiwen [1 ,2 ]
Zhu, Xianjun [1 ]
Wong, Hau-San [3 ]
You, Jane [4 ]
Zhang, Jun [5 ]
Han, Guoqiang [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R China
[2] Hong Kong Polytech Univ, Hong Kong, Hong Kong, Peoples R China
[3] City Univ Hong Kong, Dept Comp Sci, Hong Kong 852, Hong Kong, Peoples R China
[4] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
[5] Sun Yat Sen Univ, Sch Adv Comp, Guangzhou 510275, Guangdong, Peoples R China
关键词
Cluster ensemble; clustering analysis; expectation-maximization (EM); Gaussian mixture model (GMM); graph cut; hypergraph; ENSEMBLE FRAMEWORK; CONSENSUS; COMBINATION; SEARCH;
D O I
10.1109/TCYB.2016.2569529
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The objective of cluster structure ensemble is to find a unified cluster structure from multiple cluster structures obtained from different datasets. Unfortunately, not all the cluster structures contribute to the unified cluster structure. This paper investigates the problem of how to select the suitable cluster structures in the ensemble which will be summarized to a more representative cluster structure. Specifically, the cluster structure is first represented by a mixture of Gaussian distributions, the parameters of which are estimated using the expectation-maximization algorithm. Then, several distribution-based distance functions are designed to evaluate the similarity between two cluster structures. Based on the similarity comparison results, we propose a new approach, which is referred to as the distribution-based cluster structure ensemble (DCSE) framework, to find the most representative unified cluster structure. We then design a new technique, the distribution-based cluster structure selection strategy (DCSSS), to select a subset of cluster structures. Finally, we propose using a distribution-based normalized hypergraph cut algorithm to generate the final result. In our experiments, a nonparametric test is adopted to evaluate the difference between DCSE and its competitors. We adopt 20 real-world datasets obtained from the University of California, Irvine and knowledge extraction based on evolutionary learning repositories, and a number of cancer gene expression profiles to evaluate the performance of the proposed methods. The experimental results show that: 1) DCSE works well on the real-world datasets and 2) DCSE based on DCSSS can further improve the performance of the algorithm.
引用
收藏
页码:3554 / 3567
页数:14
相关论文
共 50 条
  • [1] Road traffic estimation and distribution-based route selection
    Kamphuis, Rens
    Mandjes, Michel
    Serra, Paulo
    ELECTRONIC JOURNAL OF STATISTICS, 2025, 19 (01): : 865 - 920
  • [2] DDES: A Distribution-Based Dynamic Ensemble Selection Framework
    Choi, Ye-Rim
    Lim, Dong-Joon
    IEEE ACCESS, 2021, 9 : 40743 - 40754
  • [3] Beta Distribution-Based Cross-Entropy for Feature Selection
    Dai, Weixing
    Guo, Dianjing
    ENTROPY, 2019, 21 (08)
  • [4] 3Sigma: Distribution-based cluster scheduling for runtime uncertainty
    Park, Jun Woo
    Tumanov, Alexey
    Jiang, Angela
    Kozuch, Michael A.
    Ganger, Gregory R.
    EUROSYS '18: PROCEEDINGS OF THE THIRTEENTH EUROSYS CONFERENCE, 2018,
  • [5] A Laplace Distribution-based Fuzzy-rough Feature Selection Algorithm
    Han, Xiaomeng
    Qu, Yanpeng
    Deng, Ansheng
    PROCEEDINGS OF 2018 TENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2018, : 776 - 781
  • [6] Conditional Joint Distribution-Based Test Selection for Fault Detection and Isolation
    Li, Yang
    Wang, Xiuli
    Lu, Ningyun
    Jiang, Bin
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (12) : 13168 - 13180
  • [7] GAUSSIAN DISTRIBUTION-BASED MODE SELECTION FOR INTRA PREDICTION OF SPATIAL SHVC
    Wang, Dayong
    Wang, Xin
    Sun, Yu
    Li, Weisheng
    Lu, Xin
    Dufaux, Frederic
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2711 - 2715
  • [8] Distribution-based Adversarial Filter Feature Selection against Evasion Attack
    Chan, Patrick P. K.
    Liang, YuanChao
    Zhang, Fei
    Yeung, Daniel S.
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [9] A demonstration of distribution-based calibration
    Markou, Ioulia
    Papathanasopoulou, Vasileia
    Antoniou, Constantinos
    2015 INTERNATIONAL CONFERENCE ON MODELS AND TECHNOLOGIES FOR INTELLIGENT TRANSPORTATION SYSTEMS (MT-ITS), 2015, : 109 - 115
  • [10] A possibility distribution-based multicriteria decision algorithm for resilient supplier selection problems
    Jiang, Dizuo
    Hassan, Md Mahmudul
    Ibn Faiz, Tasnim
    Noor-E-Alam, Md
    JOURNAL OF MULTI-CRITERIA DECISION ANALYSIS, 2020, 27 (3-4) : 203 - 223