Data clustering via cooperative games: A novel approach and comparative study

Cited by: 4
Authors
Coelho, Andre L. V. [1 ]
Sandes, Nelson C. [2 ]
Affiliations
[1] Univ Fortaleza UNIFOR, Ctr Technol Sci, Grad Program Appl Informat, Fortaleza, Ceara, Brazil
[2] Fed Univ Cariri UFCA, Dept Comp Sci, Ctr Technol Sci, Juazeiro Do Norte, Brazil
Keywords
Data clustering; Cooperative game theory; Coalitions; Hedonic games; Shapley value; Nash stability; Coalition formation
DOI
10.1016/j.ins.2020.09.018
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Arguably, the main purpose of cluster analysis is to develop algorithms that reveal natural groupings (clusterings) in a set of data points based on their similarity. Cooperative game theory (CGT), in turn, studies the formation of groups (coalitions) of decision makers (players) and ways to split the resulting income among them. Owing to the conceptual similarity between these fields, algorithms rooted in CGT have recently emerged for tackling the data clustering problem. In this work, we revisit two such algorithms, one based on cluster prototypes (Biobjective Game Clustering, BiGC) and the other based on dense regions of data points (Density-Restricted Agglomerative Clustering, DRAC). We also present a novel partitional clustering algorithm, referred to as HGC (after Hedonic Game based Clustering), which is grounded on theoretical results stemming from the subclass of hedonic games. Two HGC versions are investigated, which differ in the order of the players in the game, and a detailed factorial simulation study is reported to analyze how sensitive they are to three relevant factors, namely number of clusters, number of features, and noise level. In addition, a heuristic for calibrating the value of HGC's single control parameter (viz., the number of nearest neighbors of each point) is provided, so as to yield high-quality partitions. To compare the performance of the CGT algorithms, a series of experiments was conducted on UCI and gene-expression data sets, most of which are high dimensional. Overall, the results, measured by 10 external validation indices, indicate that HGC is usually more stable and effective than DRAC and BiGC. They also show that HGC is very competitive with (and sometimes considerably better than) well-known clustering algorithms and variants (specifically, k-means, k-means++, affinity propagation, two variants of hierarchical clustering, and the density peak clustering algorithm). Remarkably, HGC could fully recover the true clustering structures for two gene-expression data sets. © 2020 Elsevier Inc. All rights reserved.
Pages: 791-812
Number of pages: 22
Related papers
50 in total
  • [1] Applying Cooperative Games with Coalition Structure for Data Clustering
    Bure, V. M.
    Staroverova, K. Yu.
    [J]. AUTOMATION AND REMOTE CONTROL, 2019, 80 (08) : 1541 - 1551
  • [2] Applying Cooperative Games with Coalition Structure for Data Clustering
    V. M. Bure
    K. Yu. Staroverova
    [J]. Automation and Remote Control, 2019, 80 : 1541 - 1551
  • [3] A novel approach to the clustering of microarray data via nonparametric density estimation
    De Bin, Riccardo
    Risso, Davide
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [4] A novel approach to the clustering of microarray data via nonparametric density estimation
    Riccardo De Bin
    Davide Risso
    [J]. BMC Bioinformatics, 12
  • [5] A Novel Approach to Clustering Algorithms and their Comparative Performance Analysis on Different Data Set
    Lamba, Manish
    Dash, Sagarjit
    Jamwal, Atul Singh
    [J]. 2017 INTERNATIONAL CONFERENCE ON NEXT GENERATION COMPUTING AND INFORMATION SYSTEMS (ICNGCIS), 2017, : 164 - 167
  • [6] An improved competitive and cooperative learning approach for data clustering
    Wang, Shao-ping
    Pei, Wen-jiang
    Cheung, Yiu-ming
    [J]. CIS: 2007 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PROCEEDINGS, 2007, : 320 - +
  • [7] A competitive and cooperative learning approach to robust data clustering
    Cheung, YM
    [J]. PROCEEDINGS OF THE SECOND IASTED INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND COMPUTATIONAL INTELLIGENCE, 2004, : 131 - 136
  • [8] Electroencephalogram Signal Clustering With Convex Cooperative Games
    Dai, Chenglong
    Wu, Jia
    Pi, Dechang
    Cui, Lin
    Johnson, Blake
    Becker, Stefanie, I
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (12) : 5755 - 5769
  • [9] A Cooperative Multi-population Approach to Clustering Temporal Data
    Georgieva, Kristina
    Engelbrecht, Andries P.
    [J]. 2013 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2013, : 1983 - 1991
  • [10] Modelling and Verification Analysis of Cooperative and Non-Cooperative Games via a Modal Logic Approach
    Konigsberg, Zvi Retchkiman
    [J]. COMPUTACION Y SISTEMAS, 2021, 25 (03) : 515 - 521