Study on the Influence of Diversity and Quality in Entropy Based Collaborative Clustering

被引:3
|
作者
Sublime, Jeremie [1 ,2 ]
Cabanes, Guenael [2 ]
Matei, Basarab [2 ]
机构
[1] DaSSIP Team LISITE, ISEP, 10 Rue Vanves, F-92130 Issy Les Moulineaux, France
[2] Univ Paris 13, Sorbonne Paris Cite, LIPN CNRS UMR 7030, 99 Ave JB Clement, F-93430 Villetaneuse, France
关键词
collaborative clustering; clustering quality; entropy; diversity;
D O I
10.3390/e21100951
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The aim of collaborative clustering is to enhance the performances of clustering algorithms by enabling them to work together and exchange their information to tackle difficult data sets. The fundamental concept of collaboration is that clustering algorithms operate locally but collaborate by exchanging information about the local structures found by each algorithm. This kind of collaborative learning can be beneficial to a wide number of tasks including multi-view clustering, clustering of distributed data with privacy constraints, multi-expert clustering and multi-scale analysis. Within this context, the main difficulty of collaborative clustering is to determine how to weight the influence of the different clustering methods with the goal of maximizing the final results and minimizing the risk of negative collaborations-where the results are worse after collaboration than before. In this paper, we study how the quality and diversity of the different collaborators, but also the stability of the partitions can influence the final results. We propose both a theoretical analysis based on mathematical optimization, and a second study based on empirical results. Our findings show that on the one hand, in the absence of a clear criterion to optimize, a low diversity pool of solution with a high stability are the best option to ensure good performances. And on the other hand, if there is a known criterion to maximize, it is best to rely on a higher diversity pool of solution with a high quality on the said criterion. While our approach focuses on entropy based collaborative clustering, we believe that most of our results could be extended to other collaborative algorithms.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Entropy Based Clustering of Viral Sequences
    Juyal, Akshay
    Hosseini, Roya
    Novikov, Daniel
    Grinshpon, Mark
    Zelikovsky, Alex
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2022, 2022, 13760 : 369 - 380
  • [22] Clustering ensemble selection considering quality and diversity
    Abbasi, Sadr-olah
    Nejatian, Samad
    Parvin, Hamid
    Rezaie, Vahideh
    Bagherifard, Karamolah
    ARTIFICIAL INTELLIGENCE REVIEW, 2019, 52 (02) : 1311 - 1340
  • [23] Clustering ensemble selection considering quality and diversity
    Sadr-olah Abbasi
    Samad Nejatian
    Hamid Parvin
    Vahideh Rezaie
    Karamolah Bagherifard
    Artificial Intelligence Review, 2019, 52 : 1311 - 1340
  • [24] Collaborative filtering-based recommendations against shilling attacks with particle swarm optimiser and entropy-based mean clustering
    Verma, Anjani Kumar
    Dixit, Veer Sain
    INTERNATIONAL JOURNAL OF INFORMATION AND COMPUTER SECURITY, 2023, 20 (1-2) : 133 - 144
  • [25] Semantic Web content analysis: A study in proximity-based collaborative clustering
    Loia, Vincenzo
    Pedrycz, Witold
    Senatore, Sabrina
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2007, 15 (06) : 1294 - 1312
  • [26] How the Outliers Influence the Quality of Clustering?
    Nowak-Brzezinska, Agnieszka
    Gaibei, Igor
    ENTROPY, 2022, 24 (07)
  • [27] The Influence of Data Quality on Clustering Outcomes
    Sivogolovko, Elena
    DATABASES AND INFORMATION SYSTEMS VII, 2013, 249 : 95 - 105
  • [29] Balanced clustering based on collaborative neurodynamic optimization
    Dai, Xiangguang
    Wang, Jun
    Zhang, Wei
    KNOWLEDGE-BASED SYSTEMS, 2022, 250
  • [30] Collaborative Filtering Algorithm Based On User Clustering
    Deng, Zhao
    Wang, Jin
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 1044 - 1048