A "Density-Based" Algorithm for Cluster Analysis Using Species Sampling Gaussian Mixture Models

被引:15
|
作者
Argiento, Raffaele [1 ]
Cremaschi, Andrea [2 ]
Guglielmi, Alessandra [3 ]
机构
[1] CNR IMATI, I-20133 Milan, Italy
[2] Univ Kent, Sch Math Stat & Actuarial Sci, Canterbury, Kent, England
[3] Politecn Milan, Dipartimento Matemat, I-20133 Milan, Italy
关键词
DBSCAN algorithm; Bayesian nonparametrics; Dirichlet process;
D O I
10.1080/10618600.2013.856796
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a new model for cluster analysis in a Bayesian nonparametric framework. Our model combines two ingredients, species sampling mixture models of Gaussian distributions on one hand, and a deterministic clustering procedure (DBSCAN) on the other. Here, two observations from the underlying species sampling mixture model share the same cluster if the distance between the densities corresponding to their latent parameters is smaller than a threshold; this yields a random partition which is coarser than the one induced by the species sampling mixture. Since this procedure depends on the value of the threshold, we suggest a strategy to fix it. In addition, we discuss implementation and applications of the model; comparison with more standard clustering algorithms will be given as well. Supplementary materials for the article are available online.
引用
收藏
页码:1126 / 1142
页数:17
相关论文
共 50 条
  • [1] A new density-based sampling algorithm
    Ros, Frederic
    Guillaume, Serge
    [J]. PROCEEDINGS OF THE 2015 CONFERENCE OF THE INTERNATIONAL FUZZY SYSTEMS ASSOCIATION AND THE EUROPEAN SOCIETY FOR FUZZY LOGIC AND TECHNOLOGY, 2015, 89 : 145 - 151
  • [2] A Density-Based Clustering Algorithm with Sampling for Travel Behavior Analysis
    Tang, Wang
    Pi, Dechang
    He, Yun
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2016, 2016, 9937 : 231 - 239
  • [3] Density-based clustering algorithm for mixture data sets
    Huang, De-Cai
    Wu, Tian-Hong
    [J]. Kongzhi yu Juece/Control and Decision, 2010, 25 (03): : 416 - 421
  • [4] DENDIS: A new density-based sampling for clustering algorithm
    Ros, Frederic
    Guillaume, Serge
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2016, 56 : 349 - 359
  • [5] Sampling strategies in density-based sensitivity analysis
    Castaings, W.
    Borgonovo, E.
    Morris, M. D.
    Tarantola, S.
    [J]. ENVIRONMENTAL MODELLING & SOFTWARE, 2012, 38 : 13 - 26
  • [6] An Improved Density-Based Cluster Analysis Method Combining Genetic Algorithm and Data Sampling for Large-Scale Datasets
    Ye Zonglin
    Cao Hui
    Wang Miaomiao
    Zhang Yanbin
    [J]. 2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 3552 - 3555
  • [7] Automated Operational Modal Analysis of a Helicopter Blade with a Density-Based Cluster Algorithm
    Sibille, Luigi
    Civera, Marco
    Fragonara, Luca Zanotti
    Ceravolo, Rosario
    [J]. AIAA JOURNAL, 2023, 61 (03) : 1411 - 1427
  • [8] Incremental relative density-based clustering algorithm for mixture data sets
    Huang, De-Cai
    Li, Xiao-Chang
    [J]. Kongzhi yu Juece/Control and Decision, 2013, 28 (06): : 815 - 822
  • [9] Multiobjective evolutionary algorithm based on mixture Gaussian models
    Zhou, Ai-Min
    Zhang, Qing-Fu
    Zhang, Gui-Xu
    [J]. Ruan Jian Xue Bao/Journal of Software, 2014, 25 (05): : 913 - 928
  • [10] Biased-Sampling of Density-based Local Outlier Detection Algorithm
    Fu, Peiguo
    Hu, Xiaohui
    [J]. 2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1246 - 1253