Clustering with density based initialization and Bhattacharyya based merging

被引:0
|
作者
Kose, Erdem [1 ]
Hocaoglu, Ali Koksal [1 ]
机构
[1] Gebze Tech Univ, Fac Engn, Dept Elect Engn, Kocaeli, Turkey
关键词
Infinite mixture models; density estimation; Jensen inequality; bandwidth selection; optimal number of; K-MEANS; DISTANCE; NUMBER;
D O I
10.55730/1300-0632.3794
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Centroid based clustering approaches, such as k-means, are relatively fast but inaccurate for arbitrary shape clusters. Fuzzy c-means with Mahalanobis distance can accurately identify clusters if data set can be modelled by a mixture of Gaussian distributions. However, they require number of clusters apriori and a bad initialization can cause poor results. Density based clustering methods, such as DBSCAN, overcome these disadvantages. However, they may perform poorly when the dataset is imbalanced. This paper proposes a clustering method, named clustering with density initialization and Bhattacharyya based merging based on the fuzzy clustering. The initialization is carried out by density estimation with adaptive bandwidth using k-Nearest Orthant-Neighb or algorithm to avoid the effects of imbalanced clusters. The local peaks of the point clouds constructed by the k-Nearest Orthant-Neighb or algorithm are used as initial cluster centers for the fuzzy clustering. We use Bhattacharyya measure and Jensen inequality to find overlapped Gaussians and merge them to form a single cluster. We carried out experiments on a variety of datasets and show that the proposed algorithm has remarkable advantages especially for imbalanced and arbitrarily shaped data sets.
引用
收藏
页码:502 / 517
页数:16
相关论文
共 50 条
  • [1] An Optimized Initialization Center K-means Clustering Algorithm based on Density
    Yuan, Qilong
    Shi, Haibo
    Zhou, Xiaofeng
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2015, : 790 - 794
  • [2] An Initialization Method for Clustering Mixed Numeric and Categorical Data Based on the Density and Distance
    Ji, Jinchao
    Pang, Wei
    Zheng, Yanlin
    Wang, Zhe
    Ma, Zhiqiang
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2015, 29 (07)
  • [3] '1+1 > 2':: Merging distance and density based clustering
    Dash, M
    Liu, H
    Xu, XW
    [J]. SEVENTH INTERNATIONAL CONFERENCE ON DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2001, : 32 - 39
  • [4] An improved density peaks clustering algorithm based on natural neighbor with a merging strategy
    Ding, Shifei
    Du, Wei
    Xu, Xiao
    Shi, Tianhao
    Wang, Yanru
    Li, Chao
    [J]. INFORMATION SCIENCES, 2023, 624 : 252 - 276
  • [5] Scale-invariant density-based clustering initialization algorithm and its application
    Hua, Chunsheng
    Sagawa, Ryusuke
    Yagi, Yasushi
    [J]. 19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 3954 - 3957
  • [6] An Improved Initialization Center K-means Clustering Algorithm Based on Distance and Density
    Duan, Yanling
    Liu, Qun
    Xia, Shuyin
    [J]. ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS II, 2018, 1955
  • [7] POISSON DISTRIBUTION BASED INITIALIZATION FOR FUZZY CLUSTERING
    Vintr, Tomas
    Vintrova, Vanda
    Rezankova, Hana
    [J]. NEURAL NETWORK WORLD, 2012, 22 (02) : 139 - 159
  • [8] Density Peak Clustering Based on Cumulative Nearest Neighbors Degree and Micro Cluster Merging
    Xu, Lizhong
    Zhao, Jia
    Yao, Zhanfeng
    Shi, Aiye
    Chen, Zhe
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2019, 91 (10): : 1219 - 1236
  • [9] Density Peak Clustering Based on Cumulative Nearest Neighbors Degree and Micro Cluster Merging
    Lizhong Xu
    Jia Zhao
    Zhanfeng Yao
    Aiye Shi
    Zhe Chen
    [J]. Journal of Signal Processing Systems, 2019, 91 : 1219 - 1236
  • [10] ADAPTIVE CLUSTER CENTER INITIALIZATION USING DENSITY PEAKS FOR GEODESIC DISTANCE-BASED CLUSTERING
    Jahan, Sohana
    Setu, Afsana akter
    Muntaha, Sidratul
    Biswas, Tapan
    [J]. NUMERICAL ALGEBRA CONTROL AND OPTIMIZATION, 2024,