An Efficient Grid-based Clustering Method by Finding Density Peaks

被引:0
|
作者
Wu, Bo [1 ]
Wilamowski, B. M. [1 ]
机构
[1] Auburn Univ, Dept Elect & Comp Engn, Auburn, AL 36849 USA
关键词
clustering; grid; density peaks; efficiency; SEARCH;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Clustering or categorizing an unprocessed data set is essential and critical in many areas. Much success has been published, which first needs to calculate the mutual distances between data points. It suffers from considerable computational costs, preventing the state-of-the-art methods such as the clustering method by fast search and find of density peaks (FSFDP, published in Science, 2014) from applying into real life (e.g., with thousands of data points). In this paper, an efficient grid-based clustering (GBC) method by finding density peaks is described. It keeps the advantage of the friendly interactive interface in the FSFDP, at the mean time, decreases enormously the computation complexity. The time complexity of the FSFDP is o(np(np 1)/2) while our method decreases it to o(np * sizeof (grid)), where np is the number of data points and the size of grid is always much smaller than np so that the time complexity of our approach is almost linearly proportional to np. The presented GBC method by finding density peaks was able to calculate the densities and categorize datasets within much less time, which makes the density-peak-based algorithm practical. By using the presented algorithm, it was possible to cluster high dimensional data sets as well. The GBC method by finding density peaks was successfully verified in clustering several datasets, which are commonly used to test clustering algorithms in published articles. It turned out that the presented method is much faster and efficient in clustering datasets into different categories than the conventional density-based ones, which makes the proposed method more preferable.
引用
收藏
页码:837 / 842
页数:6
相关论文
共 50 条
  • [31] Parallel grid-based density peak clustering of big trajectory data
    Niu, Xinzheng
    Zheng, Yunhong
    Fournier-Viger, Philippe
    Wang, Bing
    [J]. APPLIED INTELLIGENCE, 2022, 52 (15) : 17042 - 17057
  • [32] Grid-based spectral fiber clustering
    Klein, Jan
    Bittihn, Philip
    Ledochowitsch, Peter
    Hahn, Horst K.
    Konrad, Olaf
    Rexilius, Jan
    Peitgen, Heinz-Otto
    [J]. MEDICAL IMAGING 2007: VISUALIZATION AND IMAGE-GUIDED PROCEDURES, PTS 1 AND 2, 2007, 6509
  • [33] A NOVEL GRID-BASED CLUSTERING ALGORITHM
    Starczewski, Artur
    Scherer, Magdalena M.
    Ksiazek, Wojciech
    Debski, Maciej
    Wang, Lipo
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2021, 11 (04) : 319 - 330
  • [34] Grid-based dynamic clustering with grid proximity measure
    Lee, Gun Ho
    [J]. INTELLIGENT DATA ANALYSIS, 2016, 20 (04) : 853 - 875
  • [35] Supersampling method for efficient grid-based electronic structure calculations
    Ryu, Seongok
    Choi, Sunghwan
    Hong, Kwangwoo
    Kim, Woo Youn
    [J]. JOURNAL OF CHEMICAL PHYSICS, 2016, 144 (09):
  • [36] Grid-based improving clustering quality algorithm
    School of Electronics and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China
    不详
    [J]. Jisuanji Gongcheng, 2006, 3 (12-13+98):
  • [37] An unsupervised grid-based approach for clustering analysis
    YUE ShiHong1
    2Department of Electrical Engineering
    [J]. Science China(Information Sciences), 2010, 53 (07) : 1345 - 1357
  • [38] Grid-Based Clustering Using Boundary Detection
    Du, Mingjing
    Wu, Fuyu
    [J]. ENTROPY, 2022, 24 (11)
  • [39] An unsupervised grid-based approach for clustering analysis
    ShiHong Yue
    JeenShing Wang
    Gao Tao
    HuaXiang Wang
    [J]. Science China Information Sciences, 2010, 53 : 1345 - 1357
  • [40] A deflected grid-based algorithm for clustering analysis
    Department of Computer Science and Information Engineering, Tamkang University, 151 Ying-Chuan Road, Tamsui, Taipei County, Taiwan
    [J]. WSEAS Trans. Comput., 2008, 3 (125-132):