An Efficient Grid-based Clustering Method by Finding Density Peaks

被引:0
|
作者
Wu, Bo [1 ]
Wilamowski, B. M. [1 ]
机构
[1] Auburn Univ, Dept Elect & Comp Engn, Auburn, AL 36849 USA
关键词
clustering; grid; density peaks; efficiency; SEARCH;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Clustering or categorizing an unprocessed data set is essential and critical in many areas. Much success has been published, which first needs to calculate the mutual distances between data points. It suffers from considerable computational costs, preventing the state-of-the-art methods such as the clustering method by fast search and find of density peaks (FSFDP, published in Science, 2014) from applying into real life (e.g., with thousands of data points). In this paper, an efficient grid-based clustering (GBC) method by finding density peaks is described. It keeps the advantage of the friendly interactive interface in the FSFDP, at the mean time, decreases enormously the computation complexity. The time complexity of the FSFDP is o(np(np 1)/2) while our method decreases it to o(np * sizeof (grid)), where np is the number of data points and the size of grid is always much smaller than np so that the time complexity of our approach is almost linearly proportional to np. The presented GBC method by finding density peaks was able to calculate the densities and categorize datasets within much less time, which makes the density-peak-based algorithm practical. By using the presented algorithm, it was possible to cluster high dimensional data sets as well. The GBC method by finding density peaks was successfully verified in clustering several datasets, which are commonly used to test clustering algorithms in published articles. It turned out that the presented method is much faster and efficient in clustering datasets into different categories than the conventional density-based ones, which makes the proposed method more preferable.
引用
收藏
页码:837 / 842
页数:6
相关论文
共 50 条
  • [41] An unsupervised grid-based approach for clustering analysis
    Yue ShiHong
    Wang JeenShing
    Tao Gao
    Wang HuaXiang
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2010, 53 (07) : 1345 - 1357
  • [42] Clustering ensemble based on density peaks
    Chu R.-H.
    Wang H.-J.
    Yang Y.
    Li T.-R.
    [J]. Wang, Hong-Jun (wanghongjun@swjtu.edu.cn), 1600, Science Press (42): : 1401 - 1412
  • [43] PGMCLU: A Novel Parallel Grid-based Clustering Algorithm for Multi-density Datasets
    Chen Xiaoyun
    Chen Yi
    Qi Xiaoli
    Yue Min
    He Yanshan
    [J]. 2009 1ST IEEE SYMPOSIUM ON WEB SOCIETY, PROCEEDINGS, 2009, : 166 - 171
  • [44] A grid-based clustering method for large-scale wireless sensor networks
    Yan, Bin
    Zhou, Xiaojiao
    Wang, Houjun
    Li, Benliang
    [J]. 2007 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1 AND 2: VOL 1: COMMUNICATION THEORY AND SYSTEMS; VOL 2: SIGNAL PROCESSING, COMPUTATIONAL INTELLIGENCE, CIRCUITS AND SYSTEMS, 2007, : 414 - +
  • [45] A Fast Density-Grid Based Clustering Method
    Brown, Daniel
    Japa, Arialdis
    Shi, Yong
    [J]. 2019 IEEE 9TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2019, : 48 - 54
  • [46] An improved density peaks method for data clustering
    Lotfi, Abdulrahman
    Seyedi, Seyed Amjad
    Moradi, Parham
    [J]. 2016 6TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2016, : 263 - 268
  • [47] A Research about grid-based spatial clustering method on regional data analysis
    Zhang, Yu-Wei
    Wan, Lu-He
    [J]. Journal of Harbin Institute of Technology (New Series), 2011, 18 (SUPPL. 1) : 171 - 175
  • [48] A Novel Oversampling Method for Imbalanced Datasets Based on Density Peaks Clustering
    Cao, Jie
    Shi, Yong
    [J]. TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2021, 28 (06): : 1813 - 1819
  • [49] Community detection method based on vertex distance and clustering of density peaks
    Huang L.
    Li Y.
    Wang G.-S.
    Wang Y.
    [J]. Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2016, 46 (06): : 2042 - 2051
  • [50] EDDPC: An efficient distributed density peaks clustering algorithm
    Gong S.
    Zhang Y.
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2016, 53 (06): : 1400 - 1409