GDPC: generalized density peaks clustering algorithm based on order similarity

被引:8
|
作者
Yang, Xiaofei [1 ,2 ]
Cai, Zhiling [1 ]
Li, Ruijia [1 ]
Zhu, William [1 ]
机构
[1] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu, Peoples R China
[2] Xian Polytech Univ, Sch Sci, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering; Order similarity; Density; Density peak; Graph; K-NEAREST NEIGHBORS; FAST SEARCH; FIND;
D O I
10.1007/s13042-020-01198-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering is a fundamental approach to discover the valuable information in data mining and machine learning. Density peaks clustering is a typical density based clustering and has received increasing attention in recent years. However DPC and most of its improvements still suffer from some drawbacks. For example, it is difficult to find peaks in the sparse cluster regions; assignment for the remaining points tends to cause Domino effect, especially for complicated data. To address the above two problems, we propose generalized density peaks clustering algorithm (GDPC) based on a new order similarity, which is calculated by the order rank of Euclidean distance between two samples. The order similarity can help us to find peaks in the sparse regions. In addition, a two-step assignment is used to weaken Domino effect. In general, GDPC can not only discover clusters in datasets regardless of different sizes, dimensions and shapes, but also address the above two issues. Several experiments on datasets, including Lung, COIL20, ORL, USPS, Mnist, breast and Vote, show that our algorithm is effective in most cases.
引用
收藏
页码:719 / 731
页数:13
相关论文
共 50 条
  • [41] A Novel Hierarchical Clustering Algorithm Based on Density Peaks for Complex Datasets
    Zhou, Rong
    Zhang, Yong
    Feng, Shengzhong
    Luktarhan, Nurbol
    COMPLEXITY, 2018,
  • [42] A Fast Density Peaks Clustering Algorithm Based on Pre-screening
    Xu, Xiao
    Ding, Shifei
    Sun, Tongfeng
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 513 - 516
  • [43] A trainable clustering algorithm based on shortest paths from density peaks
    Pizzagalli, Diego Ulisse
    Gonzalez, Santiago F.
    Krause, Rolf
    SCIENCE ADVANCES, 2019, 5 (10)
  • [44] Density peaks clustering algorithm based on kernel density estimation and minimum spanning tree
    Fan T.
    Li X.
    Hou J.
    Liu B.
    Kang P.
    International Journal of Innovative Computing and Applications, 2022, 13 (5-6) : 336 - 350
  • [45] Clustering ensemble based on density peaks
    Chu R.-H.
    Wang H.-J.
    Yang Y.
    Li T.-R.
    Wang, Hong-Jun (wanghongjun@swjtu.edu.cn), 1600, Science Press (42): : 1401 - 1412
  • [46] An improved density peaks clustering algorithm using similarity assignment strategy with K-nearest neighbors
    Hu, Wei
    Feng, Ji
    Yang, Degang
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (09): : 12689 - 12706
  • [47] A Multi-Density Clustering Algorithm Based on Similarity for Dataset With Density Variation
    Zhou, Xingxing
    Zhang, Haiping
    Ji, Genlin
    Tang, Guoan
    IEEE ACCESS, 2019, 7 : 186004 - 186016
  • [48] A Clustering Algorithm for Varied Density Clusters based on Similarity of Local Density of Objects
    Fahim, Ahmed
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020), 2020, : 26 - 31
  • [49] A clustering algorithm based on generalized similarity for co-regulated genes
    Key Laboratory of Medical Image Computing, Ministry of Education, Northeastern University, Shenyang 110004, China
    不详
    不详
    Dongbei Daxue Xuebao, 2009, 11 (1558-1561):
  • [50] An Algorithm of Clustering by Density Peaks Using in Anomaly Detection
    Yin, Chunyong
    Zhang, Sun
    Yin, Zhichao
    Wang, Jin
    INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2015, 9 (12): : 115 - 127