On cluster tree for nested and multi-density data clustering

被引:20
|
作者
Li, Xutao [2 ]
Ye, Yunming [2 ]
Li, Mark Junjie [3 ]
Ng, Michael K. [1 ]
机构
[1] Hong Kong Baptist Univ, Dept Math, Kowloon Tong, Hong Kong, Peoples R China
[2] Harbin Inst Technol, Shenzhen Grad Sch, Dept Comp Sci, Harbin, Peoples R China
[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, Ctr High Performance Comp Technol, Inst Adv Comp & Digital Engn, Shenzhen, Peoples R China
关键词
Hierarchical clustering; Multi-densities; Cluster tree; k-Means-type algorithm; ALGORITHM; SELECTION; NUMBER; MODEL;
D O I
10.1016/j.patcog.2010.03.020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering is one of the important data mining tasks. Nested clusters or clusters of multi-density are very prevalent in data sets. In this paper, we develop a hierarchical clustering approach a cluster tree to determine such cluster structure and understand hidden information present in data sets of nested clusters or clusters of multi-density. We embed the agglomerative k-means algorithm in the generation of cluster tree to detect such clusters. Experimental results on both synthetic data sets and real data sets are presented to illustrate the effectiveness of the proposed method. Compared with some existing clustering algorithms (DBSCAN, X-means, BIRCH, CURE, NBC, OPTICS, Neural Gas. Tree-SOM, EnDBSAN and LDBSCAN), our proposed cluster tree approach performs better than these methods. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3130 / 3143
页数:14
相关论文
共 50 条
  • [31] A efficient clustering algorithm for 2D multi-density dataset in large database
    Xia, Ying
    Wang, GuoYin
    Gao, Song
    MUE: 2007 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND UBIQUITOUS ENGINEERING, PROCEEDINGS, 2007, : 78 - +
  • [32] PGMCLU: A Novel Parallel Grid-based Clustering Algorithm for Multi-density Datasets
    Chen Xiaoyun
    Chen Yi
    Qi Xiaoli
    Yue Min
    He Yanshan
    2009 1ST IEEE SYMPOSIUM ON WEB SOCIETY, PROCEEDINGS, 2009, : 166 - 171
  • [33] Finding and Tracking Multi-Density Clusters in Online Dynamic Data Streams
    Fahy, Conor
    Yang, Shengxiang
    IEEE TRANSACTIONS ON BIG DATA, 2022, 8 (01) : 178 - 192
  • [34] DEMOS: Clustering by Pruning a Density-Boosting Cluster Tree of Density Mounts
    Guan, Junyi
    Li, Sheng
    Chen, Xiaojun
    He, Xiongxiong
    Chen, Jiajia
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (10) : 10814 - 10830
  • [35] MDBSCAN: A multi-density DBSCAN based on relative density
    Qian, Jiaxin
    Zhou, You
    Han, Xuming
    Wang, Yizhang
    NEUROCOMPUTING, 2024, 576
  • [36] Multi-Density Datasets Clustering Using K-Nearest Neighbors and Chebyshev’s Inequality
    Bouchemal A.
    Kimour M.T.
    Informatica (Slovenia), 2023, 47 (08): : 161 - 168
  • [37] An approach based on maximal cliques and multi-density clustering for regional co-location pattern mining
    Wang, Dongsheng
    Wang, Lizhen
    Wang, Xiaoxu
    Tran, Vanha
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 248
  • [38] Multi-density DBSCAN algorithm based on Density Levels Partitioning
    Chen, R. (crt_310@163.com), 1600, Binary Information Press, Flat F 8th Floor, Block 3, Tanner Garden, 18 Tanner Road, Hong Kong (09):
  • [39] GDCIC: A grid-based density-confidence-interval clustering algorithm for multi-density dataset in large spatial database
    Gao, Song
    Xia, Ying
    ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 1, 2006, : 713 - 717
  • [40] Discovering Multi-density Urban Hotspots in a Smart City
    Cesario, Eugenio
    Uchubilo, Paschal, I
    Vinci, Andrea
    Zhu, Xiaotian
    2020 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP), 2020, : 332 - 337