TOBAE: A Density-based Agglomerative Clustering Algorithm

被引:0
|
作者
Shehzad Khalid
Shahid Razzaq
机构
[1] Bahria University,Department of Computer Engineering
来源
Journal of Classification | 2015年 / 32卷
关键词
Clustering; Agglomerative; Density distribution; Automatic; Noise removal; Non-parametric; Filtering; Terrain; Water puddles; Density threshold.;
D O I
暂无
中图分类号
学科分类号
摘要
This paper presents a novel density based agglomerative clustering algorithm named TOBAE which is a parameter-less algorithm and automatically filters noise. It finds the appropriate number of clusters while giving a competitive running time. TOBAE works by tracking the cumulative density distribution of the data points on a grid and only requires the original data set as input. The clustering problem is solved by automatically finding the optimal density threshold for the clusters. It is applicable to any N-dimensional data set which makes it highly relevant for real world scenarios. The algorithm outperforms state of the art clustering algorithms by the additional feature of automatic noise filtration around clusters. The concept behind the algorithm is explained using the analogy of puddles (’tobae’), which the algorithm is inspired from. This paper provides a detailed algorithm for TOBAE along with the complexity analysis for both time and space. We show experimental results against known data sets and show how TOBAE competes with the best algorithms in the field while providing its own set of advantages.
引用
收藏
页码:241 / 267
页数:26
相关论文
共 50 条
  • [1] TOBAE: A Density-based Agglomerative Clustering Algorithm
    Khalid, Shehzad
    Razzaq, Shahid
    [J]. JOURNAL OF CLASSIFICATION, 2015, 32 (02) : 241 - 267
  • [2] A varied density-based clustering algorithm
    Fahim, Ahmed
    [J]. JOURNAL OF COMPUTATIONAL SCIENCE, 2023, 66
  • [3] The Density-Based Agglomerative Information Bottleneck
    Ren, Yongli
    Ye, Yangdong
    Li, Gang
    [J]. PRICAI 2008: TRENDS IN ARTIFICIAL INTELLIGENCE, 2008, 5351 : 333 - +
  • [4] An Efficient Density-Based Algorithm for Data Clustering
    Theljani, Foued
    Laabidi, Kaouther
    Zidi, Salah
    Ksouri, Moufida
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2017, 26 (04)
  • [5] ADCN: An Anisotropic Density-Based Clustering Algorithm
    Mai, Gengchen
    Janowicz, Krzysztof
    Hu, Yingjie
    Gao, Song
    [J]. 24TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2016), 2016,
  • [6] GrDBSCAN: A Granular Density-Based Clustering Algorithm
    Suchy, Dawid
    Siminski, Krzysztof
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2023, 33 (02) : 297 - 312
  • [7] EFFICIENT DENSITY-BASED PARTITIONAL CLUSTERING ALGORITHM
    Alamgir, Zareen
    Naveed, Hina
    [J]. COMPUTING AND INFORMATICS, 2021, 40 (06) : 1322 - 1344
  • [8] A Fuzzy Density-based Incremental Clustering Algorithm
    Laohakiat, Sirisup
    Ratanajaipan, Photchanan
    Navaravong, Leenhapat
    Ungrangsi, Rachanee
    Maleewong, Krissada
    [J]. 2018 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE), 2018, : 211 - 215
  • [9] Incremental grid density-based clustering algorithm
    Chen, Ning
    Chen, An
    Zhou, Long-Xiang
    [J]. Ruan Jian Xue Bao/Journal of Software, 2002, 13 (01): : 1 - 7
  • [10] A Density-Based Clustering Algorithm with Educational Applications
    Wang, Zitong
    Kang, Peng
    Wu, Zewei
    Rao, Yanghui
    Wang, Fu Lee
    [J]. CURRENT DEVELOPMENTS IN WEB BASED LEARNING, ICWL 2015, 2016, 9584 : 118 - 127