Geometric algorithms for density-based data clustering

被引:0
|
作者
Chen, DZ
Smid, M
Xu, B [1 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[2] Carleton Univ, Sch Comp Sci, Ottawa, ON K1S 5B6, Canada
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present new geometric approximation and exact algorithms for the density-based data clustering problem in d-dimensional space R-d (for any constant integer d greater than or equal to 2). Previously known algorithms for this problem are efficient only for uniformly-distributed points. However, these algorithms all run in theta(n(2)) time in the worst case, where n is the number of input points. Our approximation algorithm based on the e-fuzzy distance function takes 0(n log n) time for any given fixed value epsilon > 0, and our exact algorithms take sub-quadratic time. The running times and output quality of our algorithms do not depend on any particular data distribution. We believe that our fast approximation algorithm is of considerable practical importance, while our sub-quadratic exact algorithms are more of theoretical interest. We implemented our approximation algorithm and the experimental results show that our approximation algorithm is efficient on arbitrary input point sets.
引用
收藏
页码:284 / 296
页数:13
相关论文
共 50 条
  • [31] Density-based clustering for bivariate-flow data
    Shu, Hua
    Pei, Tao
    Song, Ci
    Chen, Jie
    Chen, Xiao
    Guo, Sihui
    Liu, Yaxi
    Wang, Xi
    Wang, Xuyang
    Zhou, Chenghu
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2022, 36 (09) : 1809 - 1829
  • [32] Density-based clustering for evolving uncertain data stream
    He, Haitao
    Zhao, Jintian
    Journal of Computational Information Systems, 2014, 10 (01): : 419 - 426
  • [33] Density-based clustering on massive mobile communication data
    Liu, YF
    Tang, SW
    Yang, DQ
    Chen, Y
    Wang, TJ
    Ma, S
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XI, PROCEEDINGS: COMMUNICATION, NETWORK AND CONTROL SYSTEMS, TECHNOLOGIES AND APPLICATIONS: II, 2003, : 251 - 254
  • [34] Hierarchical density-based clustering of categorical data and a simplification
    Andreopoulos, Bill
    An, Aijun
    Wang, Xiaogang
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, 4426 : 11 - +
  • [35] Density-based clustering for road accident data analysis
    Alotaibi, Abdullah S.
    INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2018, 5 (08): : 113 - 121
  • [36] Density-based clustering algorithm for mixture data sets
    Huang, De-Cai
    Wu, Tian-Hong
    Kongzhi yu Juece/Control and Decision, 2010, 25 (03): : 416 - 421
  • [37] Density-based clustering with non-continuous data
    Adelchi Azzalini
    Giovanna Menardi
    Computational Statistics, 2016, 31 : 771 - 798
  • [38] A Density-Based Clustering of Spatio-Temporal Data
    Zaghlool, Ehab
    ElKaffas, Saleh
    Saad, Amani
    NEW CONTRIBUTIONS IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 2, 2015, 354 : 41 - 50
  • [39] Density-based clustering with non-continuous data
    Azzalini, Adelchi
    Menardi, Giovanna
    COMPUTATIONAL STATISTICS, 2016, 31 (02) : 771 - 798
  • [40] Density-based data clustering algorithms for lower dimensions using space-filling curves
    Xu, Bin
    Chen, Danny Z.
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, 4426 : 997 - +