A fast consistent grid-based clustering algorithm

被引:0
|
作者
Anton S. Tarasenko [1 ]
Vladimir B. Berikov [2 ]
Igor A. Pestunov [1 ]
Sergey A. Rylov [2 ]
Pavel S. Ruzankin [3 ]
机构
[1] Sobolev Institute of Mathematics,
[2] Novosibirsk State University,undefined
[3] Federal Research Center for Information and Computational Technologies,undefined
关键词
Clustering; Estimator for the number of clusters; Density level sets; Big data;
D O I
10.1007/s10044-024-01354-0
中图分类号
学科分类号
摘要
We propose a fast consistent grid-based algorithm that estimates the number of clusters for observations in \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathbb {R}}}^d$$\end{document} and, besides, constructs an approximation for the clusters. Consistency is proved under certain conditions. The time complexity of the algorithm can be made linear retaining the consistency. Numerical experiments confirm high computational efficiency of the new algorithm and its ability to process large datasets.
引用
收藏
相关论文
共 50 条
  • [21] LILA: A Connected Components Labeling Algorithm in Grid-Based Clustering
    Jiang, Tao
    Qiu, Ming
    Chen, Jie
    Cao, Xue
    FIRST INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS, PROCEEDINGS, 2009, : 213 - 216
  • [22] Research on application of grid-based and density-based clustering algorithm
    Shen, LX
    Yan, C
    PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING, VOLS I AND II, 2003, : 684 - 689
  • [23] Grid-based clustering algorithm based on intersecting partition and density estimation
    Qiu, Bao-Zhi
    Li, Xiang-Li
    Shen, Jun-Yi
    EMERGING TECHNOLOGIES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2007, 4819 : 368 - +
  • [24] A Extended Grid-based Clustering Algorithm with Referential Value of Parameters
    Zhou, Yantao
    Wu, Zhengguo
    Yi, Xingdong
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE FOR YOUNG COMPUTER SCIENTISTS, VOLS 1-5, 2008, : 1832 - +
  • [25] Flexible grid-based clustering
    Akodjenou-Jeannin, Marc-Ismael
    Salamatian, Kave
    Gallinarl, Patrick
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2007, PROCEEDINGS, 2007, 4702 : 350 - +
  • [26] Grid-based C-means Clustering Algorithm for Image Segmentation
    Yue, Shihong
    Li, YueFeng
    He, Boyang
    2011 INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND NEURAL COMPUTING (FSNC 2011), VOL I, 2011, : 58 - 61
  • [27] Scalable grid-based clustering algorithm for very large spatial databases
    Sun, Yufen
    Lu, Yansheng
    2006 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PTS 1 AND 2, PROCEEDINGS, 2006, : 763 - 768
  • [28] A grid-based clustering algorithm for high-dimensional data streams
    Lu, YS
    Sun, YF
    Xu, GP
    Liu, G
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 824 - 831
  • [29] Grid-based Hierarchical Spatial Clustering Algorithm in Presence of Obstacle and Constraints
    Yang, Yue
    Zhang, Jian-pei
    Yang, Jing
    ICICSE: 2008 INTERNATIONAL CONFERENCE ON INTERNET COMPUTING IN SCIENCE AND ENGINEERING, PROCEEDINGS, 2008, : 383 - 388
  • [30] IBUSCA: A grid-based bottom-up subspace clustering algorithm
    Glomba, Michal
    Markowska-Kaczmar, Urszula
    ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 1, 2006, : 671 - 676