A fast consistent grid-based clustering algorithm

Authors
Anton S. Tarasenko [1]
Vladimir B. Berikov [2]
Igor A. Pestunov [1]
Sergey A. Rylov [2]
Pavel S. Ruzankin [3]
Affiliations
[1] Sobolev Institute of Mathematics
[2] Novosibirsk State University
[3] Federal Research Center for Information and Computational Technologies
Keywords
Clustering; Estimator for the number of clusters; Density level sets; Big data
DOI
10.1007/s10044-024-01354-0
Abstract
We propose a fast, consistent grid-based algorithm that estimates the number of clusters for observations in $\mathbb{R}^d$ and also constructs an approximation of the clusters. Consistency is proved under certain conditions. The time complexity of the algorithm can be made linear while retaining consistency. Numerical experiments confirm the high computational efficiency of the new algorithm and its ability to process large datasets.
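The abstract does not describe the procedure itself. As a rough illustration of the general grid-based, density-level-set idea only (not the authors' algorithm, and with no consistency guarantee), the sketch below bins points into cubic cells, keeps cells whose count reaches a threshold, and counts connected groups of such cells. The function name `grid_cluster_count` and the parameters `cell_size` and `min_count` are hypothetical names chosen for this example.

```python
from collections import deque
from itertools import product
import math


def grid_cluster_count(points, cell_size, min_count):
    """Estimate the number of clusters as the number of connected
    components of "dense" grid cells (illustrative sketch only)."""
    # Bin every point into a cubic cell of side cell_size: one pass over the data.
    counts = {}
    for p in points:
        key = tuple(math.floor(x / cell_size) for x in p)
        counts[key] = counts.get(key, 0) + 1

    # A cell is "dense" if it holds at least min_count points
    # (a crude stand-in for a density level set).
    dense = {k for k, c in counts.items() if c >= min_count}
    if not dense:
        return 0

    # Neighbour offsets: all cells sharing a face, edge, or corner.
    # Assumption: d is small, since there are 3**d - 1 offsets.
    d = len(next(iter(dense)))
    offsets = [o for o in product((-1, 0, 1), repeat=d) if any(o)]

    # Breadth-first search over dense cells to count connected components.
    seen = set()
    clusters = 0
    for start in dense:
        if start in seen:
            continue
        clusters += 1
        seen.add(start)
        queue = deque([start])
        while queue:
            cell = queue.popleft()
            for o in offsets:
                nb = tuple(c + s for c, s in zip(cell, o))
                if nb in dense and nb not in seen:
                    seen.add(nb)
                    queue.append(nb)
    return clusters
```

For a fixed dimension, binning takes one pass over the points and the component search visits each dense cell once, so the running time of this sketch grows linearly with the number of observations. The result depends heavily on the choice of `cell_size` and `min_count`, which this example leaves to the caller.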