Autonomous Data Density based Clustering Method

被引:0
|
作者
Angelov, Plamen Y. [1 ]
Gu, Xiaowei [1 ]
Gutierrez, German [2 ]
Antonio Iglesias, Jose [2 ]
Sanchis, Araceli [2 ]
机构
[1] Univ Lancaster, Sch Comp & Commun, InfoLab21, Lancaster LA1 4WA, England
[2] Carlos III Univ Madrid, Comp Sci Dept, Madrid, Spain
关键词
fully autonomous clustering; data density; mutual distribution; data analytics;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is well known that clustering is an unsupervised machine learning technique. However, most of the clustering methods need setting several parameters such as number of clusters, shape of clusters, or other user-or problem-specific parameters and thresholds. In this paper, we propose a new clustering approach which is fully autonomous, in the sense that it does not require parameters to be pre-defined. This approach is based on data density automatically derived from their mutual distribution in the data space. It is called ADD clustering (Autonomous Data Density based clustering). It is entirely based on the experimentally observable data and is free from restrictive prior assumptions. This new method exhibits highly accurate clustering performance. Its performance is compared on benchmarked data sets with other competitive alternative approaches. Experimental results demonstrate that ADD clustering significantly outperforms other clustering methods yet does not require restrictive user-or problem-specific parameters or assumptions. The new clustering method is a solid basis for further applications in the field of data analytics.
引用
收藏
页码:2405 / 2413
页数:9
相关论文
共 50 条
  • [41] Sample Density Clustering Method Considering Unbalanced Data Distribution
    Wang, Changhui
    MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [42] Share density-based clustering of income data
    Condino, Francesca
    STATISTICAL ANALYSIS AND DATA MINING, 2023, 16 (04) : 336 - 347
  • [43] Geometric algorithms for density-based data clustering
    Chen, DZ
    Smid, M
    Xu, B
    INTERNATIONAL JOURNAL OF COMPUTATIONAL GEOMETRY & APPLICATIONS, 2005, 15 (03) : 239 - 260
  • [44] Big data clustering with varied density based on MapReduce
    Heidari, Safanaz
    Alborzi, Mahmood
    Radfar, Reza
    Afsharkazemi, Mohammad Ali
    Ghatari, Ali Rajabzadeh
    JOURNAL OF BIG DATA, 2019, 6 (01)
  • [45] Hierarchical density-based clustering of uncertain data
    Kriegel, HP
    Pfeifle, M
    Fifth IEEE International Conference on Data Mining, Proceedings, 2005, : 689 - 692
  • [46] Stream Data Clustering Based on Grid Density and Attraction
    Tu, Li
    Chen, Yixin
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2009, 3 (03)
  • [47] Hyperspectral data clustering based on density analysis ensemble
    Chen, Yushi
    Ma, Shunli
    Chen, Xi
    Ghamisi, Pedram
    REMOTE SENSING LETTERS, 2017, 8 (02) : 194 - 203
  • [48] An Efficient Density-Based Algorithm for Data Clustering
    Theljani, Foued
    Laabidi, Kaouther
    Zidi, Salah
    Ksouri, Moufida
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2017, 26 (04)
  • [49] Application of Density Based Clustering to Microarray Data Analysis
    Raczynski, Lech
    Wozniak, Krzysztof
    Rubel, Tymon
    Zaremba, Krzysztof
    INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2010, 56 (03) : 281 - 286
  • [50] Anytime density-based clustering of complex data
    Son T. Mai
    Xiao He
    Jing Feng
    Claudia Plant
    Christian Böhm
    Knowledge and Information Systems, 2015, 45 : 319 - 355