An efficient clustering method of data mining for high-dimensional data

被引:0
|
作者
Chang, JW [1 ]
Kang, HM [1 ]
机构
[1] Chonbuk Natl Univ, Dept Comp Engn, Chonju 561756, Chonbuk, South Korea
关键词
clustering method; high-dimensional; data mining; filtering-based index;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data mining applications have recently required a large amount of high-dimensional data. However, most clustering methods for the data miming applications do not work efficiently for dealing with large, high-dimensional data because of the so-called 'curse of dimensionality' [1] and the limitation of available memory. In this paper, we propose an efficient clustering method for handling a large of amount of high-dimensional data. Our clustering method provides an efficient cell creation algorithm using a space-partitioning technique and a cell insertion algorithm to construct clusters as cells with more density than a given threshold. To achieve good retrieval performance on clusters, we also propose a filtering-based index structure using an approximation technique. In addition, we compare the performance of our clustering method with the CLIQUE method in terms of cluster construction time, precision, and retrieval time. The experimental results show that our clustering method achieves better performance on cluster construction time and retrieval time.
引用
收藏
页码:273 / 278
页数:6
相关论文
共 50 条
  • [21] Approximated clustering of distributed high-dimensional data
    Kriegel, HP
    Kunath, P
    Pfeifle, M
    Renz, M
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 432 - 441
  • [22] Clustering High-Dimensional Noisy Categorical Data
    Tian, Zhiyi
    Xu, Jiaming
    Tang, Jen
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024,
  • [23] Subspace selection for clustering high-dimensional data
    Baumgartner, C
    Plant, C
    Kailing, K
    Kriegel, HP
    Kröger, P
    [J]. FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 11 - 18
  • [24] Clustering of imbalanced high-dimensional media data
    Brodinova, Sarka
    Zaharieva, Maia
    Filzmoser, Peter
    Ortner, Thomas
    Breiteneder, Christian
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2018, 12 (02) : 261 - 284
  • [25] The Role of Hubness in Clustering High-Dimensional Data
    Tomasev, Nenad
    Radovanovic, Milos
    Mladenic, Dunja
    Ivanovic, Mirjana
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (03) : 739 - 751
  • [26] Clustering of imbalanced high-dimensional media data
    Šárka Brodinová
    Maia Zaharieva
    Peter Filzmoser
    Thomas Ortner
    Christian Breiteneder
    [J]. Advances in Data Analysis and Classification, 2018, 12 : 261 - 284
  • [27] The Role of Hubness in Clustering High-Dimensional Data
    Tomasev, Nenad
    Radovanovic, Milos
    Mladenic, Dunja
    Ivanovic, Mirjana
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT I: 15TH PACIFIC-ASIA CONFERENCE, PAKDD 2011, 2011, 6634 : 183 - 195
  • [28] An effective clustering scheme for high-dimensional data
    Xuansen He
    Fan He
    Yueping Fan
    Lingmin Jiang
    Runzong Liu
    Allam Maalla
    [J]. Multimedia Tools and Applications, 2024, 83 : 45001 - 45045
  • [29] An Efficient Online Monitoring Method for High-Dimensional Data Streams
    Zou, Changliang
    Wang, Zhaojun
    Jiang, Wei
    Zi, Xuemin
    [J]. TECHNOMETRICS, 2015, 57 (03) : 374 - 387
  • [30] An Efficient and Versatile Variational Method for High-Dimensional Data Classification
    Cai, Xiaohao
    Chan, Raymond H.
    Xie, Xiaoyu
    Zeng, Tieyong
    [J]. JOURNAL OF SCIENTIFIC COMPUTING, 2024, 100 (03)