An Efficient Density Based Incremental Clustering Algorithm in Data Warehousing Environment

被引:0
|
作者
Goyal, Navneet [1 ]
Goyal, Poonam [1 ]
Venkatramaiah, K. [1 ]
Deepak, P. C. [1 ]
Sanoop, P. S. [1 ]
机构
[1] BITS, Dept Comp Sci & Informat Syst, Pilani 333031, Rajasthan, India
关键词
Incremental clustering; DBSCAN; Incremental DBSCAN;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Data Warehouses are a good source of data for downstream data mining applications. New data arrives in data warehouses during the periodic refresh cycles. Appending of data on existing data requires that all patterns discovered earlier using various data mining algorithms are updated with each refresh. In this paper, we present an incremental density based clustering algorithm. Incremental DBSCAN is an existing incremental algorithm in which data can be added/deleted to/from existing clusters, one point at a time. Our algorithm is capable of adding points in bulk to existing set of clusters. In this new algorithm, the data points to be added are first clustered using the DBSCAN algorithm and then these new clusters are merged with existing clusters, to come up with the modified set of clusters. That is, we add the clusters incrementally rather than adding points incrementally. It is found that the proposed incremental clustering algorithm produces the same clusters as obtained by Incremental DBSCAN. We have used R*-trees as the data structure to hold the multidimensional data that we need to cluster. One of the major advantages of the proposed approach is that it allows us to see the clustering patterns of the new data along with the existing clustering patterns. Moreover, we can see the merged clusters as well. The proposed algorithm is capable of considerable savings, in terms of region queries performed, as compared to incremental DBSCAN. Results are presented to support the claim.
引用
收藏
页码:556 / 560
页数:5
相关论文
共 50 条
  • [1] Efficient incremental density-based algorithm for clustering large datasets
    Bakr, Ahmad M.
    Ghanem, Nagia M.
    Ismail, Mohamed A.
    [J]. ALEXANDRIA ENGINEERING JOURNAL, 2015, 54 (04) : 1147 - 1154
  • [2] An efficient automated incremental density-based algorithm for clustering and classification
    Azhir, Elham
    Navimipour, Nima Jafari
    Hosseinzadeh, Mehdi
    Sharifi, Arash
    Darwesh, Aso
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 114 : 665 - 678
  • [3] Incremental Load in a Data Warehousing Environment
    Rahman, Nayem
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2010, 6 (03) : 1 - 16
  • [4] An Efficient Density-Based Algorithm for Data Clustering
    Theljani, Foued
    Laabidi, Kaouther
    Zidi, Salah
    Ksouri, Moufida
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2017, 26 (04)
  • [5] Incremental generalization for mining in a data warehousing environment
    Ester, M
    Wittmann, R
    [J]. ADVANCES IN DATABASE TECHNOLOGY - EDBT'98, 1998, 1377 : 135 - 149
  • [6] A Fuzzy Density-based Incremental Clustering Algorithm
    Laohakiat, Sirisup
    Ratanajaipan, Photchanan
    Navaravong, Leenhapat
    Ungrangsi, Rachanee
    Maleewong, Krissada
    [J]. 2018 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE), 2018, : 211 - 215
  • [7] A fast incremental clustering algorithm based on grid and density
    Chen Zhuo
    Liu Xiang-shuang
    Zhuang Xiao-dong
    [J]. ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 5, PROCEEDINGS, 2007, : 207 - +
  • [8] An Efficient And Scalable Density-Based Clustering Algorithm For Normalize Data
    Nidhi
    Patel, Km Archana
    [J]. 2ND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, COMMUNICATION & CONVERGENCE, ICCC 2016, 2016, 92 : 136 - 141
  • [9] Data Incremental Clustering Algorithm based on Differential Privacy
    Gao, Qing
    Wang, Xiujun
    Gao, Yan
    Tao, Tao
    [J]. 2023 IEEE 9TH WORLD FORUM ON INTERNET OF THINGS, WF-IOT, 2023,
  • [10] An Artificial Immune Based Incremental Data Clustering Algorithm
    Xiao, Xin
    [J]. ADVANCES IN CIVIL ENGINEERING II, PTS 1-4, 2013, 256-259 : 2935 - 2938