An Efficient Density Based Incremental Clustering Algorithm in Data Warehousing Environment

被引:0
|
作者
Goyal, Navneet [1 ]
Goyal, Poonam [1 ]
Venkatramaiah, K. [1 ]
Deepak, P. C. [1 ]
Sanoop, P. S. [1 ]
机构
[1] BITS, Dept Comp Sci & Informat Syst, Pilani 333031, Rajasthan, India
关键词
Incremental clustering; DBSCAN; Incremental DBSCAN;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Data Warehouses are a good source of data for downstream data mining applications. New data arrives in data warehouses during the periodic refresh cycles. Appending of data on existing data requires that all patterns discovered earlier using various data mining algorithms are updated with each refresh. In this paper, we present an incremental density based clustering algorithm. Incremental DBSCAN is an existing incremental algorithm in which data can be added/deleted to/from existing clusters, one point at a time. Our algorithm is capable of adding points in bulk to existing set of clusters. In this new algorithm, the data points to be added are first clustered using the DBSCAN algorithm and then these new clusters are merged with existing clusters, to come up with the modified set of clusters. That is, we add the clusters incrementally rather than adding points incrementally. It is found that the proposed incremental clustering algorithm produces the same clusters as obtained by Incremental DBSCAN. We have used R*-trees as the data structure to hold the multidimensional data that we need to cluster. One of the major advantages of the proposed approach is that it allows us to see the clustering patterns of the new data along with the existing clustering patterns. Moreover, we can see the merged clusters as well. The proposed algorithm is capable of considerable savings, in terms of region queries performed, as compared to incremental DBSCAN. Results are presented to support the claim.
引用
收藏
页码:556 / 560
页数:5
相关论文
共 50 条
  • [21] EFFICIENT DENSITY-BASED PARTITIONAL CLUSTERING ALGORITHM
    Alamgir, Zareen
    Naveed, Hina
    [J]. COMPUTING AND INFORMATICS, 2021, 40 (06) : 1322 - 1344
  • [22] MIDBSCAN: An Efficient Density-Based Clustering Algorithm
    Tsai, Cheng-Fa
    Sung, Chun-Yi
    [J]. SIXTH INTERNATIONAL SYMPOSIUM ON NEURAL NETWORKS (ISNN 2009), 2009, 56 : 469 - 479
  • [23] Incremental clustering algorithm of data stream based on artificial immune network
    Yue, Xun
    Chi, Zhongxian
    Hao, Yanyou
    Mo, Hongwei
    Yue, Xun
    [J]. WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 4021 - +
  • [24] An incremental data stream clustering algorithm based on dense units detection
    Gao, J
    Li, JZ
    Zhang, ZG
    Tan, PN
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 420 - 425
  • [25] Incremental Clustering for Time Series Data based on an Improved Leader Algorithm
    Huynh Thi Thu Thuy
    Duong Tuan Anh
    Vo Thi Ngoc Chau
    [J]. 2019 IEEE - RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF), 2019, : 13 - 18
  • [26] A SOM based Incremental Clustering Algorithm
    Lei Chen
    Zhao, Bao-Jin
    Zhao, Li-Na
    [J]. JOURNAL OF COMPUTERS, 2014, 9 (03) : 601 - 607
  • [27] Efficient incremental subspace clustering in data streams
    Kontaki, Maria
    Papadopoulos, Apostolos N.
    Manolopoulos, Yannis
    [J]. 10TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2006, : 53 - 60
  • [28] THE CLUSTERING ALGORITHM OF EVOLUTIONAL DATA STREAM BASED ON DENSITY
    Meng, Yuyu
    Zheng, Liying
    [J]. 3RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE (ITCS 2011), PROCEEDINGS, 2011, : 473 - 477
  • [29] Density Based Clustering Technique For Efficient Data Mining
    Rahman, Md Asikur
    Chowdhury, A. K. M. Rasheduzzaman
    Rahman, Daud Md Jamilur
    Kamal, Abu Raihan Mostofa
    [J]. 2008 11TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY: ICCIT 2008, VOLS 1 AND 2, 2008, : 706 - 710
  • [30] Clustering Algorithm Based on Grid and Density for Data Stream
    Wang, Lang
    Li, Haiqing
    [J]. MATERIALS SCIENCE, ENERGY TECHNOLOGY, AND POWER ENGINEERING I, 2017, 1839