Clustering Based on Correlation Fractal Dimension Over an Evolving Data Stream

被引:0
|
作者
Yarlagadda, Anuradha [1 ]
Jonnalagedda, Murthy [2 ]
Munaga, Krishna [2 ]
机构
[1] Jawaharlal Nehru Technol Univ, Dept Comp Sci & Engn, Hyderabad, Andhra Prades, India
[2] Univ Coll Engn Kakinada, Dept Comp Sci & Engn, Kakinada, India
关键词
Cluster; data stream; fractal; self-similarity; sliding window; damped window;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online clustering, in an evolving high dimensional data is an amazing challenge for data mining applications. Although, many clustering strategies have been proposed, it is still an exciting task since the published algorithms fail to do well with high dimensional datasets, finding arbitrary shaped clusters and handling outliers. Knowing fractal characteristics of dataset can help abstract the dataset and provide insightful hints in the clustering process. This paper concentrates on presenting a novel strategy, FractStream for clustering data streams using fractal dimension, basic window technology, and damped window model. Core fractal-clusters, progressive fractal-cluster, outlier fractal clusters are identified, aiming to reduce search complexity and execution time. Pruning strategies are also employed based on the weights associated with each cluster, which reduced the usage of main memory. Experimental study of this paper over a number of data sets demonstrates the effectiveness and efficiency of the proposed technique.
引用
收藏
页码:1 / 9
页数:9
相关论文
共 50 条
  • [1] A Grid and Fractal Dimension-Based Data Stream Clustering Algorithm
    Lin, Guoping
    Chen, Leisong
    [J]. ISISE 2008: INTERNATIONAL SYMPOSIUM ON INFORMATION SCIENCE AND ENGINEERING, VOL 1, 2008, : 66 - +
  • [2] Grid-based clustering over an evolving data stream
    Wan, Renxia
    Chen, Jingchao
    Wang, Lixin
    Su, Xiaoke
    [J]. INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2009, 1 (04) : 393 - 410
  • [3] Density-Based Clustering over an Evolving Data Stream with Noise
    Cao, Feng
    Ester, Martin
    Qian, Weining
    Zhou, Aoying
    [J]. PROCEEDINGS OF THE SIXTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2006, : 328 - +
  • [4] A Density-Based Clustering over Evolving Heterogeneous Data Stream
    Lin, Jinxian
    Lin, Hui
    [J]. 2009 ISECS INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT, VOL IV, 2009, : 275 - +
  • [5] Fast estimation of fractal dimension and correlation integral on stream data
    Wong, A
    Wu, LJ
    Gibbons, PB
    Faloutsos, C
    [J]. INFORMATION PROCESSING LETTERS, 2005, 93 (02) : 91 - 97
  • [6] Synchronization-based clustering on evolving data stream
    Shao, Junming
    Tan, Yue
    Gao, Lianli
    Yang, Qinli
    Plant, Claudia
    Assent, Ira
    [J]. INFORMATION SCIENCES, 2019, 501 : 573 - 587
  • [7] Evolving data stream clustering based on constant false clustering probability
    Kashani, Elham S.
    Shouraki, Saeed Bagheri
    Norouzi, Yaser
    [J]. INFORMATION SCIENCES, 2022, 614 : 1 - 18
  • [8] A Three-Step Clustering Algorithm over an Evolving Data Stream
    Liu Li-xiong
    Guo Yun-fei
    Kang Jing
    Huang Hai
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 1, 2009, : 160 - 164
  • [9] Knowledge-based Evolving Clustering Algorithm for Data Stream
    Sun, Zhaoyang
    Mao, K. Z.
    Tang, Wenyin
    Mak, Lee-Onn
    Xian, Kuitong
    Liu, Ying
    [J]. 2014 11TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT (ICSSSM), 2014,
  • [10] A buffer-based online clustering for evolving data stream
    Islam, Md. Kamrul
    Ahmed, Md. Manjur
    Zamli, Kamal Z.
    [J]. INFORMATION SCIENCES, 2019, 489 : 113 - 135