FGCH: a fast and grid based clustering algorithm for hybrid data stream

被引:0
|
作者
Jinyin Chen
Xiang Lin
Qi Xuan
Yun Xiang
机构
[1] Zhejiang University of Technology,The College of Information Engineering
来源
Applied Intelligence | 2019年 / 49卷
关键词
Data stream; Clustering analysis; Non-uniform attenuation; Grid clustering;
D O I
暂无
中图分类号
学科分类号
摘要
Streaming large volumes of data has a wide range of real-world applications, e.g., video flows, internet calls, and online games etc. Thus, fast and real-time data stream processing is important. Traditionally, data clustering algorithms are efficient and effective to mine information from large data. However, they are mostly not suitable for online data stream clustering. Therefore, in this work, we propose a novel fast and grid based clustering algorithm for hybrid data stream (FGCH). Specifically, we have made the following main contributions: 1), we develop a non-uniform attenuation model to enhance the resistance to noise; 2), we propose a similarity calculation method for hybrid data, which can calculate the similarity more efficiently and accurately; and 3), we present a novel clustering center fast determination algorithm (CCFD), which can automatically determine the number, center, and radius of clusters. Our technique is compared with several state-of-art clustering algorithms. The experimental results show that our technique can achieve more than better clustering accuracy on average. Meanwhile, the running time is shorter compared with the closest algorithm.
引用
收藏
页码:1228 / 1244
页数:16
相关论文
共 50 条
  • [21] A Grid and Density Based Fast Spatial Clustering Algorithm
    Huang Ming
    Bian Fuling
    2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL IV, PROCEEDINGS, 2009, : 260 - 263
  • [22] A fast consistent grid-based clustering algorithm
    Tarasenko, Anton S.
    Berikov, Vladimir B.
    Pestunov, Igor A.
    Rylov, Sergey A.
    Ruzankin, Pavel S.
    PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (04)
  • [23] A Density-Grid Based Clustering Algorithm on Data Stream Using Resilient Distributed Datasets
    Zhang, Yuan
    Zhang, Jiongmin
    ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2016, 2016, 9673 : 316 - 322
  • [24] Ant Colony Stream Clustering: A Fast Density Clustering Algorithm for Dynamic Data Streams
    Fahy, Conor
    Yang, Shengxiang
    Gongora, Mario
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (06) : 2215 - 2228
  • [25] Grid-based clustering over an evolving data stream
    Wan, Renxia
    Chen, Jingchao
    Wang, Lixin
    Su, Xiaoke
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2009, 1 (04) : 393 - 410
  • [26] Grid-based data stream clustering for intrusion detection
    Quan, Q. (qqian@shu.edu.cn), 1600, Femto Technique Co., Ltd. (15):
  • [27] Research on Data Stream Clustering Based on FCM Algorithm
    Gao, Tiancheng
    Li, Aihua
    Meng, Fan
    5TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, ITQM 2017, 2017, 122 : 595 - 602
  • [28] IMPROVED DENSITY BASED ALGORITHM FOR DATA STREAM CLUSTERING
    Mousavi, Maryam
    Abu Bakar, Azuraliza
    JURNAL TEKNOLOGI, 2015, 77 (18): : 73 - 77
  • [29] THE CLUSTERING ALGORITHM OF EVOLUTIONAL DATA STREAM BASED ON DENSITY
    Meng, Yuyu
    Zheng, Liying
    3RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE (ITCS 2011), PROCEEDINGS, 2011, : 473 - 477
  • [30] Drifted Data Stream Clustering Based on ClusTree Algorithm
    Zgraja, Jakub
    Wozniak, Michal
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS (HAIS 2018), 2018, 10870 : 338 - 349