A smart intelligent approach based on hybrid group search and pelican optimization algorithm for data stream clustering

Cited by: 0
Authors
Swathi Agarwal
C. R. K. Reddy
Affiliations
[1] Osmania University, Department of Computer Science and Engineering
[2] CVR College of Engineering, Department of IT
[3] Mahatma Gandhi Institute of Technology, Department of Computer Science and Engineering
Keywords
Data stream clustering; K-means clustering; Hybrid group search pelican optimization; Micro-cluster formation; Cluster organization; Cluster optimization
DOI
Not available
Abstract
Big data applications generate large volumes of evolving, real-time, high-dimensional streaming data. In many applications, clustering such data streams efficiently and effectively is challenging, and it remains a major problem in data mining. Several clustering techniques have been implemented for stream data, but most of them handle cluster dynamics only in a restricted way. A data stream is a sequence of arriving data, and clustering it involves several factors that do not arise in classical clustering. The stream is mostly unbounded, and every data point has to be evaluated at least once, which leads to higher processing time and additional memory requirements. In addition, the clusters and their statistical properties vary over time, and streams can be noisy. To address these challenges, this work implements a novel data stream clustering approach built on a hybrid meta-heuristic model. Initially, a data stream is collected, and micro-clusters are formed using the K-Means Clustering (KMC) technique. The micro-clusters are then merged and sorted, and cluster optimization is performed by Hybrid Group Search Pelican Optimization (HGSPO). The objective of the clustering is to maximize accuracy through radius, distance, and similarity measures, and the thresholds of these measures are optimized. In the training phase, a clustering threshold is fixed for each cluster. When new data arrives at the stream clustering model, its output is compared with the output of the training data, and the new data is forwarded to the appropriate cluster based on the assigned threshold with minimum similarity. The performance analysis and the results obtained confirm the clustering quality of the proposed system in terms of standard performance metrics, in comparison with various clustering and heuristic algorithms.
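The pipeline described in the abstract (K-means micro-clusters, a fixed per-cluster threshold from training, then threshold-based routing of new data) can be illustrated with a minimal sketch. This is not the authors' implementation: HGSPO is not reproduced here, the threshold is simply assumed to be the largest training distance to the centroid rather than an optimized value, and the names `kmeans`, `fit_thresholds`, and `route` are hypothetical.

```python
import math
import random


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means on tuples; returns (centroids, labels)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)

    def assign():
        # Nearest centroid by Euclidean distance.
        return [min(range(k), key=lambda j: math.dist(p, centroids[j]))
                for p in points]

    for _ in range(iters):
        labels = assign()
        # Update step: mean of each cluster (keep old centroid if empty).
        for j in range(k):
            members = [p for p, l in zip(points, labels) if l == j]
            if members:
                centroids[j] = tuple(sum(c) / len(members)
                                     for c in zip(*members))
    return centroids, assign()


def fit_thresholds(points, centroids, labels):
    """ASSUMPTION: threshold = largest training distance to the centroid.
    The paper optimizes its thresholds with HGSPO instead."""
    radii = [0.0] * len(centroids)
    for p, l in zip(points, labels):
        radii[l] = max(radii[l], math.dist(p, centroids[l]))
    return radii


def route(point, centroids, radii):
    """Index of the nearest micro-cluster, or None if the point lies
    outside that cluster's radius threshold."""
    j = min(range(len(centroids)),
            key=lambda i: math.dist(point, centroids[i]))
    return j if math.dist(point, centroids[j]) <= radii[j] else None


# Training phase on two well-separated toy groups.
train = [(0, 0), (0, 1), (1, 0), (1, 1),
         (10, 10), (10, 11), (11, 10), (11, 11)]
centroids, labels = kmeans(train, k=2)
radii = fit_thresholds(train, centroids, labels)

# Streaming phase: route each arriving point.
for new_point in [(0.5, 0.5), (10.5, 10.5), (5, 5)]:
    print(new_point, "->", route(new_point, centroids, radii))
```

A point that `route` rejects (returns `None`) would, in a fuller stream-clustering system, typically seed a new micro-cluster or be buffered as a potential outlier; that step is left out of this sketch.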
Pages: 2467-2500
Page count: 33