A smart intelligent approach based on hybrid group search and pelican optimization algorithm for data stream clustering

Cited by: 0
Authors
Swathi Agarwal
C. R. K. Reddy
Affiliations
[1] Osmania University, Department of Computer Science and Engineering
[2] CVR College of Engineering, Department of IT
[3] Mahatma Gandhi Institute of Technology, Department of Computer Science and Engineering
Source
Knowledge and Information Systems, 2024, 66 (04): 2467-2500
Keywords
Data stream clustering; K-means clustering; Hybrid group search pelican optimization; Micro-cluster formation; Cluster organization; Cluster optimization;
DOI
Not available
CLC classification number
Subject classification number
Abstract
Big data applications generate large volumes of evolving, real-time, high-dimensional streaming data. In many of these applications, clustering data streams efficiently and effectively is challenging, and it remains a major problem in data mining. Several clustering techniques have been implemented for stream data, but most handle cluster dynamics in only a restricted way. A data stream is a sequence of arriving data, and clustering it involves several factors that classical clustering does not face. The stream is mostly unbounded, and every data point must be examined at least once, which increases processing time and memory requirements. In addition, the clusters and their statistical properties vary over time, and streams can be noisy. To address these challenges, this work implements a novel data stream clustering method built on a hybrid meta-heuristic model. First, a data stream is collected and micro-clusters are formed with the K-Means Clustering (KMC) technique. The micro-clusters are then merged and sorted, and the clusters are optimized by Hybrid Group Search Pelican Optimization (HGSPO). The objective is to maximize clustering accuracy through radius, distance, and similarity measures, whose thresholds are optimized. In the training phase, a clustering threshold is fixed for each cluster. When new data enters the stream clustering model, its output is compared with the output of the training data, and the data is forwarded to the appropriate cluster based on the assigned threshold with minimum similarity. A performance analysis compares the proposed system with various clustering and heuristic algorithms on standard performance metrics to confirm its clustering quality.
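The two stages named in the abstract, micro-cluster formation with k-means and threshold-based routing of new points, can be illustrated with a minimal sketch. This is not the authors' implementation: the HGSPO optimizer is omitted, and each cluster's threshold is simply set to a hand-picked multiple of its radius. All names here (`nearest`, `kmeans`, `build_micro_clusters`, `assign`, `radius_scale`) are hypothetical, and returning `None` for a point outside every threshold is an assumption, since the abstract does not say how such points are handled.

```python
import numpy as np


def nearest(points, centroids):
    """Index of the closest centroid for every row of `points`."""
    dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
    return dists.argmin(axis=1)


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means; returns (centroids, labels)."""
    points = np.asarray(points, dtype=float)
    rng = np.random.default_rng(seed)
    centroids = points[rng.choice(len(points), size=k, replace=False)].copy()
    for _ in range(iters):
        labels = nearest(points, centroids)
        for j in range(k):
            members = points[labels == j]
            if len(members):  # keep the old centroid if a cluster empties
                centroids[j] = members.mean(axis=0)
    return centroids, nearest(points, centroids)


def build_micro_clusters(points, k, radius_scale=1.5):
    """Training phase: form micro-clusters and fix one threshold per cluster.

    Assumption: threshold = radius_scale * cluster radius. In the paper the
    thresholds are tuned by HGSPO, which is not reproduced here.
    """
    points = np.asarray(points, dtype=float)
    centroids, labels = kmeans(points, k)
    clusters = []
    for j in range(k):
        members = points[labels == j]
        if len(members) == 0:
            continue
        radius = np.linalg.norm(members - centroids[j], axis=1).max()
        clusters.append({
            "center": centroids[j].copy(),
            "n": len(members),
            "threshold": radius_scale * max(radius, 1e-9),
        })
    return clusters


def assign(clusters, x):
    """Route a new point to the nearest micro-cluster if it lies within that
    cluster's threshold; otherwise return None (assumed behaviour)."""
    x = np.asarray(x, dtype=float)
    dists = [np.linalg.norm(x - c["center"]) for c in clusters]
    j = int(np.argmin(dists))
    if dists[j] > clusters[j]["threshold"]:
        return None
    c = clusters[j]
    c["n"] += 1
    c["center"] = c["center"] + (x - c["center"]) / c["n"]  # incremental mean
    return j


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    train = np.vstack([rng.normal(0.0, 0.5, (50, 2)), rng.normal(10.0, 0.5, (50, 2))])
    micro = build_micro_clusters(train, k=2)
    print(assign(micro, [0.1, -0.1]), assign(micro, [100.0, 100.0]))
```

Updating the center with an incremental mean keeps memory constant per micro-cluster, which matches the abstract's concern about unbounded streams; a fuller version would also update the radius and merge nearby micro-clusters.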
Pages: 2467-2500
Page count: 33
Related papers
50 in total
  • [1] A smart intelligent approach based on hybrid group search and pelican optimization algorithm for data stream clustering
    Agarwal, Swathi
    Reddy, C. R. K.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (04) : 2467 - 2500
  • [2] Hybrid Reptile Search Algorithm and Remora Optimization Algorithm for Optimization Tasks and Data Clustering
    Almotairi, Khaled H.
    Abualigah, Laith
    SYMMETRY-BASEL, 2022, 14 (03)
  • [3] An efficient hybrid data clustering method based on Candidate Group Search and Genetic Algorithm
    Patil, Suvarna P.
    Thakare, Anuradha D.
    Dhote, C. A.
    2015 4TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (ICRITO) (TRENDS AND FUTURE DIRECTIONS), 2015
  • [4] Automatic Data Clustering based on Hybrid Atom Search Optimization and Sine-Cosine Algorithm
    Abd Elaziz, Mohamed
    Neggaz, Nabil
    Ewees, Ahmed A.
    Lu, Songfeng
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 2315 - 2322
  • [5] FGCH: a fast and grid based clustering algorithm for hybrid data stream
    Chen, Jinyin
    Lin, Xiang
    Xuan, Qi
    Xiang, Yun
    APPLIED INTELLIGENCE, 2019, 49 (04) : 1228 - 1244
  • [7] Data Stream Clustering Algorithm for Smart Site and Its Implementation Based on Flink
    Li, Wei
    Jian, Tiantian
    Zhao, Ziqiao
    Ma, Xiang
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 2838 - 2845
  • [8] A hybrid approach to global optimization using a clustering algorithm in a genetic search framework
    Hanagandi, V
    Nikolaou, M
    COMPUTERS & CHEMICAL ENGINEERING, 1998, 22 (12) : 1913 - 1925
  • [10] Multi Objective Hybridized Firefly Algorithm with Group Search Optimization for Data Clustering
    George, Golda
    Parthiban, Latha
    2015 IEEE INTERNATIONAL CONFERENCE ON RESEARCH IN COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (ICRCICN), 2015, : 125 - 130