Data Stream Clustering: Challenges and Issues

被引:0
|
作者
Khalilian, Madjid [2 ,1 ]
Mustapha, Norwati [2 ]
机构
[1] Islamic Azad Univ, Karaj Branch, Karaj, Iran
[2] UPM, Fac Comp Sci & Informat Technoloy, Dept Comp Sci, Serdang, Turkey
关键词
Data Stream; Clustering; K-Means; Concept drift; MODEL;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Very large databases are required to store massive amounts of data that are continuously inserted and queried. Analyzing huge data sets and extracting valuable pattern in many applications are interesting for researchers. We can identify two main groups of techniques for huge data bases mining. One group refers to streaming data and applies mining techniques whereas second group attempts to solve this problem directly with efficient algorithms. Recently many researchers have focused on data stream as an efficient strategy against huge data base mining instead of mining on entire data base. The main problem in data stream mining means evolving data is more difficult to detect in this techniques therefore unsupervised methods should be applied. However, clustering techniques can lead us to discover hidden information. In this survey, we try to clarify: first, the different problem definitions related to data stream clustering in general; second, the specific difficulties encountered in this field of research; third, the varying assumptions, heuristics, and intuitions forming the basis of different approaches; and how several prominent solutions tackle different problems.
引用
收藏
页码:566 / +
页数:3
相关论文
共 50 条
  • [1] Data stream clustering: a review
    Zubaroglu, Alaettin
    Atalay, Volkan
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (02) : 1201 - 1236
  • [2] Data Stream Clustering: A Survey
    Silva, Jonathan A.
    Faria, Elaine R.
    Barros, Rodrigo C.
    Hruschka, Eduardo R.
    de Carvalho, Andre C. P. L. F.
    Gama, Joao
    [J]. ACM COMPUTING SURVEYS, 2013, 46 (01)
  • [3] Data stream clustering: a review
    Alaettin Zubaroğlu
    Volkan Atalay
    [J]. Artificial Intelligence Review, 2021, 54 : 1201 - 1236
  • [4] Issues in data stream management
    Golab, L
    Özsu, MT
    [J]. SIGMOD RECORD, 2003, 32 (02) : 5 - 14
  • [5] A survey on data stream clustering and classification
    Hai-Long Nguyen
    Woon, Yew-Kwong
    Ng, Wee-Keong
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 45 (03) : 535 - 569
  • [6] MVStream: Multiview Data Stream Clustering
    Huang, Ling
    Wang, Chang-Dong
    Chao, Hong-Yang
    Yu, Philip S.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (09) : 3482 - 3496
  • [7] Data Stream Clustering with Affinity Propagation
    Zhang, Xiangliang
    Furtlehner, Cyril
    Germain-Renaud, Cecile
    Sebag, Michele
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (07) : 1644 - 1656
  • [8] An evaluation of data stream clustering algorithms
    Mansalis, Stratos
    Ntoutsi, Eirini
    Pelekis, Nikos
    Theodoridis, Yannis
    [J]. STATISTICAL ANALYSIS AND DATA MINING, 2018, 11 (04) : 167 - 187
  • [9] Clustering data stream: A survey of algorithms
    Mahdiraji, Alireza
    [J]. INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2009, 13 (02) : 39 - 44
  • [10] Data Stream Clustering for Botnet Detection
    Silpa, V. G.
    Soniya, B.
    [J]. IEEE INTERNATIONAL CONFERENCE ON SOFT-COMPUTING AND NETWORK SECURITY (ICSNS 2018), 2018, : 107 - 111