FuzzStream: Fuzzy Data Stream Clustering Based on the Online-Offline Framework

被引:0
|
作者
Lopes, Priscilla de Abreu [1 ]
Camargo, Heloisa de Arruda [1 ]
机构
[1] Univ Fed Sao Carlos, Dept Comp, Sao Carlos, SP, Brazil
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Systems capable of generating data quickly and continuously, known as data streams, are a reality today and tend to increase. Due to the nature of data streams, unsupervised learning, such as clustering algorithms, is appropriate. In addition, techniques derived from fuzzy set theory can be useful and add flexibility to the learning process. Fuzzy clustering algorithms for data streams found in the literature are based on chunks, which require the definition of several parameters besides presenting the drawback of overly reducing the summarization of data. An approach to Data Stream clustering that overpasses some of the limitations of chunk-based algorithms is the one called Online-Offline Framework. This framework comprises two phases: summarization and clustering. To the best of our knowledge, there is not a fuzzy version of this framework. The objective of this work is to propose a fuzzy version for the Online-Offline Framework, called FuzzStream, whose main component is a summarization structure and its corresponding maintenance algorithm to be used in the online phase. The well known Weighted Fuzzy C-Means clustering algorithm is used in the offline phase. Experiments show that our proposal is a promising approach to deal with data streams and presents benefits with relation to the classic version.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Scalable Online-Offline Stream Clustering in Apache Spark
    Backhoff, Omar
    Ntoutsi, Eirini
    [J]. 2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 37 - 44
  • [2] DistStream: An Order-Aware Distributed Framework for Online-Offline Stream Clustering Algorithms
    Xu, Lijie
    Ye, Xingtong
    Kang, Kai
    Guo, Tian
    Dou, Wensheng
    Wang, Wei
    Wei, Jun
    [J]. 2020 IEEE 40TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2020, : 842 - 852
  • [3] d-FuzzStream: A Dispersion-Based Fuzzy Data Stream Clustering
    Schick, Leonardo
    Lopes, Priscilla de Abreu
    Camargo, Heloisa de Arruda
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2018,
  • [4] The ALICE Online-Offline Framework for the Extraction of Conditions Data
    Grosse-Oetringhaus, Jan Fiete
    Zampolli, Chiara
    Colla, Alberto
    Carminati, Federico
    [J]. 17TH INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP09), 2010, 219
  • [5] Efficient Clustering of Short Text Streams using Online-Offline Clustering
    Rakib, Md Rashadul Hasan
    Zeh, Norbert
    Milios, Evangelos
    [J]. PROCEEDINGS OF THE 21ST ACM SYMPOSIUM ON DOCUMENT ENGINEERING (DOCENG '21), 2021,
  • [6] Scheduling framework based on reinforcement learning in online-offline colocated cloud environment
    Ma, Ling
    Fan, Qiliang
    Xu, Ting
    Guo, Guanchen
    Zhang, Shenglin
    Sun, Yongqian
    Zhang, Yuzhi
    [J]. Tongxin Xuebao/Journal on Communications, 2023, 44 (06): : 90 - 102
  • [7] A Distributed Framework for Online Stream Data Clustering
    Ding, Jiafeng
    Fang, Junhua
    Chao, Pingfu
    Xu, Jiajie
    Zhao, PengPeng
    Zhao, Lei
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT I, 2020, 12452 : 190 - 204
  • [8] An Online-Offline Combined Big Data Mining Platform
    Zhang, Weishan
    Lv, Hao
    Xu, Liang
    Liu, Yan
    Liu, Xin
    Lu, Qinghua
    Li, Zhongwei
    Zhou, Jiehan
    [J]. 2017 IEEE 15TH INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, 15TH INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, 3RD INTL CONF ON BIG DATA INTELLIGENCE AND COMPUTING AND CYBER SCIENCE AND TECHNOLOGY CONGRESS(DASC/PICOM/DATACOM/CYBERSCI, 2017, : 1220 - 1225
  • [9] Data Stream Online Clustering Based on Fuzzy Expectation-Maximization Approach
    Deineko, Anastasiia O.
    Zhernova, Polina Ye
    Gordon, Boris
    Zayika, Oleksandr O.
    Pliss, Iryna
    Pabyrivska, Nelya
    [J]. 2018 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA STREAM MINING & PROCESSING (DSMP), 2018, : 171 - 176
  • [10] CREDIBILISTIC ROBUST ONLINE FUZZY CLUSTERING IN DATA STREAM MINING TASKS
    Yu, Shafronenko A.
    Kasatkina, N. V.
    Ye, V. Bodyanskiy
    Ye, O. Shafronenko
    [J]. RADIO ELECTRONICS COMPUTER SCIENCE CONTROL, 2023, (03) : 97 - 103