Enhancing the DISSFCM Algorithm for Data Stream Classification

被引:3
|
作者
Casalino, Gabriella [1 ,2 ]
Castellano, Giovanna [1 ,2 ]
Fanelli, Anna Maria [1 ]
Mencar, Corrado [1 ,2 ]
机构
[1] Univ Bari Aldo Moro, Comp Sci Dept, Bari, Italy
[2] INdAM Res Grp GNCS, Rome, Italy
来源
关键词
Data stream classification; Semi-supervised fuzzy clustering; Incremental adaptive clustering;
D O I
10.1007/978-3-030-12544-8_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Analyzing data streams has become a new challenge to meet the demands of real time analytics. Conventional mining techniques are proving inefficient to cope with challenges associated with data streams, including resources constraints like memory and running time along with single scan of the data. Most existing data stream classification methods require labeled samples that are more difficult and expensive to obtain than unlabeled ones. Semi-supervised learning algorithms can solve this problem by using unlabeled samples together with a few labeled ones to build classification models. Recently we proposed DISSFCM, an algorithm for data stream classification based on incremental semi-supervised fuzzy clustering. To cope with the evolution of data, DISSFCM adapts dynamically the number of clusters by splitting large-scale clusters. While splitting is effective in improving the quality of clusters, a repeated application without counter-balance may induce many small-scale clusters. To solve this problem, in this paper we enhance DISSFCM by introducing a procedure that merges small-scale clusters. Preliminary experimental results on a real-world benchmark dataset show the effectiveness of the method.
引用
收藏
页码:109 / 122
页数:14
相关论文
共 50 条
  • [31] Accuracy Based Weighted Aging Ensemble (AB-WAE) - algorithm for data stream classification
    Wozniak, Michal
    [J]. 2017 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI), 2017, : 21 - 24
  • [32] A Weighted Ensemble Classification Algorithm Based on Nearest Neighbors for Multi-Label Data Stream
    Wu, Hongxin
    Han, Meng
    Chen, Zhiqiang
    Li, Muhang
    Zhang, Xilong
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2023, 17 (05)
  • [33] A cluster algorithm for uncertain data stream
    Han D.-H.
    Wang K.
    Shao C.-L.
    Ma C.
    [J]. Han, Dong-Hong (handonghong@cse.neu.edu.cn), 1677, Northeast University (37): : 1677 - 1682
  • [34] Dynamic ensemble selection classification algorithm based on window over imbalanced drift data stream
    Han, Meng
    Zhang, Xilong
    Chen, Zhiqiang
    Wu, Hongxin
    Li, Muhang
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (03) : 1105 - 1128
  • [35] Dynamic ensemble selection classification algorithm based on window over imbalanced drift data stream
    Meng Han
    Xilong Zhang
    Zhiqiang Chen
    Hongxin Wu
    Muhang Li
    [J]. Knowledge and Information Systems, 2023, 65 : 1105 - 1128
  • [36] ENSEMBLE LEARNING FOR NETWORK DATA STREAM CLASSIFICATION USING SIMILARITY AND ONLINE GENETIC ALGORITHM CLASSIFIERS
    Raja, Arun Manicka M.
    Swamynathan, S.
    [J]. 2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 1601 - 1607
  • [37] Enhancing privacy in remote data classification
    Piva, A.
    Orlandi, C.
    Caini, M.
    Bianchi, T.
    Barni, M.
    [J]. PROCEEDINGS OF THE IFIP TC 11/ 23RD INTERNATIONAL INFORMATION SECURITY CONFERENCE, 2008, : 33 - +
  • [38] Enhancing data stream predictions with reliability estimators and explanation
    Bosnic, Zoran
    Demsar, Jaka
    Kespret, Grega
    Rodrigues, Pedro Pereira
    Gama, Joao
    Kononenko, Igor
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2014, 34 : 178 - 192
  • [39] A Survey on Ensemble Learning for Data Stream Classification
    Gomes, Heitor Murilo
    Barddal, Jean Paul
    Enembreck, Fabricio
    Bifet, Albert
    [J]. ACM COMPUTING SURVEYS, 2017, 50 (02)
  • [40] Application of Combined Classifiers to Data Stream Classification
    Wozniak, Michal
    [J]. COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT, CISIM 2013, 2013, 8104 : 13 - 23