A Cluster-Based Prototype Reduction for Online Classification

被引:4
|
作者
Garcia, Kemilly Dearo [1 ,2 ]
de Carvalho, Andre C. P. L. F. [2 ]
Mendes-Moreira, Joao [3 ,4 ]
机构
[1] Univ Twente, Enschede, Netherlands
[2] Univ Sao Paulo, ICMC, Sao Paulo, Brazil
[3] Univ Porto, Fac Engn, Porto, Portugal
[4] LIAAD INESC TEC, Porto, Portugal
关键词
kNN Prototyping; Data stream; Online clustering;
D O I
10.1007/978-3-030-03493-1_63
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data stream is a challenging research topic in which data can continuously arrive with a probability distribution that may change over time. Depending on the changes in the data distribution, different phenomena can occur, for example, a concept drift. A concept drift occurs when the concepts associated with a dataset change when new data arrive. This paper proposes a new method based on k-Nearest Neighbors that implements a sliding window requiring less instances stored for training than existing methods. For such, a clustering approach is used to summarize data by placing labeled instances considered similar in the same cluster. Besides, instances close to the uncertainty border of existing classes are also stored, in a sliding window, to adapt the model to concept drift. The proposed method is experimentally compared with state-of-the-art classifiers from the data stream literature, regarding accuracy and processing time. According to the experimental results, the proposed method has better accuracy and less time consumption when fewer information about the concepts are stored in a single sliding window.
引用
收藏
页码:603 / 610
页数:8
相关论文
共 50 条
  • [1] Cluster-based adaptive metric classification
    Giotis, Ioannis
    Petkov, Nicolai
    [J]. NEUROCOMPUTING, 2012, 81 : 33 - 40
  • [2] Cluster-based data relabelling for classification
    Wan, Huan
    Wang, Hui
    Scotney, Bryan
    Liu, Jun
    Wei, Xin
    [J]. INFORMATION SCIENCES, 2023, 648
  • [3] A cluster-based merging strategy for nearest prototype classifiers
    Mollineda, RA
    Ferri, FJ
    Vidal, E
    [J]. 15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS: PATTERN RECOGNITION AND NEURAL NETWORKS, 2000, : 755 - 758
  • [4] Decoupling of clustering and classification steps in a cluster-based classification
    Hashemi, RR
    Bahar, M
    Childers, C
    Tyler, AA
    [J]. ICMLA 2005: FOURTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2005, : 285 - 290
  • [5] Cluster-based instance selection for machine classification
    Czarnowski, Ireneusz
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 30 (01) : 113 - 133
  • [6] A Cluster-Based Semisupervised Ensemble for Multiclass Classification
    Soares, Rodrigo G. F.
    Chen, Huanhuan
    Yao, Xin
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2017, 1 (06): : 408 - 420
  • [7] Efficient Cluster-Based Boosting for Semisupervised Classification
    Soares, Rodrigo G. F.
    Chen, Huanhuan
    Yao, Xin
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (11) : 5667 - 5680
  • [8] Cluster-Based Classification of Blockchain Consensus Algorithms
    Aponte, Fredy
    Gutierrez, Luz
    Pineda, Magda
    Merino, Ines
    Salazar, Augusto
    Wightman, Pedro
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2021, 19 (04) : 688 - 696
  • [9] Cluster-based instance selection for machine classification
    Ireneusz Czarnowski
    [J]. Knowledge and Information Systems, 2012, 30 : 113 - 133
  • [10] Cluster-Based Partial-Order Reduction
    Twan Basten
    Dragan Bošnački
    Marc Geilen
    [J]. Automated Software Engineering, 2004, 11 (4) : 365 - 402