Pairwise Combination of Classifiers for Ensemble Learning on Data Streams

被引:2
|
作者
Gomes, Heitor Murilo [1 ]
Barddal, Jean Paul [1 ]
Enembreck, Fabricio [1 ]
机构
[1] Pontificia Univ Catolica Parana, Rua Imaculada Conceicao 1155, Curitiba, Parana, Brazil
关键词
Data Stream Mining; Concept Drift; Ensemble Classifiers; Machine Learning; Supervised Learning;
D O I
10.1145/2695664.2695754
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This work presents two different voting strategies for ensemble learning on data streams based on pairwise combination of component classifiers. Despite efforts to build a diverse ensemble, there is always some degree of overlap between component classifiers models. Our voting strategies are aimed at using these overlaps to support ensemble prediction. We hypothesize that by combining pairs of classifiers it is possible to alleviate incorrect individual predictions that would otherwise negatively impact the overall ensemble decision. The first strategy, Pairwise Accuracy (PA), combines the shared accuracy estimation of all possible pairs in the ensemble, while the second strategy, Pairwise Patterns (PP), record patterns of pairwise decisions during training and use these patterns during prediction. We present empirical results comparing ensemble classifiers with their original voting methods and our proposed methods in both real and synthetic datasets, with and without concept drifts. Our analysis indicates that pairwise voting is able to enhance overall performance for PP, especially on real datasets, and that PA is useful whenever there are noticeable differences in accuracy estimates among ensemble members, which is common during concept drifts.
引用
收藏
页码:941 / 946
页数:6
相关论文
共 50 条
  • [41] One-class classifiers with incremental learning and forgetting for data streams with concept drift
    Krawczyk, Bartosz
    Wozniak, Michal
    [J]. SOFT COMPUTING, 2015, 19 (12) : 3387 - 3400
  • [42] One-class classifiers with incremental learning and forgetting for data streams with concept drift
    Bartosz Krawczyk
    Michał Woźniak
    [J]. Soft Computing, 2015, 19 : 3387 - 3400
  • [43] Learning ensemble classifiers for diabetic retinopathy assessment
    Saleh, Emran
    Blaszczynski, Jerzy
    Moreno, Antonio
    Valls, Aida
    Romero-Aroca, Pedro
    de la Riya-Fernandez, Sofia
    Slowinsk, Roman
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2018, 85 : 50 - 63
  • [44] Pruning an ensemble of classifiers via reinforcement learning
    Partalas, Ioannis
    Tsoumakas, Grigorios
    Vlahavas, Ioannis
    [J]. NEUROCOMPUTING, 2009, 72 (7-9) : 1900 - 1909
  • [45] Ensemble learning with biased classifiers: The Triskel algorithm
    Hess, A
    Khoussainov, R
    Kushmerick, N
    [J]. MULTIPLE CLASSIFIER SYSTEMS, 2005, 3541 : 226 - 235
  • [46] An ensemble of cluster-based classifiers for semi-supervised classification of non-stationary data streams
    Hosseini, Mohammad Javad
    Gholipour, Ameneh
    Beigy, Hamid
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 46 (03) : 567 - 597
  • [47] An ensemble of cluster-based classifiers for semi-supervised classification of non-stationary data streams
    Mohammad Javad Hosseini
    Ameneh Gholipour
    Hamid Beigy
    [J]. Knowledge and Information Systems, 2016, 46 : 567 - 597
  • [48] Online Active Learning with Drifted Data Streams Using Paired Ensemble Framework
    Shan, Ji-Cheng
    Liu, Wei-Ke
    Chu, Chen-Xi
    Dai, Chao-Fan
    Liu, Qing-Bao
    [J]. 4TH ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS (ITA 2017), 2017, 12
  • [49] Microcluster-Based Incremental Ensemble Learning for Noisy, Nonstationary Data Streams
    Liu, Sanmin
    Xue, Shan
    Liu, Fanzhen
    Cheng, Jieren
    Li, Xiulai
    Kong, Chao
    Wu, Jia
    [J]. COMPLEXITY, 2020, 2020
  • [50] Random Ensemble Decision Trees for Learning Concept-Drifting Data Streams
    Li, Peipei
    Wu, Xindong
    Liang, Qianhui
    Hu, Xuegang
    Zhang, Yuhong
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT I: 15TH PACIFIC-ASIA CONFERENCE, PAKDD 2011, 2011, 6634 : 313 - 325