New diversity measure for data stream classification ensembles

Cited by: 18
Authors
Jackowski, Konrad [1]
Affiliations
[1] Wroclaw Univ Sci & Technol, Wybrzeze Wyspianskiego 27, PL-50370 Wroclaw, Poland
Keywords
Ensemble classifier; Diversity measure; Data stream classification; Concept drift; Pattern recognition
DOI
10.1016/j.engappai.2018.05.006
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The diversity of a voting committee is one of the key characteristics of ensemble systems. It determines the benefits that can be obtained through classifier fusion. Many diversity measures are available for classical decision-making systems that operate in stationary environments, and a plethora of algorithms have been proposed to ensure ensemble diversity. Bagging and boosting are two of the most popular examples. Unfortunately, these measures and algorithms cannot be applied in systems that process streaming data. Not only must a different implementation be designed for processing fast-moving samples in a stream, but the notion of diversity must also be redefined. This paper proposes assessing diversity by analyzing how classifiers react to changes in data streams. To that end, two novel error trend diversity measures are introduced that compare the error trends of classifiers while processing subsequent samples. A practical application of these measures is also proposed in the form of a novel error trend diversity driven ensemble algorithm, in which the measures are incorporated into the training procedure. The performance of the proposed algorithm is evaluated through a series of experiments and compared to several competing methods. The results demonstrate that the measures accurately evaluate diversity and that their application facilitates the creation of small and effective ensemble classifier systems.
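The abstract does not give the definitions of the two error trend diversity measures, so the following is only an illustrative sketch of the general idea of comparing classifier error trends over a stream. Everything in it is an assumption made for illustration: the function names, the non-overlapping window scheme, and the use of sign disagreement between windowed error rates are hypothetical and are not taken from the paper.

```python
from itertools import combinations


def windowed_error_rates(errors, window):
    """Error rate over consecutive non-overlapping windows of 0/1 error flags."""
    return [
        sum(errors[i:i + window]) / window
        for i in range(0, len(errors) - window + 1, window)
    ]


def trend_signs(rates):
    """Direction of change between consecutive windows: +1 rising, -1 falling, 0 flat."""
    return [(curr > prev) - (curr < prev) for prev, curr in zip(rates, rates[1:])]


def pairwise_trend_disagreement(errors_a, errors_b, window):
    """Fraction of window transitions where two classifiers' error trends differ.

    Hypothetical stand-in for an error trend diversity measure; assumes both
    error sequences cover the same samples of the stream.
    """
    signs_a = trend_signs(windowed_error_rates(errors_a, window))
    signs_b = trend_signs(windowed_error_rates(errors_b, window))
    if not signs_a or not signs_b:
        return 0.0
    differing = sum(a != b for a, b in zip(signs_a, signs_b))
    return differing / min(len(signs_a), len(signs_b))


def ensemble_trend_diversity(error_streams, window):
    """Average pairwise trend disagreement over all classifier pairs."""
    pairs = list(combinations(error_streams, 2))
    if not pairs:
        return 0.0
    total = sum(pairwise_trend_disagreement(a, b, window) for a, b in pairs)
    return total / len(pairs)


# Per-sample error flags (1 = misclassified) for two hypothetical classifiers.
clf_a = [0, 0, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0]
clf_b = [1, 1, 1, 1, 0, 0, 1, 1, 0, 0, 0, 0]
diversity_ab = pairwise_trend_disagreement(clf_a, clf_b, window=4)
```

In this toy stream the two classifiers' error rates move in opposite directions across the first window transition and in the same direction across the second, so half of the transitions disagree. A pair of classifiers that always degrade and recover together would score zero under this sketch, which matches the intuition that such a pair adds little when the stream changes.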
Pages: 23-34
Page count: 12
Related papers
50 in total
  • [1] A new ensemble diversity measure applied to thinning ensembles
    Banfield, RE
    Hall, LO
    Bowyer, KW
    Kegelmeyer, WP
    [J]. MULTIPLE CLASSIFIER SYSTEMS, PROCEEDING, 2003, 2709 : 306 - 316
  • [2] Making Data Stream Classification Tree-based Ensembles Lighter
    Turrisi da Costa, Victor G.
    Mastelini, Saulo M.
    de Carvalho, Andre C. P. de L. F.
    Barbon, Sylvio, Jr.
    [J]. 2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2018, : 480 - 485
  • [3] Exploring the Relationships between Data Complexity and Classification Diversity in Ensembles
    Garcia, Nathan Formentin
    Tiggeman, Frederico
    Borges, Eduardo N.
    Lucca, Giancarlo
    Santos, Helida
    Dimuro, Gracaliz
    [J]. PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS (ICEIS 2021), VOL 1, 2021, : 652 - 659
  • [4] A New Data Stream Classification Algorithm
    Liang, Hong-shuo
    Jin, Li-qun
    Zhao, Li
    [J]. PROCEEDINGS OF 2013 2ND INTERNATIONAL CONFERENCE ON MEASUREMENT, INFORMATION AND CONTROL (ICMIC 2013), VOLS 1 & 2, 2013, : 477 - 481
  • [5] Naive Bayes Classification Ensembles to Support Modeling Decisions in Data Stream Mining
    Lutu, Patricia E. N.
    [J]. 2015 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2015, : 335 - 340
  • [6] Dynamically Adjusting Diversity in Ensembles for the Classification of Data Streams with Concept Drift
    Hidalgo, Juan I. G.
    Santos, Silas G. T. C.
    Barros, Roberto S. M.
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (02)
  • [7] A new similarity/diversity measure for sequential data
    Todeschini, R.
    Ballabio, D.
    Consonni, V.
    Mauri, A.
    [J]. MATCH-COMMUNICATIONS IN MATHEMATICAL AND IN COMPUTER CHEMISTRY, 2007, 57 (01) : 51 - 67
  • [8] A New Heterogeneous Dissimilarity Measure for Data Classification
    Pereira, Cesar Lima
    Cavalcanti, George D. C.
    Ren, Tsang Ing
    [J]. 22ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2010), PROCEEDINGS, VOL 2, 2010, : 373 - 374
  • [9] Diversity in Ensembles for One-Class Classification
    Krawczyk, Bartosz
    [J]. NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, 2013, 185 : 119 - 129
  • [10] A New Feature Selection Algorithm for Stream Data Classification
    Wankhade, Kapil
    Rane, Dhiraj
    Thool, Ravindra
    [J]. 2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 1843 - 1848