Adaptive Ensemble Active Learning for Drifting Data Stream Mining

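Cited by: 0
Authors
Krawczyk, Bartosz [1]
Cano, Alberto [1]
Affiliations
[1] Virginia Commonwealth Univ, Dept Comp Sci, Richmond, VA 23284 USA
Keywords
DIVERSITY;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract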
Learning from data streams is among the most vital contemporary fields in machine learning and data mining. Streams pose new challenges to learning systems due to their volume and velocity, as well as their ever-changing nature caused by concept drift. The vast majority of works on data streams assume a fully supervised learning scenario with unrestricted access to class labels. This assumption does not hold in real-world applications, where obtaining ground truth is costly and time-consuming. Therefore, we need to carefully select which instances should be labeled, as we usually work under a strict label budget. In this paper, we propose a novel active learning approach based on ensemble algorithms that is capable of using multiple base classifiers during the label query process. It is a plug-in solution, capable of working with most existing streaming ensemble classifiers. We realize this process as a Multi-Armed Bandit problem, obtaining an efficient and adaptive ensemble active learning procedure by selecting the most competent classifier from the pool for each query. To better adapt to concept drift, we guide our instance selection by measuring the generalization capabilities of our classifiers. This adaptive solution leads not only to better instance selection under sparse access to class labels, but also to improved adaptation to various types of concept drift and to increased diversity of the underlying ensemble classifier.
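
The query process described in the abstract can be illustrated with a minimal sketch. The abstract does not say which bandit strategy, reward signal, or query rule the authors use, so the example below assumes UCB1, a fixed confidence threshold for uncertainty-based querying, and a reward of 1 when the selected classifier predicts the queried label correctly. All names (UCB1Selector, ThresholdClassifier, make_stream, run_stream) are hypothetical, and the base classifiers are fixed stand-ins rather than real streaming learners.

```python
import math
import random


class UCB1Selector:
    """UCB1 bandit (an assumed strategy): each arm is one base classifier."""

    def __init__(self, n_arms):
        self.counts = [0] * n_arms      # labeled queries credited to each arm
        self.rewards = [0.0] * n_arms   # summed rewards per arm

    def select(self):
        # Try every arm once before relying on the confidence bound.
        for arm, count in enumerate(self.counts):
            if count == 0:
                return arm
        total = sum(self.counts)
        scores = [
            self.rewards[a] / self.counts[a]
            + math.sqrt(2.0 * math.log(total) / self.counts[a])
            for a in range(len(self.counts))
        ]
        return max(range(len(scores)), key=scores.__getitem__)

    def update(self, arm, reward):
        self.counts[arm] += 1
        self.rewards[arm] += reward


class ThresholdClassifier:
    """Hypothetical stand-in for a streaming base classifier on 1-D inputs."""

    def __init__(self, threshold):
        self.threshold = threshold

    def predict_with_confidence(self, x):
        label = int(x >= self.threshold)
        # Confidence grows with distance from the decision boundary.
        confidence = min(1.0, 0.5 + abs(x - self.threshold))
        return label, confidence


def make_stream(n, seed=0, concept=0.5):
    """Synthetic stream of (x, label) pairs; the label is 1 when x >= concept."""
    rng = random.Random(seed)
    xs = [rng.random() for _ in range(n)]
    return [(x, int(x >= concept)) for x in xs]


def run_stream(stream, classifiers, budget=0.2, confidence_threshold=0.7):
    """Query a label only when the chosen classifier is unsure and budget allows."""
    selector = UCB1Selector(len(classifiers))
    queried = 0
    for seen, (x, true_label) in enumerate(stream, start=1):
        arm = selector.select()
        predicted, confidence = classifiers[arm].predict_with_confidence(x)
        within_budget = queried / seen < budget
        if confidence < confidence_threshold and within_budget:
            queried += 1  # the true label is revealed only for queried instances
            # Assumed reward: 1 if the selected classifier was right, else 0.
            selector.update(arm, 1.0 if predicted == true_label else 0.0)
    return selector, queried


if __name__ == "__main__":
    pool = [ThresholdClassifier(0.5), ThresholdClassifier(0.9)]
    selector, queried = run_stream(make_stream(2000, seed=7), pool)
    print("queries per classifier:", selector.counts, "total queried:", queried)
```

In a full system the queried labels would also be used to update the ensemble members, and the reward could be replaced by any measure of a classifier's generalization ability. The sketch only shows how a bandit can decide which classifier drives each query under a label budget.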
Pages: 2763 - 2771
Number of pages: 9