Adaptive Ensemble Active Learning for Drifting Data Stream Mining

被引:0
|
作者
Krawczyk, Bartosz [1 ]
Cano, Alberto [1 ]
机构
[1] Virginia Commonwealth Univ, Dept Comp Sci, Richmond, VA 23284 USA
关键词
DIVERSITY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning from data streams is among the most vital contemporary fields in machine learning and data mining Streams pose new challenges to learning systems, due to their volume and velocity, as well as ever-changing nature caused by concept drift. Vast majority of works for data streams assume a fully supervised learning scenario, having an unrestricted access to class labels. This assumption does not hold in real-world applications, where obtaining ground truth is costly and time-consuming Therefore, we need to carefully select which instances should be labeled, as usually we are working under a strict label budget. In this paper, we propose a novel active learning approach based on ensemble algorithms that is capable of using multiple base classifiers during the label query process. It is a plug-in solution, capable of working with most of existing streaming ensemble classifiers. We realize this process as a Multi-Armed Bandit problem, obtaining an efficient and adaptive ensemble active learning procedure by selecting the most competent classifier from the pool for each query. In order to better adapt to concept drifts, we guide our instance selection by measuring the generalization capabilities of our classifiers. This adaptive solution leads not only to better instance selection under sparse access to class labels, but also to improved adaptation to various types of concept drift and increasing the diversity of the underlying ensemble classifier.
引用
收藏
页码:2763 / 2771
页数:9
相关论文
共 50 条
  • [11] Augmented Query Strategies for Active Learning in Stream Data Mining
    Faisal, Mustafa Amir
    Aung, Zeyar
    Woon, Wei Lee
    Svetinovic, Davor
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2014, PT III, 2014, 8836 : 431 - 438
  • [12] An Aggregate Ensemble for Mining Concept Drifting Data Streams with Noise
    Zhang, Peng
    Zhu, Xingquan
    Shi, Yong
    Wu, Xindong
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 1021 - +
  • [13] Active Learning With Drifting Streaming Data
    Zliobaite, Indre
    Bifet, Albert
    Pfahringer, Bernhard
    Holmes, Geoffrey
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (01) : 27 - 39
  • [14] Combining active learning with concept drift detection for data stream mining
    Krawczyk, Bartosz
    Pfahringer, Bernhard
    Wozniak, Michal
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2239 - 2244
  • [15] Combining Active Learning and Self-Labeling for Data Stream Mining
    Korycki, Lukasz
    Krawczyk, Bartosz
    [J]. PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS CORES 2017, 2018, 578 : 481 - 490
  • [16] Sleep Disorder Data Stream Classification Based on Classifiers Ensemble and Active Learning
    Cai, Liangming
    Datta, Rituparna
    Huang, Jingshan
    Dong, Shuai
    Du, Min
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 1432 - 1435
  • [17] Active Learning From Stream Data Using Optimal Weight Classifier Ensemble
    Zhu, Xingquan
    Zhang, Peng
    Lin, Xiaodong
    Shi, Yong
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2010, 40 (06): : 1607 - 1621
  • [18] Pyramid Stack Data Stream Mining for Handling Concept-drifting
    Xu, Zhuoran
    Hou, Cuiqin
    Xia, Yingju
    Sun, Jun
    Inakoshi, Hiroya
    Yugami, Nobuhiro
    [J]. PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 33 - 37
  • [19] How to adjust an ensemble size in stream data mining?
    Pietruczuk, Lena
    Rutkowski, Leszek
    Jaworski, Maciej
    Duda, Piotr
    [J]. INFORMATION SCIENCES, 2017, 381 : 46 - 54
  • [20] Online Active Learning for Drifting Data Streams
    Liu, Sanmin
    Xue, Shan
    Wu, Jia
    Zhou, Chuan
    Yang, Jian
    Li, Zhao
    Cao, Jie
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (01) : 186 - 200