Adaptive Ensemble Active Learning for Drifting Data Stream Mining

Cited by: 0
Authors
Krawczyk, Bartosz [1]
Cano, Alberto [1]
Affiliations
[1] Virginia Commonwealth Univ, Dept Comp Sci, Richmond, VA 23284 USA
Keywords
DIVERSITY;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Learning from data streams is among the most vital contemporary fields in machine learning and data mining. Streams pose new challenges to learning systems due to their volume and velocity, as well as their ever-changing nature caused by concept drift. The vast majority of works on data streams assume a fully supervised learning scenario with unrestricted access to class labels. This assumption does not hold in real-world applications, where obtaining ground truth is costly and time-consuming. Therefore, we need to carefully select which instances should be labeled, as we are usually working under a strict label budget. In this paper, we propose a novel active learning approach based on ensemble algorithms that is capable of using multiple base classifiers during the label query process. It is a plug-in solution, capable of working with most existing streaming ensemble classifiers. We realize this process as a Multi-Armed Bandit problem, obtaining an efficient and adaptive ensemble active learning procedure by selecting the most competent classifier from the pool for each query. In order to better adapt to concept drifts, we guide our instance selection by measuring the generalization capabilities of our classifiers. This adaptive solution leads not only to better instance selection under sparse access to class labels, but also to improved adaptation to various types of concept drift and to increased diversity of the underlying ensemble classifier.
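The bandit formulation mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes an epsilon-greedy bandit, a simple low-confidence query rule, and a reward of 1 when the selected classifier agrees with the queried label. All names (`EpsilonGreedyBandit`, `run_stream`, `budget`, `threshold`) are hypothetical, and the classifiers are modeled as plain callables that return a label and a confidence.

```python
import random


class EpsilonGreedyBandit:
    """Epsilon-greedy bandit with one arm per ensemble member (illustrative only)."""

    def __init__(self, n_arms, epsilon=0.1, seed=0):
        self.counts = [0] * n_arms      # number of rewards observed per arm
        self.values = [0.0] * n_arms    # running mean reward per arm
        self.epsilon = epsilon
        self.rng = random.Random(seed)

    def select(self):
        # Explore with probability epsilon, otherwise pick the best arm so far.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.counts))
        return max(range(len(self.values)), key=self.values.__getitem__)

    def update(self, arm, reward):
        # Incremental update of the mean reward for the chosen arm.
        self.counts[arm] += 1
        self.values[arm] += (reward - self.values[arm]) / self.counts[arm]


def run_stream(stream, classifiers, budget=0.2, threshold=0.7, seed=0):
    """Process (x, y) pairs; y is only used when a label is queried.

    classifiers: list of callables x -> (label, confidence). Assumed interface.
    budget: maximum fraction of seen instances that may be labeled.
    threshold: query the label when the chosen classifier is less confident.
    """
    bandit = EpsilonGreedyBandit(len(classifiers), seed=seed)
    seen = queried = 0
    for x, y in stream:
        seen += 1
        arm = bandit.select()
        label, confidence = classifiers[arm](x)
        within_budget = queried < budget * seen
        if within_budget and confidence < threshold:
            queried += 1
            # Assumed reward: 1 if the selected classifier was correct.
            bandit.update(arm, 1.0 if label == y else 0.0)
    return bandit, queried
```

In a real streaming setting the queried label would also be used to update the ensemble members, and the reward could instead reflect the generalization measure the abstract refers to; both are omitted here to keep the sketch short.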
Pages: 2763-2771
Page count: 9
Related papers
50 in total
  • [41] Research on Concept-Drifting Data Stream Based on Fuzzy Integral Ensemble Classifier System
    Zhang, Baoju
    Chen, Yidi
    Xue, Lei
    [J]. COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL III: SYSTEMS, 2020, 517 : 225 - 232
  • [42] Adaptive learning of an evolving cascade neo-fuzzy system in data stream mining tasks
    Bodyanskiy, Yevgeniy V.
    Tyshchenko, Oleksii K.
    Kopaliani, Daria S.
    [J]. EVOLVING SYSTEMS, 2016, 7 (02) : 107 - 116
  • [43] Active Weighted Aging Ensemble for drifted data stream classification
    Wozniak, Michal
    Zyblewski, Pawel
    Ksieniewicz, Pawel
    [J]. INFORMATION SCIENCES, 2023, 630 : 286 - 304
  • [44] Active Learning with Abstaining Classifiers for Imbalanced Drifting Data Streams
    Korycki, Lukasz
    Cano, Alberto
    Krawczyk, Bartosz
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 2334 - 2343
  • [45] Active Learning for Mining Big Data
    Jahan, Sadia
    Shatabda, Swakkhar
    Farid, Dewan Md
    [J]. 2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
  • [46] Learning concept-drifting data streams with random ensemble decision trees
    Li, Peipei
    Wu, Xindong
    Hu, Xuegang
    Wang, Hao
    [J]. NEUROCOMPUTING, 2015, 166 : 68 - 83
  • [47] Active Learning in Context-Driven Stream Mining With an Application to Image Mining
    Tekin, Cem
    van der Schaar, Mihaela
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (11) : 3666 - 3679
  • [48] Adaptive Active Learning with Ensemble of Learners and Multiclass Problems
    Czarnecki, Wojciech Marian
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT I, 2015, 9119 : 415 - 426
  • [49] Random Ensemble Decision Trees for Learning Concept-Drifting Data Streams
    Li, Peipei
    Wu, Xindong
    Liang, Qianhui
    Hu, Xuegang
    Zhang, Yuhong
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT I: 15TH PACIFIC-ASIA CONFERENCE, PAKDD 2011, 2011, 6634 : 313 - 325
  • [50] Robust ensemble learning for mining noisy data streams
    Zhang, Peng
    Zhu, Xingquan
    Shi, Yong
    Guo, Li
    Wu, Xindong
    [J]. DECISION SUPPORT SYSTEMS, 2011, 50 (02) : 469 - 479