Incremental Learning of Concept Drift from Streaming Imbalanced Data

被引:269
|
作者
Ditzler, Gregory [1 ]
Polikar, Robi [2 ]
机构
[1] Drexel Univ, Dept Elect & Comp Engn, Philadelphia, PA 19104 USA
[2] Rowan Univ, Dept Elect & Comp Engn, Glassboro, NJ 08028 USA
基金
美国国家科学基金会;
关键词
Incremental learning; concept drift; class imbalance; multiple classifier systems; TIME ADAPTIVE CLASSIFIERS; ENSEMBLE; MODELS;
D O I
10.1109/TKDE.2012.136
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning in nonstationary environments, also known as learning concept drift, is concerned with learning from data whose statistical characteristics change over time. Concept drift is further complicated if the data set is class imbalanced. While these two issues have been independently addressed, their joint treatment has been mostly underexplored. We describe two ensemble-based approaches for learning concept drift from imbalanced data. Our first approach is a logical combination of our previously introduced Learn(++).NSE algorithm for concept drift, with the well-established SMOTE for learning from imbalanced data. Our second approach makes two major modifications to Learn(++).NSE-SMOTE integration by replacing SMOTE with a subensemble that makes strategic use of minority class data; and replacing Learn(++).NSE and its class-independent error weighting mechanism with a penalty constraint that forces the algorithm to balance accuracy on all classes. The primary novelty of this approach is in determining the voting weights for combining ensemble members, based on each classifier's time and imbalance-adjusted accuracy on current and past environments. Favorable results in comparison to other approaches indicate that both approaches are able to address this challenging problem, each with its own specific areas of strength. We also release all experimental data as a resource and benchmark for future research.
引用
收藏
页码:2283 / 2301
页数:19
相关论文
共 50 条
  • [1] Dynamic Weighted Majority for Incremental Learning of Imbalanced Data Streams with Concept Drift
    Lu, Yang
    Cheung, Yiu-ming
    Tang, Yuan Yan
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2393 - 2399
  • [2] Incremental Bayesian Classifier for Streaming Data with Concept Drift
    Wu, Peng
    Xiong, Ning
    Li, Gang
    Lv, Jinrui
    [J]. ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 509 - 518
  • [3] Incremental learning imbalanced data streams with concept drift: The dynamic updated ensemble algorithm
    Li, Zeng
    Huang, Wenchao
    Xiong, Yan
    Ren, Siqi
    Zhu, Tuanfei
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 195
  • [4] Learning from streaming data with concept drift and imbalance: an overview
    T. Ryan Hoens
    Robi Polikar
    Nitesh V. Chawla
    [J]. Progress in Artificial Intelligence, 2012, 1 (1) : 89 - 101
  • [5] Learning from streaming data with concept drift and imbalance: an overview
    Hoens, T. Ryan
    Polikar, Robi
    Chawla, Nitesh V.
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE, 2012, 1 (01) : 89 - 101
  • [6] Imbalanced class incremental learning system: A task incremental diagnosis method for imbalanced industrial streaming data
    Shi, Mingkuan
    Ding, Chuancang
    Shen, Changqing
    Huang, Weiguo
    Zhu, Zhongkui
    [J]. ADVANCED ENGINEERING INFORMATICS, 2024, 62
  • [7] Autoencoder-based Anomaly Detection in Streaming Data with Incremental Learning and Concept Drift Adaptation
    Li, Jin
    Malialis, Kleanthis
    Polycarpou, Marios M.
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [8] Incremental Learning of Bayesian Networks from Concept-Drift Data
    Yu, Haibo
    [J]. 2019 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2019, : 701 - 704
  • [9] Active Learning Method for Imbalanced Concept Drift Data Stream
    Li, Yan-Hong
    Wang, Tian-Tian
    Wang, Su-Ge
    Li, De-Yu
    [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (03): : 589 - 606
  • [10] Deep learning framework for handling concept drift and class imbalanced complex decision-making on streaming data
    S. Priya
    R. Annie Uthra
    [J]. Complex & Intelligent Systems, 2023, 9 : 3499 - 3515