Dynamic Weighted Majority for Incremental Learning of Imbalanced Data Streams with Concept Drift

被引:0
|
作者
Lu, Yang [1 ]
Cheung, Yiu-ming [1 ,2 ]
Tang, Yuan Yan [3 ]
机构
[1] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
[2] HKBU Inst Res & Continuing Educ, Shenzhen, Peoples R China
[3] Univ Macau, Dept Comp & Informat Sci, Taipa, Macao, Peoples R China
基金
中国国家自然科学基金;
关键词
ENSEMBLE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Concept drifts occurring in data streams will jeopardize the accuracy and stability of the online learning process. If the data stream is imbalanced, it will be even more challenging to detect and cure the concept drift. In the literature, these two problems have been intensively addressed separately, but have yet to be well studied when they occur together. In this paper, we propose a chunk-based incremental learning method called Dynamic Weighted Majority for Imbalance Learning (DWMIL) to deal with the data streams with concept drift and class imbalance problem. DWMIL utilizes an ensemble framework by dynamically weighting the base classifiers according to their performance on the current data chunk. Compared with the existing methods, its merits are four-fold: (1) it can keep stable for non-drifted streams and quickly adapt to the new concept; (2) it is totally incremental, i.e. no previous data needs to be stored; (3) it keeps a limited number of classifiers to ensure high efficiency; and (4) it is simple and needs only one thresholding parameter. Experiments on both synthetic and real data sets with concept drift show that DWMIL performs better than the state-of-the-art competitors, with less computational cost.
引用
收藏
页码:2393 / 2399
页数:7
相关论文
共 50 条
  • [31] A novel online ensemble approach to handle concept drifting data streams: diversified dynamic weighted majority
    Sidhu, Parneeta
    Bhatia, M. P. S.
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (01) : 37 - 61
  • [32] A novel online ensemble approach to handle concept drifting data streams: diversified dynamic weighted majority
    Parneeta Sidhu
    M. P. S. Bhatia
    [J]. International Journal of Machine Learning and Cybernetics, 2018, 9 : 37 - 61
  • [33] An Active Learning Method for Data Streams with Concept Drift
    Park, Cheong Hee
    Kang, Youngsoon
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 746 - 752
  • [34] Active Learning Method for Imbalanced Concept Drift Data Stream
    Li Y.-H.
    Wang T.-T.
    Wang S.-G.
    Li D.-Y.
    [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (03): : 589 - 606
  • [35] Double Weighted Methodology: A weighted ensemble approach to handle concept drift in data streams
    Sidhu, Parneeta
    Bhatia, M. P. S.
    Ravi, Abhishek
    Jherwal, Kirti
    [J]. 2015 IEEE 2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN INFORMATION SYSTEMS (RETIS), 2015, : 114 - 119
  • [36] Incremental entropy-based clustering on categorical data streams with concept drift
    Li, Yanhong
    Li, Deyu
    Wang, Suge
    Zhai, Yanhui
    [J]. KNOWLEDGE-BASED SYSTEMS, 2014, 59 : 33 - 47
  • [37] On learning guarantees to unsupervised concept drift detection on data streams
    de Mello, Rodrigo F.
    Vaz, Yule
    Grossi, Carlos H.
    Bifet, Albert
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 117 : 90 - 102
  • [38] Learning Parameter Distributions to Detect Concept Drift in Data Streams
    Haug, Johannes
    Kasneci, Gjergji
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9452 - 9459
  • [39] Learning Decision Trees from Data Streams with Concept Drift
    Jankowski, Dariusz
    Jackowski, Konrad
    Cyganek, Boguslaw
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 : 1682 - 1691
  • [40] G-mean Weighted Classification Method for Imbalanced Data Stream with Concept Drift
    Liang B.
    Li G.
    Dai C.
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (12): : 2844 - 2857