Ensemble model and algorithm with recalling and forgetting mechanisms for data stream mining

被引:0
|
作者
Zhao, Qiang-Li [1 ]
Jiang, Yan-Huang [2 ]
Lu, Yu-Tong [2 ]
机构
[1] School of Computer and Information Engineering, Hu'nan University of Commerce, Changsha,410205, China
[2] State Key Laboratory of High Performance Computing (National University of Defense Technology), Changsha,410073, China
来源
Ruan Jian Xue Bao/Journal of Software | 2015年 / 26卷 / 10期
关键词
D O I
10.13328/j.cnki.jos.004747
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Using ensemble of classifiers on sequential chunks of training instances is a popular strategy for data stream mining with concept drifts. Aiming at the limitations of existing approaches, this paper introduces human recalling and forgetting mechanisms into a data stream mining system, and proposes a memorizing based data stream mining (MDSM) model. The model considers base classifiers as learned knowledge. Through recalling and forgetting mechanism, most useful classifiers in the past will be reserved in a memory repository, which improves the stability under random concept drifts. The best classifiers for the current data chunk are selected for prediction, which achieves high adaptability for different concept drifts. Based on MSDM, the paper puts forward a new algorithm MAE (memorizing based adaptive ensemble). MAE uses Ebbinghaus forgetting curve as forgetting mechanism and adopts ensemble pruning to emulate the recalling mechanism. Compared with four traditional data stream mining approaches, the results show that MAE achieves high and stable accuracy with moderate training time. The results also proved that MAE has good adaptability for different kinds of concept drifts, especially for the applications with recurring or complex concept drifts. © Copyright 2015, Institute of Software, the Chinese Academy of Sciences. All rights reserved.
引用
收藏
页码:2567 / 2580
相关论文
共 50 条
  • [1] Ensemble based Data Stream Mining with Recalling and Forgetting Mechanisms
    Jiang, Yanhuang
    Zhao, Qiangli
    Lu, Yutong
    [J]. 2014 11TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2014, : 430 - 435
  • [2] Kappa Updated Ensemble for drifting data stream mining
    Cano, Alberto
    Krawczyk, Bartosz
    [J]. MACHINE LEARNING, 2020, 109 (01) : 175 - 218
  • [3] Kappa Updated Ensemble for drifting data stream mining
    Alberto Cano
    Bartosz Krawczyk
    [J]. Machine Learning, 2020, 109 : 175 - 218
  • [4] How to adjust an ensemble size in stream data mining?
    Pietruczuk, Lena
    Rutkowski, Leszek
    Jaworski, Maciej
    Duda, Piotr
    [J]. INFORMATION SCIENCES, 2017, 381 : 46 - 54
  • [5] A New Gradual Forgetting Approach for Mining Data Stream with Concept Drift
    Li, Yingrong
    Wei, Yang
    Kolesnikova, Anastasiya
    Lee, Won Don
    [J]. ISISE 2008: INTERNATIONAL SYMPOSIUM ON INFORMATION SCIENCE AND ENGINEERING, VOL 1, 2008, : 556 - 559
  • [6] Adaptive Ensemble Active Learning for Drifting Data Stream Mining
    Krawczyk, Bartosz
    Cano, Alberto
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2763 - 2771
  • [7] Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining
    Jiang, Yanhuang
    Zhao, Qiangli
    Lu, Yutong
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [8] A Method for Automatic Adjustment of Ensemble Size in Stream Data Mining
    Pietruczuk, Lena
    Rutkowski, Leszek
    Jaworski, Maciej
    Duda, Piotr
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 9 - 15
  • [9] Online ensemble learning algorithm for imbalanced data stream
    Hongle, Du
    Yan, Zhang
    Gang, Ke
    Lin, Zhang
    Chen, Yeh-Cheng
    [J]. APPLIED SOFT COMPUTING, 2021, 107
  • [10] An Algorithm for Mining Frequent Closed Itemsets in Data Stream
    Dai, Caiyan
    Chen, Ling
    [J]. 2010 INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT (CCCM2010), VOL I, 2010, : 281 - 284