Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining

被引:0
|
作者
Jiang, Yanhuang [1 ,2 ]
Zhao, Qiangli [1 ,3 ]
Lu, Yutong [1 ,2 ]
机构
[1] Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha 410073, Hunan, Peoples R China
[2] Natl Univ Def Technol, Sch Comp Sci, Changsha 410073, Hunan, Peoples R China
[3] Hunan Univ Commerce, Sch Comp & Informat Engn, Changsha 410205, Hunan, Peoples R China
基金
国家高技术研究发展计划(863计划); 中国国家自然科学基金;
关键词
STATISTICAL COMPARISONS; CLASSIFIERS; DRIFT;
D O I
10.1155/2015/874032
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Combining several classifiers on sequential chunks of training instances is a popular strategy for data stream mining with concept drifts. This paper introduces human recalling and forgetting mechanisms into a data stream mining system and proposes a Memorizing Based Data Stream Mining (MDSM) model. In this model, each component classifier is regarded as a piece of knowledge that a human obtains through learning some materials and has a memory retention value reflecting its usefulness in the history. The classifiers with high memory retention values are reserved in a "knowledge repository." When a new data chunk comes, most useful classifiers will be selected (recalled) from the repository and compose the current target ensemble. Based on MDSM, we put forward a new algorithm, MAE (Memorizing Based Adaptive Ensemble), which uses Ebbinghaus forgetting curve as the forgetting mechanism and adopts ensemble pruning as the recalling mechanism. Compared with four popular data stream mining approaches on the datasets with different concept drifts, the experimental results show that MAE achieves high and stable predicting accuracy, especially for the applications with recurring or complex concept drifts. The results also prove the effectiveness of MDSM model.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Effective Data Stream Mining using Ensemble on Cloud with Load balancing (E2CL)
    Kathirvel, Jagadheeswaran
    Parasuraman, Elango
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATIONS TECHNOLOGIES (ICCCT 15), 2015, : 383 - 386
  • [42] Data Stream Mining in Fog Computing Environment with Feature Selection Using Ensemble of Swarm Search Algorithms
    Ma, Bin Bin
    Fong, Simon
    Millham, Richard
    [J]. 2018 CONFERENCE ON INFORMATION COMMUNICATIONS TECHNOLOGY AND SOCIETY (ICTAS), 2018,
  • [43] Stream mining: a novel architecture for ensemble-based classification
    Valerio Grossi
    Franco Turini
    [J]. Knowledge and Information Systems, 2012, 30 : 247 - 281
  • [44] Stream mining: a novel architecture for ensemble-based classification
    Grossi, Valerio
    Turini, Franco
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 30 (02) : 247 - 281
  • [45] Ensemble classifier for mining data streams
    Czarnowski, Ireneusz
    Jedrzejowicz, Piotr
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 18TH ANNUAL CONFERENCE, KES-2014, 2014, 35 : 397 - 406
  • [46] Robust ensemble learning for data mining
    Rätsch, G
    Schölkopf, B
    Smola, AJ
    Mika, S
    Onoda, T
    Müller, KR
    [J]. KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS: CURRENT ISSUES AND NEW APPLICATIONS, 2000, 1805 : 341 - 344
  • [47] Data Stream Mining: Challenges and Techniques
    Khan, Latifur
    [J]. 22ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2010), PROCEEDINGS, VOL 2, 2010, : 295 - 295
  • [48] IoT Big Data Stream Mining
    Morales, Gianmarco De Francisci
    Bifet, Albert
    Khan, Latifur
    Gama, Joao
    Fan, Wei
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 2119 - 2120
  • [49] Stream mining on univariate uncertain data
    Liu, Ying-Ho
    [J]. APPLIED INTELLIGENCE, 2013, 39 (02) : 315 - 344
  • [50] Data Stream Mining: the Bounded Rationality
    Gama, Joao
    [J]. INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2013, 37 (01): : 21 - 26