Using a classifier pool in accuracy based tracking of recurring concepts in data stream classification

被引:24
|
作者
Hosseini, Mohammad Javad [1 ]
Ahmadi, Zahra [1 ]
Beigy, Hamid [1 ]
机构
[1] Sharif Univ Technol, Dept Comp Engn, Tehran, Iran
关键词
Recurring concepts; Concept drift; Stream mining; Ensemble learning;
D O I
10.1007/s12530-012-9064-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data streams have some unique properties which make them applicable in precise modeling of many real data mining applications. The most challenging property of data streams is the occurrence of "concept drift''. Recurring concepts is a type of concept drift which can be seen in most of real world problems. Detecting recurring concepts makes it possible to exploit previous knowledge obtained in the learning process. This leads to quick adaptation of the learner whenever a concept reappears. In this paper, we propose a learning algorithm called Pool and Accuracy based Stream Classification with some variations, which takes the advantage of maintaining a pool of classifiers to track recurring concepts. Each classifier is used to describe an existing concept. Consecutive batches of instances are first classified by the pool of classifiers. Two approaches are presented for this task: active classifier and weighted classifiers methods. Then the true labels are revealed and the pool is updated at the end of the batch. Updating the pool is done using one of the following methods: exact Bayesian, Bayesian and Heuristic. As the algorithm may assign multiple classifiers to a single concept, a classifier merging process is used to resolve this problem. Experimental results on real and artificial datasets show the effectiveness of weighted classifiers method while dealing with sudden concept drifting datasets. In addition, the proposed updating methods outperform the existing algorithms in datasets with arbitrary attributes. Finally some performed experiments represent superiority of using merging process in large datasets.
引用
收藏
页码:43 / 60
页数:18
相关论文
共 50 条
  • [1] Data Stream Classification based on an Associative Classifier
    Lopez-Medina, Karen Pamela
    Uriarte-Arcia, Abril Valeria
    Yanez-Marquez, Cornelio
    [J]. COMPUTACION Y SISTEMAS, 2024, 28 (02): : 387 - 400
  • [2] Data Stream Classification Based on the Gamma Classifier
    Valeria Uriarte-Arcia, Abril
    Lopez-Yanez, Itzama
    Yanez-Marquez, Cornelio
    Gama, Joao
    Camacho-Nieto, Oscar
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [3] A clustering and ensemble based classifier for data stream classification
    Wankhade, Kapil K.
    Jondhale, Kalpana C.
    Dongre, Snehlata S.
    [J]. APPLIED SOFT COMPUTING, 2021, 102
  • [4] PGNBC: Pearson Gaussian Naive Bayes classifier for data stream classification with recurring concept drift
    Babu, D. Kishore
    Ramadevi, Y.
    Ramana, K. V.
    [J]. INTELLIGENT DATA ANALYSIS, 2017, 21 (05) : 1173 - 1191
  • [5] A New Semi-supervised Learning Based Ensemble Classifier for Recurring Data Stream
    Zhang, Bo
    Chen, Dingfang
    Zu, Qiaohong
    Mao, Yichao
    Pan, Yi
    Zhang, Xiaomin
    [J]. PERVASIVE COMPUTING AND THE NETWORKED WORLD, 2014, 8351 : 759 - +
  • [6] RGNBC: Rough Gaussian Na⟨ve Bayes Classifier for Data Stream Classification with Recurring Concept Drift
    Babu, D. Kishore
    Ramadevi, Y.
    Ramana, K. V.
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2017, 42 (02) : 705 - 714
  • [7] RGNBC: Rough Gaussian Naïve Bayes Classifier for Data Stream Classification with Recurring Concept Drift
    D. Kishore Babu
    Y. Ramadevi
    K. V. Ramana
    [J]. Arabian Journal for Science and Engineering, 2017, 42 : 705 - 714
  • [8] Tracking Recurring Concepts from Evolving Data Streams using Ensemble Method
    Sun, Yange
    Wang, Zhihai
    Yuan, Jidong
    Zhang, Wei
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2019, 16 (06) : 1044 - 1052
  • [9] Classifier Ensemble for Uncertain Data Stream Classification
    Pan, Shirui
    Wu, Kuan
    Zhang, Yang
    Li, Xue
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT I, PROCEEDINGS, 2010, 6118 : 488 - +
  • [10] Diversity-Based Pool of Models for Dealing with Recurring Concepts
    Chiu, Chun Wai
    Minku, Leandro L.
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,