Mining Concept-Drifting and Noisy Data Streams using Ensemble Classifiers

被引:11
|
作者
Ouyang, Zhenzheng [1 ]
Zhou, Min [1 ]
Wang, Tao [3 ]
Wu, Quanyuan [2 ]
机构
[1] Natl Univ Def Technol, Sch Sci, Changsha 410073, Hunan, Peoples R China
[2] Natl Univ Def Technol, Sch Comp, Changsha 410073, Hunan, Peoples R China
[3] Nanjing Army Command Coll, Dept 2, Nanjing 210045, Jiangsu, Peoples R China
关键词
D O I
10.1109/AICI.2009.153
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining concept drifting data stream is a challenging area for data mining research. Recent years have witnessed an averaging ensemble classifier which is based on the learnable assumption, although this ensemble classifier is an efficient algorithm for mining concept-drifting data streams, it is still inadequate to represent real-world data streams with noisy data. In this paper, we propose a novel ensemble classifier framework for mining concept-drifting data streams with noise. The method, called WEAP-I, which trains a weighted ensemble classifier on the most n data chunks and trains an averaging ensemble classifier on the most recent data chunk. All the base classifiers are combined to form the WEAP-I ensemble classifier. Our theoretical and empirical study shows that our framework is superior and more robust to averaging ensemble for noisy data streams.
引用
收藏
页码:360 / +
页数:2
相关论文
共 50 条
  • [1] An Ensemble of Classifiers Algorithm Based on GA for Handling Concept-Drifting Data Streams
    Guan, Jinghua
    Guo, Wu
    Chen, Heng
    Lou, Oujun
    [J]. 2014 SIXTH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP), 2014, : 282 - 284
  • [2] Mining multi-dimensional concept-drifting data streams using Bayesian network classifiers
    Borchani, Hanen
    Larranaga, Pedro
    Gama, Joao
    Bielza, Concha
    [J]. INTELLIGENT DATA ANALYSIS, 2016, 20 (02) : 257 - 280
  • [3] Mining Multi-label Concept-Drifting Data Streams Using Dynamic Classifier Ensemble
    Qu, Wei
    Zhang, Yang
    Zhu, Junping
    Qiu, Qiang
    [J]. ADVANCES IN MACHINE LEARNING, PROCEEDINGS, 2009, 5828 : 308 - 321
  • [4] Ambiguous decision trees for mining concept-drifting data streams
    Liu, Jing
    Li, Xue
    Zhong, Weicai
    [J]. PATTERN RECOGNITION LETTERS, 2009, 30 (15) : 1347 - 1355
  • [5] On reducing classifier granularity in mining concept-drifting data streams
    Wang, P
    Wang, HX
    Wu, XC
    Wang, W
    Shi, BL
    [J]. FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 474 - 481
  • [6] Random Ensemble Decision Trees for Learning Concept-Drifting Data Streams
    Li, Peipei
    Wu, Xindong
    Liang, Qianhui
    Hu, Xuegang
    Zhang, Yuhong
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT I: 15TH PACIFIC-ASIA CONFERENCE, PAKDD 2011, 2011, 6634 : 313 - 325
  • [7] An adaptive distributed ensemble approach to mine concept-drifting data streams
    Folino, Gianluigi
    Pizzuti, Clara
    Spezzano, Giandomenico
    [J]. 19TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, VOL II, PROCEEDINGS, 2007, : 183 - 187
  • [8] Learning concept-drifting data streams with random ensemble decision trees
    Li, Peipei
    Wu, Xindong
    Hu, Xuegang
    Wang, Hao
    [J]. NEUROCOMPUTING, 2015, 166 : 68 - 83
  • [9] Mining Concept-Drifting Data Streams Containing Labeled and Unlabeled Instances
    Borchani, Hanen
    Larranaga, Pedro
    Bielza, Concha
    [J]. TRENDS IN APPLIED INTELLIGENT SYSTEMS, PT I, PROCEEDINGS, 2010, 6096 : 531 - 540
  • [10] A general framework for mining concept-drifting data streams with evolvable features
    Peng, Jiaqi
    Guo, Jinxia
    Yang, Qinli
    Lu, Jianyun
    Shao, Junmming
    [J]. 2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, : 1276 - 1281