Random Ensemble Decision Trees for Learning Concept-Drifting Data Streams

被引:0
|
作者
Li, Peipei [1 ]
Wu, Xindong [1 ,2 ]
Liang, Qianhui [3 ]
Hu, Xuegang [1 ]
Zhang, Yuhong [1 ]
机构
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R China
[2] Univ Vermont, Dept Comp Sci, Burlington, VT 05405 USA
[3] Hewlett Packard Labs, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
Data Stream; Random Decision Tree; Concept Drift; Noise;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few online classification algorithms based on traditional inductive ensembling focus on handling concept drifting data streams while performing well on noisy data. Motivated by this, an incremental algorithm based on random Ensemble Decision Trees for Concept-drifting data streams (EDTC) is proposed in this paper. Three variants of random feature selection are developed to implement split-tests. To better track concept drifts in data streams with noisy data, an improved two-threshold-based drifting detection mechanism is introduced. Extensive studies demonstrate that our algorithm performs very well compared to several known online algorithms based on single models and ensemble models. A conclusion is hence drawn that multiple solutions are provided for learning from concept drifting data streams with noise.
引用
收藏
页码:313 / 325
页数:13
相关论文
共 50 条
  • [1] Learning concept-drifting data streams with random ensemble decision trees
    Li, Peipei
    Wu, Xindong
    Hu, Xuegang
    Wang, Hao
    [J]. NEUROCOMPUTING, 2015, 166 : 68 - 83
  • [2] Ambiguous decision trees for mining concept-drifting data streams
    Liu, Jing
    Li, Xue
    Zhong, Weicai
    [J]. PATTERN RECOGNITION LETTERS, 2009, 30 (15) : 1347 - 1355
  • [3] Mining Concept-Drifting Data Streams with Multiple Semi-Random Decision Trees
    Li, Peipei
    Hu, Xuegang
    Wu, Xindong
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2008, 5139 : 733 - 740
  • [4] Mining Concept-Drifting and Noisy Data Streams using Ensemble Classifiers
    Ouyang, Zhenzheng
    Zhou, Min
    Wang, Tao
    Wu, Quanyuan
    [J]. 2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL IV, PROCEEDINGS, 2009, : 360 - +
  • [5] An adaptive distributed ensemble approach to mine concept-drifting data streams
    Folino, Gianluigi
    Pizzuti, Clara
    Spezzano, Giandomenico
    [J]. 19TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, VOL II, PROCEEDINGS, 2007, : 183 - 187
  • [6] Prototype-based Learning on Concept-drifting Data Streams
    Shao, Junming
    Ahmadi, Zahra
    Kramer, Stefan
    [J]. PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 412 - 421
  • [7] An Ensemble of Classifiers Algorithm Based on GA for Handling Concept-Drifting Data Streams
    Guan, Jinghua
    Guo, Wu
    Chen, Heng
    Lou, Oujun
    [J]. 2014 SIXTH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP), 2014, : 282 - 284
  • [8] Concept Drifting Detection on Noisy Streaming Data in Random Ensemble Decision Trees
    Li, Peipei
    Hu, Xuegang
    Liang, Qianghui
    Gao, Yunjun
    [J]. MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, 2009, 5632 : 236 - +
  • [9] An Algorithm for Anticipating Future Decision Trees from Concept-Drifting Data
    Boettcher, Mirko
    Spott, Martin
    Kruse, Rudolf
    [J]. RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXV, 2009, : 293 - +
  • [10] Selective prototype-based learning on concept-drifting data streams
    Chen, Dongzi
    Yang, Qinli
    Liu, Jiaming
    Zeng, Zhu
    [J]. INFORMATION SCIENCES, 2020, 516 : 20 - 32