An adaptive distributed ensemble approach to mine concept-drifting data streams

Cited by: 17
Authors
Folino, Gianluigi [1]
Pizzuti, Clara [1]
Spezzano, Giandomenico [1]
Affiliations
[1] CNR, ICAR, I-87036 Arcavacata Di Rende, CS, Italy
DOI
10.1109/ICTAI.2007.51
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
An adaptive boosting ensemble algorithm for classifying homogeneous distributed data streams is presented. The method builds an ensemble of classifiers by using Genetic Programming (GP) to inductively generate decision trees, each trained on a different part of the distributed training set. The approach adopts a co-evolutionary platform to support a cooperative model of GP. A change detection strategy, based on the self-similarity of the ensemble behavior as measured by its fractal dimension, captures time-evolving trends and patterns in the stream and reveals changes in evolving data streams. The approach tracks the deviation of the ensemble's online accuracy over time and recomputes the ensemble when the deviation exceeds a pre-specified threshold. This maintains an accurate and up-to-date ensemble of classifiers for continuous flows of data with concept drift. Experimental results on a real-life data set show the validity of the approach.
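The threshold-triggered rebuild described in the abstract can be illustrated with a minimal sketch. This is not the authors' method: it replaces the fractal-dimension measure with a plain sliding-window accuracy, and the class name `AccuracyDriftMonitor`, the function `simulate`, and the parameters `window_size` and `threshold` are all hypothetical names chosen for illustration.

```python
from collections import deque


class AccuracyDriftMonitor:
    """Flags drift when windowed accuracy deviates from a reference level.

    Simplified stand-in: uses plain windowed accuracy, not the
    fractal-dimension measure named in the abstract.
    """

    def __init__(self, window_size=50, threshold=0.15):
        self.window = deque(maxlen=window_size)  # most recent hit/miss outcomes
        self.threshold = threshold               # pre-specified deviation limit
        self.reference = None                    # accuracy of the first full window

    def update(self, correct):
        """Record one prediction outcome; return True if a rebuild is advised."""
        self.window.append(1.0 if correct else 0.0)
        if len(self.window) < self.window.maxlen:
            return False  # not enough evidence yet
        accuracy = sum(self.window) / len(self.window)
        if self.reference is None:
            self.reference = accuracy  # freeze the baseline
            return False
        if abs(accuracy - self.reference) > self.threshold:
            # The caller is expected to retrain the ensemble here;
            # monitoring restarts from an empty window.
            self.window.clear()
            self.reference = None
            return True
        return False


def simulate():
    """Feed a synthetic stream: 100 correct predictions, then 100 wrong ones."""
    monitor = AccuracyDriftMonitor(window_size=20, threshold=0.25)
    outcomes = [True] * 100 + [False] * 100
    return [i for i, ok in enumerate(outcomes) if monitor.update(ok)]


if __name__ == "__main__":
    print(simulate())
```

On this synthetic stream the monitor raises a single alarm shortly after the abrupt change at index 100, then settles on a new baseline. A real deployment would call the ensemble's retraining routine wherever `update` returns `True`.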
Pages: 183-187
Page count: 5
Related papers
50 in total
  • [1] Generalized CMAC Adaptive Ensembles for Concept-Drifting Data Streams
    Gonzalez-Serrano, Francisco J.
    Figueiras-Vidal, Anibal R.
    [J]. 2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 2669 - 2673
  • [2] Random Ensemble Decision Trees for Learning Concept-Drifting Data Streams
    Li, Peipei
    Wu, Xindong
    Liang, Qianhui
    Hu, Xuegang
    Zhang, Yuhong
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT I: 15TH PACIFIC-ASIA CONFERENCE, PAKDD 2011, 2011, 6634 : 313 - 325
  • [3] Mining Concept-Drifting and Noisy Data Streams using Ensemble Classifiers
    Ouyang, Zhenzheng
    Zhou, Min
    Wang, Tao
    Wu, Quanyuan
    [J]. 2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL IV, PROCEEDINGS, 2009, : 360 - +
  • [4] ADAPTIVE DATA REUSE FOR CLASSIFYING IMBALANCED AND CONCEPT-DRIFTING DATA STREAMS
    Nguyen, Hien M.
    Cooper, Eric W.
    Kamei, Katsuari
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (7B) : 4995 - 5010
  • [5] Learning concept-drifting data streams with random ensemble decision trees
    Li, Peipei
    Wu, Xindong
    Hu, Xuegang
    Wang, Hao
    [J]. NEUROCOMPUTING, 2015, 166 : 68 - 83
  • [6] An Ensemble of Classifiers Algorithm Based on GA for Handling Concept-Drifting Data Streams
    Guan, Jinghua
    Guo, Wu
    Chen, Heng
    Lou, Oujun
    [J]. 2014 SIXTH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP), 2014, : 282 - 284
  • [7] Granularity adaptive density estimation and on demand clustering of concept-drifting data streams
    Zhu, Weiheng
    Pei, Jian
    Yin, Jian
    Xie, Yihuang
    [J]. DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4081 : 322 - 331
  • [8] An adaptive ensemble classifier for mining concept drifting data streams
    Farid, Dewan Md.
    Zhang, Li
    Hossain, Alamgir
    Rahman, Chowdhury Mofizur
    Strachan, Rebecca
    Sexton, Graham
    Dahal, Keshav
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (15) : 5895 - 5906
  • [9] An efficient and sensitive decision tree approach to mining concept-drifting data streams
    Tsai, Cheng-Jung
    Lee, Chien-I
    Yang, Wei-Pang
    [J]. INFORMATICA, 2008, 19 (01) : 135 - 156
  • [10] FedStream: Prototype-Based Federated Learning on Distributed Concept-Drifting Data Streams
    Mawuli, Cobbinah B.
    Che, Liwei
    Kumar, Jay
    Din, Salah Ud
    Qin, Zhili
    Yang, Qinli
    Shao, Junming
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (11) : 7112 - 7124