EACD: evolutionary adaptation to concept drifts in data streams

被引:0
|
作者
Hossein Ghomeshi
Mohamed Medhat Gaber
Yevgeniya Kovalchuk
机构
[1] Birmingham City University,School of Computing and Digital Technology
来源
关键词
Data streams; Ensemble learning; Concept drifts; Evolutionary algorithms; Genetic algorithm; Non-stationary environments;
D O I
暂无
中图分类号
学科分类号
摘要
This paper presents a novel ensemble learning method based on evolutionary algorithms to cope with different types of concept drifts in non-stationary data stream classification tasks. In ensemble learning, multiple learners forming an ensemble are trained to obtain a better predictive performance compared to that of a single learner, especially in non-stationary environments, where data evolve over time. The evolution of data streams can be viewed as a problem of changing environment, and evolutionary algorithms offer a natural solution to this problem. The method proposed in this paper uses random subspaces of features from a pool of features to create different classification types in the ensemble. Each such type consists of a limited number of classifiers (decision trees) that have been built at different times over the data stream. An evolutionary algorithm (replicator dynamics) is used to adapt to different concept drifts; it allows the types with a higher performance to increase and those with a lower performance to decrease in size. Genetic algorithm is then applied to build a two-layer architecture based on the proposed technique to dynamically optimise the combination of features in each type to achieve a better adaptation to new concepts. The proposed method, called EACD, offers both implicit and explicit mechanisms to deal with concept drifts. A set of experiments employing four artificial and five real-world data streams is conducted to compare its performance with that of the state-of-the-art algorithms using the immediate and delayed prequential evaluation methods. The results demonstrate favourable performance of the proposed EACD method in different environments.
引用
收藏
页码:663 / 694
页数:31
相关论文
共 50 条
  • [1] EACD: evolutionary adaptation to concept drifts in data streams
    Ghomeshi, Hossein
    Gaber, Mohamed Medhat
    Kovalchuk, Yevgeniya
    DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (03) : 663 - 694
  • [2] Recurrent Concept Drifts on Data Streams
    Gunasekara, Nuwan
    Pfahringer, Bernhard
    Gomes, Heitor Murilo
    Bifet, Albert
    Koh, Yun Sing
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 8029 - 8037
  • [3] Exploiting fractal dimension and a distributed evolutionary approach to classify data streams with concept drifts
    Folino, Gianluigi
    Guarascio, Massimo
    Papuzzo, Giuseppe
    APPLIED SOFT COMPUTING, 2019, 75 : 284 - 297
  • [4] Data streams classification method handling concept drifts
    Tianjin Key Laboratory of Intelligence Computing and Novel Software Technology, Tianjin University of Technology, Tianjin , China
    不详
    J. Inf. Comput. Sci., 15 (5427-5435):
  • [5] An Efficient Approach to Detect Concept Drifts in Data Streams
    Jadhav, Aditee
    Deshpande, Leena
    2017 7TH IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2017, : 28 - 32
  • [6] Mining data streams with concept drifts using genetic algorithm
    Vivekanandan, Periasamy
    Nedunchezhian, Raju
    ARTIFICIAL INTELLIGENCE REVIEW, 2011, 36 (03) : 163 - 178
  • [7] Mining decision rules on data streams in the presence of concept drifts
    Tsai, Cheng-Jung
    Lee, Chien-I.
    Yang, Wei-Pang
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 1164 - 1178
  • [8] Classifying Data Streams with Skewed Class Distributions and Concept Drifts
    Gao, Jing
    Ding, Bolin
    Han, Jiawei
    Fan, Wei
    Yu, Philip S.
    IEEE INTERNET COMPUTING, 2008, 12 (06) : 37 - 49
  • [9] Mining data streams with concept drifts using genetic algorithm
    Periasamy Vivekanandan
    Raju Nedunchezhian
    Artificial Intelligence Review, 2011, 36 : 163 - 178
  • [10] Handling Different Categories of Concept Drifts in Data Streams Using Distributed GP
    Folino, Gianluigi
    Papuzzo, Giuseppe
    GENETIC PROGRAMMING, PROCEEDINGS, 2010, 6021 : 74 - 85