Exploiting fractal dimension and a distributed evolutionary approach to classify data streams with concept drifts

被引:4
|
作者
Folino, Gianluigi [1 ]
Guarascio, Massimo [1 ]
Papuzzo, Giuseppe [1 ]
机构
[1] Univ Calabria, ICAR, CNR, Via P Bucci 7-C, I-87036 Arcavacata Di Rende, CS, Italy
关键词
Classification (of information) - Genetic algorithms - Genetic programming;
D O I
10.1016/j.asoc.2018.11.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Evolutionary algorithms, i.e., Genetic Programming (GP), have been successfully used for the task of classification, mainly because they are less likely to get stuck in the local optimum, can operate on chunks of data and allow to compute more solutions in parallel. Ensemble techniques are usually more accurate than component learners constituting the ensemble and can be built in an incremental way, improving flexibility, adapting to changes and maintaining part of the history present in the data. This paper proposes a framework based on a distributed GP ensemble algorithm for coping with different types of concept drift for the task of classification of large data streams. The framework is able to detect changes in a very efficient way using only a detection function based on the fractal dimension, which can also works on new incoming unclassified data. Thus, a distributed GP algorithm is performed only when a change is detected in order to improve classification accuracy and this, together with the exploitation of an adaptive procedure, permits to answer in short time to these changes. Experiments are conducted on a real and on some artificial datasets in order to assess the capacity of the framework to detect the drift and quickly respond to it. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:284 / 297
页数:14
相关论文
共 50 条
  • [1] EACD: evolutionary adaptation to concept drifts in data streams
    Ghomeshi, Hossein
    Gaber, Mohamed Medhat
    Kovalchuk, Yevgeniya
    DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (03) : 663 - 694
  • [2] EACD: evolutionary adaptation to concept drifts in data streams
    Hossein Ghomeshi
    Mohamed Medhat Gaber
    Yevgeniya Kovalchuk
    Data Mining and Knowledge Discovery, 2019, 33 : 663 - 694
  • [3] An Efficient Approach to Detect Concept Drifts in Data Streams
    Jadhav, Aditee
    Deshpande, Leena
    2017 7TH IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2017, : 28 - 32
  • [4] Recurrent Concept Drifts on Data Streams
    Gunasekara, Nuwan
    Pfahringer, Bernhard
    Gomes, Heitor Murilo
    Bifet, Albert
    Koh, Yun Sing
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 8029 - 8037
  • [5] Handling Different Categories of Concept Drifts in Data Streams Using Distributed GP
    Folino, Gianluigi
    Papuzzo, Giuseppe
    GENETIC PROGRAMMING, PROCEEDINGS, 2010, 6021 : 74 - 85
  • [6] Data streams classification method handling concept drifts
    Tianjin Key Laboratory of Intelligence Computing and Novel Software Technology, Tianjin University of Technology, Tianjin , China
    不详
    J. Inf. Comput. Sci., 15 (5427-5435):
  • [7] StreamGP: Tracking Evolving GP Ensembles in Distributed Data Streams using Fractal Dimension
    Folino, Gianluigi
    Pizzuti, Clara
    Spezzano, Giandomenico
    GECCO 2007: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOL 1 AND 2, 2007, : 1751 - 1751
  • [8] Structure discovery in semantically distributed data sites: The fractal dimension approach
    Sadeghian, P
    Kantardzic, M
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL V, PROCEEDINGS: COMPUTER SCIENCE AND ENGINEERING, 2004, : 248 - 253
  • [9] Mining data streams with concept drifts using genetic algorithm
    Vivekanandan, Periasamy
    Nedunchezhian, Raju
    ARTIFICIAL INTELLIGENCE REVIEW, 2011, 36 (03) : 163 - 178
  • [10] Mining decision rules on data streams in the presence of concept drifts
    Tsai, Cheng-Jung
    Lee, Chien-I.
    Yang, Wei-Pang
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 1164 - 1178