Scalable classification of large data sets by parallel genetic programming

被引:0
|
作者
Folino, G [1 ]
Pizzuti, C [1 ]
Spezzano, G [1 ]
机构
[1] Univ Calabria, ISI, CNR, DEIS, I-87036 Arcavacata Di Rende, CS, Italy
关键词
data mining; parallel genetic programming; cellular automata;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A parallel genetic programming approach to data classification is presented. The method uses cellular automata as a framework to enable a fine-grained parallel implementation of GP through the grid model. Experiments on real datasets from the UCI machine learning repository show good results with respect to C4.5. The generated trees are smaller, they have a misclassification error on the training set comparable, but, more important, they generalise better than C4.5: Furthermore, performance results show a nearly linear speedup.
引用
收藏
页码:87 / 90
页数:4
相关论文
共 50 条
  • [1] Scale Genetic Programming for large Data Sets: Case of Higgs Bosons Classification
    Hmida, Hmida
    Ben Hamida, Sana
    Borgi, Amel
    Rukoz, Marta
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 : 302 - 311
  • [2] Data classification using genetic parallel programming
    Cheang, SM
    Lee, KH
    Leung, KS
    GENETIC AND EVOLUTIONARY COMPUTATION - GECCO 2003, PT II, PROCEEDINGS, 2003, 2724 : 1918 - 1919
  • [3] Evolving data classification programs using genetic parallel programming
    Cheang, SM
    Lee, KH
    Leung, KS
    CEC: 2003 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-4, PROCEEDINGS, 2003, : 248 - 255
  • [4] A scalable cellular implementation of parallel genetic programming
    Folino, G
    Pizzuti, C
    Spezzano, G
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2003, 7 (01) : 37 - 53
  • [5] P-AutoClass: Scalable parallel clustering for mining large data sets
    Pizzuti, C
    Talia, D
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (03) : 629 - 641
  • [6] Classification of Imbalanced data sets using Multi Objective Genetic Programming
    Maheta, Hardik H.
    Dabhi, Vipul K.
    2015 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2015,
  • [7] A Parallel Genetic Programming Algorithm for Classification
    Cano, Alberto
    Zafra, Amelia
    Ventura, Sebastian
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, PART I, 2011, 6678 : 172 - 181
  • [8] Parallel Distributed Genetic Rule Selection for Data Mining from Large Data Sets
    Nojima, Yusuke
    Mihara, Shingo
    Ishibuchi, Hisao
    SIMULATION AND MODELING RELATED TO COMPUTATIONAL SCIENCE AND ROBOTICS TECHNOLOGY, 2012, 37 : 140 - 154
  • [9] Parallel visualization of large data sets
    Rosenberg, R
    Lanzagorta, M
    Chtchelkanova, A
    Khokhlov, A
    VISUAL DATA EXPLORATION AND ANALYSIS VII, 2000, 3960 : 135 - 143
  • [10] Scalable parallel computations for large-scale stochastic programming
    Vladimirou, H
    Zenios, SA
    ANNALS OF OPERATIONS RESEARCH, 1999, 90 (0) : 87 - 129