A hybrid decision tree/genetic algorithm method for data mining

被引:95
|
作者
Carvalho, DR
Freitas, AA [1 ]
机构
[1] Univ Kent, Comp Lab, Canterbury CT2 7NF, Kent, England
[2] Univ Tuiuti Parana, Dept Comp Sci, BR-80215090 Curitiba, Parana, Brazil
关键词
classification; genetic algorithms; decision trees; data mining; machine learning;
D O I
10.1016/j.ins.2003.03.013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses the well-known classification task of data mining, where the objective is to predict the class which an example belongs to. Discovered knowledge is expressed in the form of high-level, easy-to-interpret classification rules. In order to discover classification rules, we propose a hybrid decision tree/genetic algorithm method. The central idea of this hybrid method involves the concept of small disjuncts in data mining, as follows. In essence, a set of classification rules can be regarded as a logical disjunction of rules, so that each rule can be regarded as a disjunct. A small disjunct is a rule covering a small number of examples. Due to their nature, small disjuncts are error prone. However, although each small disjunct covers just a few examples, the set of all small disjuncts can cover a large number of examples, so that it is important to develop new approaches to cope with the problem of small disjuncts. In our hybrid approach, we have developed two genetic algorithms (GA) specifically designed for discovering rules covering examples belonging to small disjuncts, whereas a conventional decision tree algorithm is used to produce rules covering examples belonging to large disjuncts. We present results evaluating the performance of the hybrid method in 22 real-world data sets. (C) 2003 Elsevier Inc. All rights reserved.
引用
收藏
页码:13 / 35
页数:23
相关论文
共 50 条
  • [1] New results for a hybrid decision tree/genetic algorithm for data mining
    Carvalho, DR
    Freitas, AA
    APPLICATIONS AND SCIENCE IN SOFT COMPUTING, 2004, : 149 - 154
  • [2] A hybrid genetic algorithm - Decision tree classifier
    Salem, ABM
    Mahmoud, AM
    INTELLIGENT INFORMATION PROCESSING AND WEB MINING, 2003, : 221 - 232
  • [3] Deep Mining Method of Distributed Data Association Based on Decision Tree Algorithm
    Cai, Jingjing
    Ding, Yongsheng
    Engineering Intelligent Systems, 2023, 31 (03): : 229 - 237
  • [4] Elegant decision tree algorithm for classification in data mining
    Chandra, B
    Mazumdar, S
    Arena, V
    Parimi, N
    WISE 2002: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING (WORKSHOPS), 2002, : 160 - 169
  • [5] A method for classifying and mining online teaching data in universities based on decision tree algorithm
    Wang, Fei
    Wu, Xiaoyan
    INTERNATIONAL JOURNAL OF CONTINUING ENGINEERING EDUCATION AND LIFE-LONG LEARNING, 2025, 35 (1-2)
  • [6] Data mining for fuzzy decision tree structure with a genetic program
    Smith, JF
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2002, 2002, 2412 : 13 - 18
  • [7] Research on the application of data mining algorithm based on decision tree
    Song, Liangong
    Metallurgical and Mining Industry, 2015, 7 (09): : 843 - 848
  • [8] A Statistical Decision Tree Algorithm for Medical Data Stream Mining
    Cazzolato, Mirela Teixeira
    Ribeiro, Marcela Xavier
    2013 IEEE 26TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2013, : 389 - 392
  • [9] Design and application of decision tree algorithm SLIQ in data mining
    Yan, Hongwen
    Ma, Rui
    Long, Jizhen
    Yan, Hongbin
    Jisuanji Gongcheng/Computer Engineering, 2005, 31 (06): : 60 - 62
  • [10] Research on dynamic cost-sensitive decision tree for mining uncertain data based on the genetic algorithm
    Huang, Yuwen
    Huang, Yuwen, 1600, Science and Engineering Research Support Society (07):