A DISTANCE-BASED ATTRIBUTE SELECTION MEASURE FOR DECISION TREE INDUCTION

被引:233
|
作者
DEMANTARAS, RL
机构
[1] Centre of Advanced Studies, CSIC, Girona
关键词
DISTANCE BETWEEN PARTITIONS; DECISION TREE INDUCTION; INFORMATION MEASURES;
D O I
10.1023/A:1022694001379
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This note introduces a new attribute selection measure for ID3-like inductive algorithms. This measure is based on a distance between partitions such that the selected attribute in a node induces the partition which is closest to the correct partition of the subset of training examples corresponding to this node. The relationship of this measure with Quinlan's information gain is also established. It is also formally proved that our distance is not biased towards attributes with large numbers of values. Experimental studies with this distance confirm previously reported results showing that the predictive accuracy of induced decision trees is not sensitive to the goodness of the attribute selection measure. However, this distance produces smaller trees than the gain ratio measure of Quinlan, especially in the case of data whose attributes have significantly different numbers of values.
引用
收藏
页码:81 / 92
页数:12
相关论文
共 50 条
  • [1] An improved attribute selection measure for decision tree induction
    Wang, Dianhong
    Jiang, Liangxiao
    FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 4, PROCEEDINGS, 2007, : 654 - +
  • [2] The choice of the best attribute selection measure in Decision Tree induction
    Badulescu, Laviniu Aurelian
    ANNALS OF THE UNIVERSITY OF CRAIOVA-MATHEMATICS AND COMPUTER SCIENCE SERIES, 2007, 34 : 89 - 94
  • [3] THE IMPORTANCE OF ATTRIBUTE SELECTION MEASURES IN DECISION TREE INDUCTION
    LIU, WZ
    WHITE, AP
    MACHINE LEARNING, 1994, 15 (01) : 25 - 41
  • [4] Comparative Analysis of Attribute Selection Measures Used for Attribute Selection in Decision Tree Induction
    Bhatt, Advait S.
    2012 INTERNATIONAL CONFERENCE ON RADAR, COMMUNICATION AND COMPUTING (ICRCC), 2012, : 230 - 234
  • [5] Distance-Based Decision Tree Algorithms for Label Ranking
    de Sa, Claudio Rebelo
    Rebelo, Carla
    Soares, Carlos
    Knobbe, Arno
    PROGRESS IN ARTIFICIAL INTELLIGENCE-BK, 2015, 9273 : 525 - 534
  • [6] An Decision Tree Algorithm Based on Dispersion Measure of Attribute Information
    He Dengchao
    Hao Wenning
    Gan Wenyan
    Chen Gang
    Jin Dawei
    2013 10TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA 2013), 2013, : 84 - 89
  • [7] Gain Ratio as Attribute Selection Measure in Elegant Decision Tree to Predict Precipitation
    Prasad, Narasimha
    Naidu, Mannava Munirathnam
    2013 8TH EUROSIM CONGRESS ON MODELLING AND SIMULATION (EUROSIM), 2013, : 141 - 150
  • [8] Decision tree induction using a fast splitting attribute selection for large datasets
    Franco-Arcega, A.
    Carrasco-Ochoa, J. A.
    Sanchez-Diaz, G.
    Fco Martinez-Trinidad, J.
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (11) : 14290 - 14300
  • [9] Distance-Based Tournament Selection
    Oesch, Christian
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2017, PT I, 2017, 10199 : 705 - 714
  • [10] Distance-based attribute value recombining algorithm
    Nian Fuzhong
    Bai Shibao
    Li Ming
    2007 IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION, VOLS 1-7, 2007, : 1986 - 1990