Reduced Data Sets and Entropy-Based Discretization

被引:1
|
作者
Grzymala-Busse, Jerzy W. [1 ,2 ]
Hippe, Zdzislaw S. [2 ]
Mroczek, Teresa [2 ]
机构
[1] Univ Kansas, Dept Elect Engn & Comp Sci, Lawrence, KS 66045 USA
[2] Univ Informat Technol & Management, Dept Artificial Intelligence, PL-35225 Rzeszow, Poland
关键词
data mining; numerical attributes; discretization; entropy; FEATURE-SELECTION; ROUGH; PREDICTION; ATTRIBUTES;
D O I
10.3390/e21111051
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Results of experiments on numerical data sets discretized using two methods-global versions of Equal Frequency per Interval and Equal Interval Width-are presented. Globalization of both methods is based on entropy. For discretized data sets left and right reducts were computed. For each discretized data set and two data sets, based, respectively, on left and right reducts, we applied ten-fold cross validation using the C4.5 decision tree generation system. Our main objective was to compare the quality of all three types of data sets in terms of an error rate. Additionally, we compared complexity of generated decision trees. We show that reduction of data sets may only increase the error rate and that the decision trees generated from reduced decision sets are not simpler than the decision trees generated from non-reduced data sets.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Entropy-based discretization methods for ranking data
    de Sa, Claudio Rebelo
    Soares, Carlos
    Knobbe, Arno
    [J]. INFORMATION SCIENCES, 2016, 329 : 921 - 936
  • [2] Online entropy-based discretization for data streaming classification
    Ramirez-Gallego, S.
    Garcia, S.
    Herrera, F.
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 86 : 59 - 70
  • [3] Merging of Numerical Intervals in Entropy-Based Discretization
    Grzymala-Busse, Jerzy W.
    Mroczek, Teresa
    [J]. ENTROPY, 2018, 20 (11)
  • [4] Biometric recognition using entropy-based discretization
    Kumar, Ajay
    Zhang, David
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 125 - +
  • [5] Evolutionary Optimization Guided by Entropy-Based Discretization
    Sheri, Guleng
    Corne, David W.
    [J]. APPLICATIONS OF EVOLUTIONARY COMPUTING, PROCEEDINGS, 2009, 5484 : 695 - 704
  • [6] Entropy-based matrix learning machine for imbalanced data sets
    Zhu, Changming
    Wang, Zhe
    [J]. PATTERN RECOGNITION LETTERS, 2017, 88 : 72 - 80
  • [7] Entropy-based operations on fuzzy sets
    Rudas, IJ
    Kaynak, MO
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1998, 6 (01) : 33 - 40
  • [8] Unsupervised Entropy-Based Selection of Data Sets for Improved Model Fitting
    Ferreira, Pedro M.
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3330 - 3337
  • [9] On entropy-based data mining
    Holzinger, Andreas
    Hörtenhuber, Matthias
    Mayer, Christopher
    Bachler, Martin
    Wassertheurer, Siegfried
    Pinho, Armando J
    Koslicki, David
    [J]. 1600, Springer Verlag (8401): : 209 - 226
  • [10] An entropy-based discretization method for classification rules with inconsistency checking
    Li, RP
    Wang, ZO
    [J]. 2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 243 - 246