Fast Decision Tree Algorithm

被引:9
|
作者
Purdila, Vasile [1 ]
Pentiuc, Stefan-Gheorghe [1 ]
机构
[1] Stefan Cel Mare Univ Suceava, Suceava 720229, Romania
关键词
algorithm; chi-merge; classification; data compression; decision tree; pruning;
D O I
10.4316/AECE.2014.01010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There is a growing interest nowadays to process large amounts of data using the well-known decision-tree learning algorithms. Building a decision tree as fast as possible against a large dataset without substantial decrease in accuracy and using as little memory as possible is essential. In this paper we present an improved C4.5 algorithm that uses a compression mechanism to store the training and test data in memory. We also present a very fast tree pruning algorithm. Our experiments show that presented algorithms perform better than C5.0 in terms of speed and classification accuracy in most cases at the expense of tree size - the resulting trees are larger than the ones produced by C5.0. The data compression and pruning algorithms can be easily parallelized in order to achieve further speedup.
引用
收藏
页码:65 / 68
页数:4
相关论文
共 50 条
  • [1] VERY FAST DECISION TREE (VFDT) ALGORITHM ON HADOOP
    Desai, Sharmishta
    Roy, Sourav
    Patel, Brina
    Purandare, Samruddhi
    Kucheria, Minal
    2016 INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2016,
  • [2] Fast Intra Coding Algorithm for HEVC Based on Decision Tree
    Qin, Jia
    Bai, Huihui
    Zhang, Mengmeng
    Zhao, Yao
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2017, E100A (05): : 1274 - 1278
  • [3] Very Fast C4.5 Decision Tree Algorithm
    Cherfi, Anis
    Nouira, Kaouther
    Ferchichi, Ahmed
    APPLIED ARTIFICIAL INTELLIGENCE, 2018, 32 (02) : 119 - 137
  • [4] An Improved Fast Decision Tree Algorithm Based on Attribute Deviation
    Liu, Dan
    Zhang, Yue
    Sui, Xin
    Zeng, Yan
    Wang, Huan
    Li, Li
    FUZZY SYSTEMS AND DATA MINING III (FSDM 2017), 2017, 299 : 441 - 446
  • [5] A Fast SVM Training Algorithm Based on a Decision Tree Data Filter
    Cervantes, Jair
    Lopez, Asdrubal
    Garcia, Farid
    Trueba, Adrian
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PT I, 2011, 7094 : 187 - +
  • [6] Extremely Fast Decision Tree
    Manapragada, Chaitanya
    Webb, Geoffrey I.
    Salehi, Mahsa
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1953 - 1962
  • [7] An improvement on the algorithm of decision tree
    Liu, XM
    Huang, HK
    Xu, WX
    PROCEEDINGS OF THE 8TH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1-3, 2005, : 1485 - 1488
  • [8] A FAST TREE SOREING ALGORITHM
    黄竞伟
    戴大为
    ActaMathematicaScientia, 1998, (04) : 421 - 426
  • [9] A fast tree sorting algorithm
    Huang, JW
    Dai, DW
    ACTA MATHEMATICA SCIENTIA, 1998, 18 (04) : 421 - 426
  • [10] FAST IMAGE INTERPOLATION WITH DECISION TREE
    Huang, Jun-Jie
    Siu, Wan-Chi
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1221 - 1225