Fast Decision Tree Algorithm

Cited by: 9
Authors
Purdila, Vasile [1]
Pentiuc, Stefan-Gheorghe [1]
Affiliations
[1] Stefan Cel Mare Univ Suceava, Suceava 720229, Romania
Keywords
algorithm; chi-merge; classification; data compression; decision tree; pruning
DOI
10.4316/AECE.2014.01010
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
There is growing interest in processing large amounts of data with well-known decision-tree learning algorithms. It is essential to build a decision tree on a large dataset as quickly as possible, without a substantial decrease in accuracy and with as little memory as possible. In this paper we present an improved C4.5 algorithm that uses a compression mechanism to store the training and test data in memory. We also present a very fast tree pruning algorithm. Our experiments show that the presented algorithms outperform C5.0 in speed and classification accuracy in most cases, at the expense of tree size: the resulting trees are larger than those produced by C5.0. The data compression and pruning algorithms can be easily parallelized to achieve further speedup.
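For readers unfamiliar with C4.5, the following is a minimal sketch of the gain-ratio split criterion that standard C4.5 uses to choose an attribute. It is not the authors' improved algorithm: the abstract does not describe their compression or pruning steps, so none of that is reproduced here. The function names `entropy` and `gain_ratio` are hypothetical, and the sketch assumes a categorical attribute.

```python
import math
from collections import Counter


def entropy(items):
    """Shannon entropy (in bits) of a sequence of hashable values."""
    total = len(items)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(items).values())


def gain_ratio(values, labels):
    """Gain ratio of splitting `labels` by the categorical attribute `values`.

    Assumption: this is the textbook C4.5 criterion, not the paper's variant.
    """
    total = len(labels)
    # Group the class labels by attribute value.
    groups = {}
    for v, y in zip(values, labels):
        groups.setdefault(v, []).append(y)
    # Weighted entropy remaining after the split.
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    info_gain = entropy(labels) - remainder
    # Split information penalizes attributes with many distinct values.
    split_info = entropy(values)
    return info_gain / split_info if split_info > 0 else 0.0


if __name__ == "__main__":
    outlook = ["sunny", "sunny", "rain", "rain"]
    play = ["no", "no", "yes", "yes"]
    print(gain_ratio(outlook, play))
```

An attribute with a single distinct value has zero split information, so the sketch returns 0.0 for it rather than dividing by zero.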
Pages: 65 - 68
Number of pages: 4
Related papers
50 in total
  • [21] An Explainable Bayesian Decision Tree Algorithm
    Nuti, Giuseppe
    Rugama, Lluis Antoni Jimenez
    Cross, Andreea-Ingrid
    FRONTIERS IN APPLIED MATHEMATICS AND STATISTICS, 2021, 7
  • [22] A Streaming Parallel Decision Tree Algorithm
    Ben-Haim, Yael
    Tom-Tov, Elad
    JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 849 - 872
  • [23] Research on the parallelism of decision tree algorithm
    Guo, Jingfeng
    Mi, Pubo
    Liu, Guohua
    Jisuanji Gongcheng/Computer Engineering, 2002, 28 (08):
  • [24] Decision tree algorithm for packet classification
    Lyu, Gaofeng
    Tan, Jing
    Qiao, Guanjie
    Yan, Jinli
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2022, 44 (03) : 184 - 193
  • [25] A streaming parallel decision tree algorithm
    Ben-Haim, Yael
    Tom-Tov, Elad
    Journal of Machine Learning Research, 2010, 11 : 849 - 872
  • [26] DECISION-TREE LEARNING ALGORITHM
    Fresku, E.
    Anamali, A.
    JOURNAL OF ENVIRONMENTAL PROTECTION AND ECOLOGY, 2014, 15 (02) : 686 - 696
  • [27] Feature bundling in decision tree algorithm
    Zhuang, Xu
    Zhu, Yan
    Chang, Chin-Chen
    Peng, Qiang
    INTELLIGENT DATA ANALYSIS, 2017, 21 (02) : 371 - 383
  • [28] Fast algorithm for stochastic tree computation
    Kang, MZ
    de Reffye, P
    Barczi, JF
    Hu, BG
    WSCG'2003 POSTER PROCEEDINGS, 2003, : 65 - 68
  • [30] Decision tree algorithm based on sampling
    Song Xudong
    Cheng Xiaolan
    2007 IFIP INTERNATIONAL CONFERENCE ON NETWORK AND PARALLEL COMPUTING WORKSHOPS, PROCEEDINGS, 2007, : 689 - +