A Comparative Analysis of Pruning Methods for C4.5 and Fuzzy C4.5

被引:1
|
作者
Naseer, Tayyeba [1 ]
Asghar, Sohail [2 ]
Zhuang, Yan [3 ]
Fong, Simon [3 ]
机构
[1] Arid Agr Univ, PMAS, UIIT, Rawalpindi, Pakistan
[2] COMSATS Inst Informat Technol, Islamabad, Pakistan
[3] Univ Macau, Dept Comp & Informat Sci, Taipa, Macau, Peoples R China
来源
关键词
Classification; C4.5; Fuzzy C4.5; Data mining; TREES;
D O I
10.3233/978-1-61499-503-6-304
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The decision tree is an illustrious classification technique used for the pre-diction of the future data based on past experience. The decision tree is constructed using three major steps, first - constructing the decision tree to classify the data, second - pruning the decision tree to improve statistic certainty, third process the pruned decision tree to improve intelligibility. In this paper, we focus on second steps - pruning. Two famous algorithms are used for creating a decision tree, i.e. C4.5 and fuzzy C4.5. It presents the comparison of five well-known pruning methods. The performance of different pruning algorithm is evaluated using two main criteria size and classification errors after pruning decision trees. Different pruning methods have different impacts on decision tree.
引用
收藏
页码:304 / 312
页数:9
相关论文
共 50 条
  • [1] Fast C4.5
    He, Ping
    Chen, Ling
    Xu, Xiao-Hua
    [J]. PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 2841 - +
  • [2] Efficient C4.5
    Ruggieri, S
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (02) : 438 - 444
  • [3] C4.5 decision forests
    Ho, TK
    [J]. FOURTEENTH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1 AND 2, 1998, : 545 - 549
  • [4] Bagging, boosting, and C4.5
    Quinlan, JR
    [J]. PROCEEDINGS OF THE THIRTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE EIGHTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE, VOLS 1 AND 2, 1996, : 725 - 730
  • [5] Comparative Analysis of Data Customer Classification with C4.5 Algorithm
    Aisyah, Siti
    Rumapea, Bondang Johanes
    Halwan, M. Ghifari
    Siahaan, Denny Hartanto
    [J]. INTERNETWORKING INDONESIA, 2020, 12 (02): : 3 - 7
  • [6] Kansei factor analysis using C4.5
    Yoshida, K
    [J]. SOFT COMPUTING AND INDUSTRY: RECENT APPLICATIONS, 2002, : 577 - 588
  • [7] Improved C4.5 Algorithm for the Analysis of Sales
    Cao, Rong
    Xu, Lizhen
    [J]. 2009 SIXTH WEB INFORMATION SYSTEMS AND APPLICATIONS CONFERENCE, PROCEEDINGS, 2009, : 173 - 176
  • [8] C4.5算法的优化
    黄秀霞
    孙力
    [J]. 计算机工程与设计, 2016, 37 (05) : 1265 - 1270
  • [9] The Application and Research of C4.5 Algorithm
    Zhao, Hongyan
    [J]. APPLIED SCIENCE, MATERIALS SCIENCE AND INFORMATION TECHNOLOGIES IN INDUSTRY, 2014, 513-517 : 1285 - 1288
  • [10] Credal C4.5 with Refinement of Parameters
    Mantas, Carlos J.
    Abellan, Joaquin
    Castellano, Javier G.
    Cano, Jose R.
    Moral, Serafin
    [J]. INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS: APPLICATIONS, IPMU 2018, PT III, 2018, 855 : 739 - 747