Elegant decision tree algorithm for classification in data mining

被引:0
|
作者
Chandra, B [1 ]
Mazumdar, S [1 ]
Arena, V [1 ]
Parimi, N [1 ]
机构
[1] Indian Inst Technol, New Delhi, India
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Decision trees have been found very effective for classification especially in Data Mining. This paper aims at improving the performance of the SLIQ decision tree algorithm (Mehta et. al, 1996) for classification in data mining The drawback of this algorithm is that large number of gini indices have to be computed at each node of the decision tree. In order to decide which attribute is to be split at each node, the gini indices have to be computed for all the attributes and for each successive pair of values for all patterns which have not been classified. An improvement over the SLIQ algorithm has been proposed to reduce the computational complexity. In this algorithm, the gini index is computed not for every successive pair of values of an attribute but over different ranges of attribute values. Classification accuracy of this technique was compared with the existing SLIQ and the Neural Network technique on three real life datasets consisting of the effect of different chemicals on water pollution, Wisconsin Breast Cancer Data and Image data It was observed that the decision tree constructed using the proposed decision tree algorithm gave far better classification accuracy than the classification accuracy obtained using the SLIQ algorithm irrespective of the dataset under consideration. The classification accuracy of this algorithm was even better compared to the neural network classification technique. Overall, it was observed that this decision tree algorithm not only reduces the number of computations of gini indices but also leads to better classification accuracy.
引用
收藏
页码:160 / 169
页数:10
相关论文
共 50 条
  • [1] The research of decision tree learning algorithm in technology of data mining classification
    Department of Mechanical and Electrical Information, Lishui Vocational and Technical College, ZheJiang, China
    J. Convergence Inf. Technol., 2012, 10 (216-223):
  • [2] Privacy protection data mining algorithm in blockchain based on decision tree classification
    Cao, Yu
    Wei, Wei
    Zhou, Jin
    WEB INTELLIGENCE, 2022, 20 (02) : 103 - 112
  • [3] Generalization and decision tree induction: Efficient classification in data mining
    Kamber, M
    Winstone, L
    Gong, W
    Cheng, S
    Han, JW
    SEVENTH INTERNATIONAL WORKSHOP ON RESEARCH ISSUES IN DATA ENGINEERING, PROCEEDINGS: HIGH PERFORMANCE DATABASE MANAGEMENT FOR LARGE-SCALE APPLICATIONS, 1997, : 111 - 120
  • [4] A Statistical Decision Tree Algorithm for Data Stream Classification
    Cazzolato, Mirela Teixeira
    Ribeiro, Marcela Xavier
    Yaguinuma, Cristiane
    Prado Santos, Marilde Terezinha
    ICEIS: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 1, 2013, : 217 - 223
  • [5] A hybrid decision tree/genetic algorithm method for data mining
    Carvalho, DR
    Freitas, AA
    INFORMATION SCIENCES, 2004, 163 (1-3) : 13 - 35
  • [6] Research on the application of data mining algorithm based on decision tree
    Song, Liangong
    Metallurgical and Mining Industry, 2015, 7 (09): : 843 - 848
  • [7] A Statistical Decision Tree Algorithm for Medical Data Stream Mining
    Cazzolato, Mirela Teixeira
    Ribeiro, Marcela Xavier
    2013 IEEE 26TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2013, : 389 - 392
  • [8] Design and application of decision tree algorithm SLIQ in data mining
    Yan, Hongwen
    Ma, Rui
    Long, Jizhen
    Yan, Hongbin
    Jisuanji Gongcheng/Computer Engineering, 2005, 31 (06): : 60 - 62
  • [9] Vlsi Implementation Of Flexible Architecture For Decision Tree Classification In Data Mining
    Sharma, K. Venkatesh
    Shewandagn, Behailu
    Bhukya, Shankar Nayak
    INTERNATIONAL CONFERENCE ON FUNCTIONAL MATERIALS, CHARACTERIZATION, SOLID STATE PHYSICS, POWER, THERMAL AND COMBUSTION ENERGY (FCSPTC-2017), 2017, 1859
  • [10] An incremental fuzzy decision tree classification method for mining data streams
    Wang, Tao
    Li, Zhoujun
    Yan, Yuejin
    Chen, Huowang
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINGS, 2007, 4571 : 91 - +