Parallel formulations of decision-tree classification algorithms

被引:70
|
作者
Srivastava, A
Han, EH
Kumar, V
Singh, V
机构
[1] Univ Minnesota, Dept Comp Sci & Engn, Army HPC Res Ctr, Minneapolis, MN 55455 USA
[2] Hitachi Amer Inc, Informat Technol Lab, Tarrytown, NY 10591 USA
基金
美国国家科学基金会;
关键词
data mining; parallel processing; classification; scalability; decision trees;
D O I
10.1023/A:1009832825273
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification decision tree algorithms are used extensively for data mining in many domains such as retail target marketing, fraud detection, etc. Highly parallel algorithms for constructing classification decision trees are desirable for dealing with large data sets in reasonable amount of time. Algorithms for building classification decision trees have a natural concurrency, but are difficult to parallelize due to the inherent dynamic nature of the computation. In this paper, we present parallel formulations of classification decision tree learning algorithm based on induction. We describe two basic parallel formulations. One is based on Synchronous Tree Construction Approach and the other is based on Partitioned Tree Construction Approach. We discuss the advantages and disadvantages of using these methods and propose a hybrid method that employs the good features of these methods. We also provide the analysis of the cost of computation and communication of the proposed hybrid method. Moreover, experimental results on an IBM SP-2 demonstrate excellent speedups and scalability.
引用
收藏
页码:237 / 261
页数:25
相关论文
共 50 条
  • [41] Conformal decision-tree approach to instance transfer
    Zhou, S.
    Smirnov, E. N.
    Schoenmakers, G.
    Peeters, R.
    ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2017, 81 (1-2) : 85 - 104
  • [42] A decision-tree approach to the discovery of empirical regularities
    Asaithambi, A
    Valev, V
    INFORMATION REUSE AND INTEGRATION, 2001, : 18 - 23
  • [43] Automatic design of decision-tree induction algorithms tailored to flexible-receptor docking data
    Rodrigo C Barros
    Ana T Winck
    Karina S Machado
    Márcio P Basgalupp
    André CPLF de Carvalho
    Duncan D Ruiz
    Osmar Norberto de Souza
    BMC Bioinformatics, 13
  • [44] Decision-tree algorithm hybridized with neural network
    Shao, H.
    Zhao, H.
    Xiaoxing Weixing Jisuanji Xitong/Mini-Micro Systems, 2001, 22 (08):
  • [45] Uncertainty and risk from decision-tree analysis
    Lerche, I
    MacKay, JA
    ENERGY EXPLORATION & EXPLOITATION, 1997, 15 (06) : 493 - 496
  • [46] Explanation of attribute relevance in decision-tree induction
    McSherry, D
    RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XVIII, 2002, : 39 - 52
  • [47] A Decision-Tree Based Approach for Mutation Testing
    Reena
    Choudhary, Jyoti
    2014 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS, 2014, : 1173 - 1176
  • [48] A decision-tree model of balance scale development
    Schmidt, WC
    Ling, CX
    MACHINE LEARNING, 1996, 24 (03) : 203 - 230
  • [49] Automatic design of decision-tree induction algorithms tailored to flexible-receptor docking data
    Barros, Rodrigo C.
    Winck, Ana T.
    Machado, Karina S.
    Basgalupp, Marcio P.
    de Carvalho, Andre C. P. L. F.
    Ruiz, Duncan D.
    de Souza, Osmar Norberto
    BMC BIOINFORMATICS, 2012, 13
  • [50] Conformal decision-tree approach to instance transfer
    S. Zhou
    E. N. Smirnov
    G. Schoenmakers
    R. Peeters
    Annals of Mathematics and Artificial Intelligence, 2017, 81 : 85 - 104