Parallel out-of-core divide-and-conquer techniques with application to classification trees

被引:7
|
作者
Sreenivas, MK [1 ]
AlSabti, K [1 ]
Ranka, S [1 ]
机构
[1] Univ Florida, Dept CISE, Gainesville, FL 32611 USA
关键词
D O I
10.1109/IPPS.1999.760532
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Classification is an important problem in the field of data mining. Construction of good classifiers is computationally intensive and offers plenty of scope for parallelization. Divide-and-conquer paradigm can be used to efficiently construct decision tree classifiers. We discuss in detail various techniques for parallel divide-and-conquer and extend these techniques to handle efficiently disk-resident data. Furthermore, a generic technique for parallel out-of-core divide-and-conquer problems is suggested We present pCLOUDS, the parallel version of the decision tree classifier algorithm CLOUDS, capable of handling large out-of-core data sets. pCLOUDS exhibits excellent speedup, sizeup and scaleup properties which make it a competitive tool for data mining applications. We evaluate the performance of pCLOUDS for a range of synthetic darn sets on the IBM-SP2.
引用
收藏
页码:555 / 562
页数:8
相关论文
共 50 条
  • [1] Parallel out-of-core divide-and-conquer techniques with application to classification trees
    Univ of Florida, Gainesville, United States
    [J]. Proc Int Parall Process Symp IPPS, (555-562):
  • [2] DIVIDE-AND-CONQUER FOR PARALLEL PROCESSING
    HOROWITZ, E
    ZORAT, A
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1983, 32 (06) : 582 - 585
  • [3] Parallel skeletons for Divide-and-Conquer and Branch-and-Bound techniques
    Dorta, I
    León, C
    Rodríguez, C
    Rojas, A
    [J]. ELEVENTH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, PROCEEDINGS, 2003, : 292 - 298
  • [4] DIVIDE-AND-CONQUER AND PARALLEL GRAPH REDUCTION
    RABHI, FA
    MANSON, GA
    [J]. PARALLEL COMPUTING, 1991, 17 (2-3) : 189 - 205
  • [5] Divide-and-conquer recurrences - Classification of asymptotics
    Derfel G.
    Vogl F.
    [J]. aequationes mathematicae, 2000, 60 (3) : 243 - 257
  • [6] Compilation techniques for out-of-core parallel computations
    Kandemir, M
    Choudhary, A
    Ramanujam, J
    Bordawekar, R
    [J]. PARALLEL COMPUTING, 1998, 24 (3-4) : 597 - 628
  • [7] Parallel divide-and-conquer scheme for Delaunay triangulation
    Chen, MB
    Chuang, TR
    Wu, JJ
    [J]. NINTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 2002, : 571 - 576
  • [8] A Parallel Scheme Using the Divide-and-Conquer Method
    Qi Yang
    Son Dao
    Clement Yu
    Naphtali Rishe
    [J]. Distributed and Parallel Databases, 1997, 5 : 405 - 438
  • [9] A fast parallel squarer based on divide-and-conquer
    Yoo, JT
    Smith, KF
    Gopalakrishnan, G
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1997, 32 (06) : 909 - 912
  • [10] A parallel scheme using the divide-and-conquer method
    Yang, Q
    Dao, S
    Yu, C
    Rishe, NA
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 1997, 5 (04) : 405 - 438