Decision tree construction for data mining on grid computing environments

Cited by: 0
Authors
Yang, CT [1]
Tsai, ST [1]
Li, KC [1]
Affiliation
[1] Tunghai Univ, High Performance Comp Lab, Taichung 40704, Taiwan
Keywords
data mining; grid computing; PC clusters; heterogeneous; performance;
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
In this paper, we present our Grid-based decision tree architecture, which is intended to support both parallel and sequential algorithms. We also show that, based on the scope and model of data mining applied in the Grid environment and on the corresponding user perspective, Grid roles can be categorized into three types. Through these definitions, we aim to help software developers define clear system processes and delimit the application scope of their software. To realize the architecture, we first apply an existing parallel decision tree algorithm, SPRINT, to the Grid environment. Performance and other differences are then compared using datasets of different sizes. The experimental results will serve as a reference for further development.
Pages: 421-424
Number of pages: 4