Distributed data mining on grids: Services, tools, and applications

Cited by: 65
Authors
Cannataro, M [1]
Congiusta, A
Pugliese, A
Talia, D
Trunfio, P
Affiliations
[1] Univ Catanzaro, I-88100 Catanzaro, Italy
[2] Univ Calabria, DEIS, I-87036 Arcavacata Di Rende, CS, Italy
Keywords
grid computing; grid programming; grid scheduling; knowledge grid; data mining
DOI
10.1109/TSMCB.2004.836890
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Data mining algorithms are widely used today for the analysis of large corporate and scientific datasets stored in databases and data archives. Industry, science, and commerce fields often need to analyze very large datasets maintained over geographically distributed sites by using the computational power of distributed and parallel systems. The grid can play a significant role in providing an effective computational support for distributed knowledge discovery applications. For the development of data mining applications on grids we designed a system called KNOWLEDGE GRID. This paper describes the KNOWLEDGE GRID framework and presents the toolset provided by the KNOWLEDGE GRID for implementing distributed knowledge discovery. The paper discusses how to design and implement data mining applications by using the KNOWLEDGE GRID tools starting from searching grid resources, composing software and data components, and executing the resulting data mining process on a grid. Some performance results are also discussed.
Pages: 2451 - 2465
Number of pages: 15
Related papers
50 in total
  • [1] WSRF services for composing distributed data mining applications on grids: Functionality and performance
    Talia, D
    Trunfio, P
    Verta, O
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2006, PT 1, 2006, 3980 : 1080 - 1089
  • [2] Middleware for data mining applications on clusters and grids
    Glimcher, Leonid
    Jin, Ruoming
    Agrawal, Gagan
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2008, 68 (01) : 37 - 53
  • [3] Data mining methods, applications, and tools
    Chen, LD
    Sakaguchi, T
    Frolick, MN
    [J]. INFORMATION SYSTEMS MANAGEMENT, 2000, 17 (01) : 65 - 70
  • [4] Distributed Data Mining Tasks and Patterns as Services
    Talia, Domenico
    [J]. EURO-PAR 2008 WORKSHOPS - PARALLEL PROCESSING, 2009, 5415 : 415 - 422
  • [5] Distributed data mining services leveraging WSRF
    Congiusta, Antonio
    Talia, Domenico
    Trunfio, Paolo
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF GRID COMPUTING THEORY METHODS AND APPLICATIONS, 2007, 23 (01) : 34 - 41
  • [6] Web services composition for distributed data mining
    Ali, AS
    Rana, OF
    Taylor, IJ
    [J]. 2005 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS, PROCEEDINGS, 2005, : 11 - 18
  • [7] Applications of data mining in Web services
    Nayak, R
    Tong, C
    [J]. WEB INFORMATION SYSTEMS - WISE 2004, PROCEEDINGS, 2004, 3306 : 199 - 205
  • [8] Selected Data Mining Tools for Data Analysis in Distributed Environment
    Moshkov, Mikhail
    Zielosko, Beata
    Tetteh, Evans Teiko
    [J]. ENTROPY, 2022, 24 (10)
  • [9] Distributed Data Association Rule Mining: Tools and Techniques
    Sethi, Manoj
    Jindal, Rajni
    [J]. PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 481 - 485
  • [10] Data management services in ChinaGrid for data mining applications
    Wu, Song
    Wang, Wei
    Xiong, Muzhou
    Jin, Hai
    [J]. EMERGING TECHNOLOGIES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2007, 4819 : 421 - 432