Distributed data mining on grids: Services, tools, and applications

被引:65
|
作者
Cannataro, M [1 ]
Congiusta, A
Pugliese, A
Talia, D
Trunfio, P
机构
[1] Univ Catanzaro, I-88100 Catanzaro, Italy
[2] Univ Calabria, DEIS, I-87036 Arcavacata Di Rende, CS, Italy
关键词
grid computing; grid programming; grid scheduling; knowledge grid; data mining;
D O I
10.1109/TSMCB.2004.836890
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data mining algorithms are widely used today for the analysis of large corporate and scientific datasets stored in databases and data archives. Industry, science, and commerce fields often need to analyze very large datasets maintained over geographically distributed sites by using the computational power of distributed and parallel systems. The grid can play a significant role in providing an effective computational support for distributed knowledge discovery applications. For the development of data mining applications on grids we designed a system called KNOWLEDGE GRID. This paper describes the KNOWLEDGE GRID framework and presents the toolset provided by the KNOWLEDGE GRID for implementing distributed knowledge discovery. The paper discusses how to design and implement data mining applications by using the KNOWLEDGE GRID tools starting from searching grid resources, composing software and data components, and executing the resulting data mining process on a grid. Some performance results are also discussed.
引用
收藏
页码:2451 / 2465
页数:15
相关论文
共 50 条
  • [31] Virtual services in data grids
    Jagatheesan, A
    Moore, R
    Rajasekar, A
    Zhu, B
    [J]. 11TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, PROCEEDINGS, 2002, : 420 - 420
  • [32] How Distributed Data Mining Tasks can Thrive as Knowledge Services
    Talia, Domenico
    Trunfio, Paolo
    [J]. COMMUNICATIONS OF THE ACM, 2010, 53 (07) : 132 - 137
  • [33] Data mining tools
    Mikut, Ralf
    Reischl, Markus
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (05) : 431 - 443
  • [34] Data mining tools
    Bartschat, Andreas
    Reischl, Markus
    Mikut, Ralf
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 9 (04)
  • [35] Grid-based Approaches for Distributed Data Mining Applications
    Aouad, Lamine M.
    An-Lekhac, Nhien
    Kechadi, Tahar
    [J]. JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2009, 3 (04) : 517 - 534
  • [36] Grid-based approaches for distributed data mining applications
    Aouad, Lamine M.
    Le-Khac, Nhien-An
    Kechadi, Tahar M.
    [J]. DCABES 2007 Proceedings, Vols I and II, 2007, : 772 - 775
  • [37] Developing distributed data mining applications in the KNOWLEDGE GRID framework
    Bueti, G
    Congiusta, A
    Talia, D
    [J]. HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2004, 2005, 3402 : 156 - 169
  • [38] Ancillary Services to Grids Provided with Distributed Generation
    Strzelecki, Ryszard
    Benysek, Grzegorz
    Jarnut, Marcin
    Smolenski, Robert
    Maciejewski, Bartosz
    [J]. CPE: 2009 COMPATIBILITY AND POWER ELECTRONICS, 2009, : 29 - 34
  • [39] Big Trajectory Data Mining: A Survey of Methods, Applications, and Services
    Wang, Di
    Miwa, Tomio
    Morikawa, Takayuki
    [J]. SENSORS, 2020, 20 (16) : 1 - 33
  • [40] Open active services for data-intensive distributed applications
    Collet, C
    Vargas-Solar, G
    Grazziotin-Ribeiro, H
    [J]. 2000 INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM - PROCEEDINGS, 2000, : 349 - 359