Distributed data mining on grids: Services, tools, and applications

被引:65
|
作者
Cannataro, M [1 ]
Congiusta, A
Pugliese, A
Talia, D
Trunfio, P
机构
[1] Univ Catanzaro, I-88100 Catanzaro, Italy
[2] Univ Calabria, DEIS, I-87036 Arcavacata Di Rende, CS, Italy
关键词
grid computing; grid programming; grid scheduling; knowledge grid; data mining;
D O I
10.1109/TSMCB.2004.836890
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data mining algorithms are widely used today for the analysis of large corporate and scientific datasets stored in databases and data archives. Industry, science, and commerce fields often need to analyze very large datasets maintained over geographically distributed sites by using the computational power of distributed and parallel systems. The grid can play a significant role in providing an effective computational support for distributed knowledge discovery applications. For the development of data mining applications on grids we designed a system called KNOWLEDGE GRID. This paper describes the KNOWLEDGE GRID framework and presents the toolset provided by the KNOWLEDGE GRID for implementing distributed knowledge discovery. The paper discusses how to design and implement data mining applications by using the KNOWLEDGE GRID tools starting from searching grid resources, composing software and data components, and executing the resulting data mining process on a grid. Some performance results are also discussed.
引用
收藏
页码:2451 / 2465
页数:15
相关论文
共 50 条
  • [41] Open active services for data-intensive distributed applications
    Collet, Christine
    Vargas-Solar, Genoveva
    Grazziotin-Ribeiro, Helena
    [J]. Proceedings of the International Database Engineering and Applications Symposium, IDEAS, 2000, : 349 - 359
  • [42] A Survey on Big Data, Mining: (Tools, Techniques, Applications and Notable Uses)
    Oweis, Nour E.
    Owais, Suhail S.
    George, Waseem
    Suliman, Mona G.
    Snasel, Vaclav
    [J]. INTELLIGENT DATA ANALYSIS AND APPLICATIONS, 2015, 370 : 109 - 119
  • [43] Using web services for remote data access and distributed applications
    Pais, V. F.
    Stancalie, V.
    [J]. FUSION ENGINEERING AND DESIGN, 2006, 81 (15-17) : 2013 - 2017
  • [44] Methods and Tools for Mining Multivariate Temporal Data in Clinical and Biomedical Applications
    Bellazzi, Riccardo
    Sacchi, Lucia
    Concaro, Stefano
    [J]. 2009 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-20, 2009, : 5629 - 5632
  • [45] DISTRIBUTED DATA MINING
    Fiolet, Valerie
    Toursel, Bernard
    [J]. SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2005, 6 (01): : 99 - 109
  • [46] The Weka4WS framework for distributed data mining in service-oriented Grids
    Talia, Domenico
    Trunfio, Paolo
    Verta, Oreste
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2008, 20 (16): : 1933 - 1951
  • [47] An architecture to support distributed data mining services in e-commerce environments
    Krishnaswamy, S
    Zaslavsky, A
    Loke, SW
    [J]. WECWIS 2000: SECOND INTERNATIONAL WORKSHOP ON ADVANCED ISSUES OF E-COMMERCE AND WEB-BASED INFORMATION SYSTEMS, PROCEEDING, 2000, : 239 - 246
  • [48] Distributed data mining and its applications to intelligent textual information processing
    Qiu, SB
    Qiu, M
    [J]. Innovations Through Information Technology, Vols 1 and 2, 2004, : 366 - 370
  • [49] Applications of distributed mining techniques for knowledge discovery in dispersed sensory data
    Bala, J
    Weng, YL
    Williams, A
    Gogia, BK
    Lesser, HK
    [J]. PROCEEDINGS OF THE 7TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2003, : 513 - 516
  • [50] An architecture for distributed search and data-mining in condition monitoring applications
    Jackson, Tom
    Fletcher, Martyn
    Liang, Bojian
    Jessop, Mark
    Austin, Jim
    [J]. 2007 IEEE AEROSPACE CONFERENCE, VOLS 1-9, 2007, : 3871 - 3882