Building scalable data mining grid applications - An Application Description Schema and associated grid services

Cited by: 0
Authors
Stankovski, Vlado [1]
Wegener, Dennis [2]
Affiliations
[1] Univ Ljubljana, Fac Civil & Geodet Engn, Jamova Cesta 2, Ljubljana, Slovenia
[2] Fraunhofer Inst Intelligent Anal & Informat Syst, St Augustin, Germany
Keywords
grid; distributed applications; data mining; middleware
DOI
Not available
Chinese Library Classification number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Grid-enabling existing stand-alone data mining programs, data, and other resources, such as computational servers, is motivated by the possibility of sharing them via local and wide area networks. Expected benefits are improved effectiveness and efficiency, wider access, and better use of existing resources. This paper investigates how to grid-enable a variety of existing data mining programs. The presented solution is a simple procedure, which was developed under the DataMiningGrid project. The actual data mining program, which is a batch-style executable, is uploaded to a grid server, and an XML document that describes the program is prepared and registered with the underlying grid information services. The XML document conforms to an Application Description Schema and is used to facilitate discovery and execution of the program in the grid environment. Over 20 stand-alone data mining programs have already been grid-enabled using the DataMiningGrid system. With Triana, a workflow editor and manager that serves as the end-user interface to the grid infrastructure, grid-enabled data mining programs and data can be combined into complex data mining applications. Grid-enabled resource sharing may facilitate novel, scalable, distributed data mining applications that have not been possible before.
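
The describe-and-register step in the abstract can be illustrated with a minimal sketch. It is not the DataMiningGrid Application Description Schema, whose actual elements are not given in this record. Every element name, attribute, program name, and path below is hypothetical, and the in-memory list only stands in for the grid information services.

```python
import xml.etree.ElementTree as ET


def build_description(name, executable, options):
    """Build a hypothetical application description as an XML string.

    The element and attribute names are illustrative assumptions,
    not the real Application Description Schema.
    """
    app = ET.Element("application", {"name": name})
    ET.SubElement(app, "executable").text = executable
    opts = ET.SubElement(app, "options")
    for flag, datatype in options:
        ET.SubElement(opts, "option", {"flag": flag, "type": datatype})
    return ET.tostring(app, encoding="unicode")


def find_programs(registry, keyword):
    """Return the names of registered programs whose name contains keyword."""
    matches = []
    for doc in registry:
        root = ET.fromstring(doc)
        program_name = root.get("name", "")
        if keyword.lower() in program_name.lower():
            matches.append(program_name)
    return matches


# A plain list stands in for the grid information services (assumption).
registry = [
    build_description("decision-tree-learner", "bin/dtl",
                      [("-input", "file"), ("-depth", "int")]),
    build_description("text-classifier", "bin/tc", [("-corpus", "file")]),
]

print(find_programs(registry, "tree"))
```

The point of the sketch is only that a machine-readable description, once registered, lets a client discover a batch-style program by its metadata without knowing where the executable lives.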
Pages: 221 / +
Number of pages: 2
Related papers
  • [1] Building grid-enabled data-mining applications
    Depoutovitch, A
    Wainstein, A
    [J]. DR DOBBS JOURNAL, 2005, 30 (12): 41-45
  • [2] Building web services for scientific grid applications
    Kandaswamy, Gopi
    Fang, Liang
    Huang, Yi
    Shirasuna, Satoshi
    Marru, Suresh
    Gannon, Dennis
    [J]. IBM Journal of Research and Development, 2006, 50 (2-3): 249-260
  • [3] Building web services for scientific grid applications
    Kandaswamy, G
    Fang, L
    Huang, Y
    Shirasuna, S
    Marru, S
    Gannon, D
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2006, 50 (2-3): 249-260
  • [4] The application of grid on distributed data mining
    Jiang, WS
    Yu, JH
    [J]. PROCEEDINGS OF THE 11TH JOINT INTERNATIONAL COMPUTER CONFERENCE, 2005: 643-646
  • [5] Data mining on the grid for the grid
    Chawla, Nitesh V.
    Thain, Douglas
    Lichtenwalter, Ryan
    Cieslak, David A.
    [J]. 2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8, 2008: 2681-2685
  • [6] Towards a scalable Scientific Data Grid model and services
    Abdullah, Azizol
    Othman, Mohamed
    Sulaiman, Md Nasir
    Ibrahim, Hamidah
    Othman, Abu Talib
    [J]. 2008 INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING, VOLS 1-3, 2008: 20-+
  • [7] TOWARDS A SCALABLE SCIENTIFIC DATA GRID MODEL AND SERVICES
    Abdullah, Azizol
    Othman, Mohamed
    Sulaiman, Md Nasir
    Ibrahim, Hamidah
    Othman, Abu Talib
    [J]. IIUM ENGINEERING JOURNAL, 2009, 10 (02): 97-107
  • [8] The application of grid in grid services
    Zhao, JM
    Zhu, XH
    [J]. DCABES 2004, PROCEEDINGS, VOLS 1 AND 2, 2004: 37-39
  • [9] MS-Analyzer: preprocessing and data mining services for proteomics applications on the Grid
    Cannataro, Mario
    Veltri, Pierangelo
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2007, 19 (15): 2047-2066
  • [10] Data mining and life sciences applications on the grid
    Cannataro, Mario
    Guzzi, Pietro Hiram
    Sarica, Alessia
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2013, 3 (03): 216-238