A Service-Oriented Framework for Executing Data Mining Workflows on Grids

Cited by: 0
Authors
Lackovic, Marco [1]
Talia, Domenico [1]
Trunfio, Paolo [1]
Affiliations
[1] Univ Calabria, DEIS, I-87036 Arcavacata Di Rende, CS, Italy
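DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Subject classification code
0812
Abstract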
Workflow environments are widely used in data mining systems to manage the data and execution flows associated with complex applications. Weka, one of the most widely used open-source data mining systems, includes the KnowledgeFlow environment, which provides a drag-and-drop interface for composing and executing data mining workflows. The Weka KnowledgeFlow, however, can execute a whole workflow only on a single computer, even though most data mining workflows include several independent branches that could be run in parallel on a set of distributed machines to reduce the overall execution time. We implemented distributed workflow execution in Weka4WS, a framework that extends Weka and its KnowledgeFlow environment to exploit distributed resources available in a Grid using Web Service technologies. In this paper we describe the Weka4WS architecture and the functionalities provided by its service-oriented KnowledgeFlow component, showing how it can be used to compose and execute simple parallel data mining workflows. Furthermore, we present ongoing work aimed at also supporting data-parallel workflows on a Grid.
Pages: 70-77
Page count: 8