A Service-Oriented Framework for Executing Data Mining Workflows on Grids

被引:0
|
作者
Lackovic, Marco [1 ]
Talia, Domenico [1 ]
Trunfio, Paolo [1 ]
机构
[1] Univ Calabria, DEIS, I-87036 Arcavacata Di Rende, CS, Italy
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Workflow environments are widely used in data mining systems to manage data and execution flows associated to complex applications. Weka, one of the most used open-source data mining systems, includes the KnowledgeFlow environment which provides a drag-and-drop interface to compose and execute data mining workflows. The Weka KnowledgeFlow allows users to execute a whole workflow only on a single computer. On the other hand, most data mining workflows include several independent branches that could be run in parallel on a set of distributed machines to reduce the overall execution time. We implemented distributed workflow execution in Weka4WS, a framework that extends Weka and its KnowledgeFlow environment to exploit distributed resources available in a Grid using Web Service technologies. In this paper we describe the Weka4WS architecture and the functionalities provided by its service-oriented KnowledgeFlow component, showing its use to compose and execute simple parallel data mining workflows. Furthermore, we present ongoing work aimed at supporting also data-parallel workflows on a Grid.
引用
收藏
页码:70 / 77
页数:8
相关论文
共 50 条
  • [21] Service-oriented Production Grids and user support
    Terstyanszky, Gabor
    Kiss, Tamas
    Delaitre, Thierry
    Winter, Stephen
    Kacsuk, Peter
    Kecskemeti, Gabor
    [J]. 2006 7TH IEEE/ACM INTERNATIONAL CONFERENCE ON GRID COMPUTING, 2006, : 323 - +
  • [22] A protocol for recording provenance in service-oriented grids
    Groth, P
    Luck, M
    Moreau, L
    [J]. PRINCIPLES OF DISTRIBUTED SYSTEMS, 2005, 3544 : 124 - 139
  • [23] Data mining and service rating in service-oriented architectures to improve information sharing
    Chen, Ying
    Cohen, Brad
    [J]. 2005 IEEE Aerospace Conference, Vols 1-4, 2005, : 3246 - 3256
  • [24] A service-oriented framework for remote sensing big data processing
    Enayati, Roohollah
    Ravanmehr, Reza
    Aghazarian, Vahe
    [J]. EARTH SCIENCE INFORMATICS, 2023, 16 (01) : 591 - 616
  • [25] A service-oriented framework for remote sensing big data processing
    Roohollah Enayati
    Reza Ravanmehr
    Vahe Aghazarian
    [J]. Earth Science Informatics, 2023, 16 : 591 - 616
  • [26] A service management framework for service-oriented enterprises
    Huang, Y
    Kumaran, S
    Chung, JY
    [J]. CEC 2004: IEEE INTERNATIONAL CONFERENCE ON E-COMMERCE TECHNOLOGY, PROCEEDINGS, 2004, : 181 - 186
  • [27] Anteater: A service-oriented architecture for high-performance data mining
    Guedes, Dorgival
    Meira, Wagner, Jr.
    Ferreira, Renato
    [J]. IEEE INTERNET COMPUTING, 2006, 10 (04) : 36 - 43
  • [28] Orange4WS Environment for Service-Oriented Data Mining
    Podpecan, Vid
    Zemenova, Monika
    Lavrac, Nada
    [J]. COMPUTER JOURNAL, 2012, 55 (01): : 82 - 98
  • [29] An extensible service oriented distributed data mining framework
    Kumar, A
    Kantardzic, M
    Ramaswamy, P
    Sadeghian, P
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA'04), 2004, : 256 - 263
  • [30] A SERVICE-ORIENTED FRAMEWORK FOR MAS MODELING
    Yves, Wautelet
    Youssef, Achbany
    Manuel, Kolp
    [J]. ICEIS 2008: PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL ISAS-1: INFORMATION SYSTEMS ANALYSIS AND SPECIFICATION, VOL 1, 2008, : 120 - 128