Programming knowledge discovery workflows in service-oriented distributed systems

被引:8
|
作者
Cesario, Eugenio [1 ]
Lackovic, Marco [2 ]
Talia, Domenico [1 ,2 ]
Trunfio, Paolo [2 ]
机构
[1] ICAR CNR, Arcavacata Di Rende, CS, Italy
[2] Univ Calabria, DEIS, I-87036 Arcavacata Di Rende, CS, Italy
来源
关键词
distributed data mining; workflows; Grid computing; Knowledge Grid;
D O I
10.1002/cpe.2936
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In several scientific and business domains, very large data repositories are generated. To find interesting and useful information in those repositories, efficient data mining techniques and knowledge discovery processes must be used. The exploitation of data mining techniques in science helps scientists in hypothesis formation and gives them a support on their scientific practices, whereas in industrial processes, data mining can exploit existing data sources as a real value for companies that can take advantage from the knowledge that can be extracted from their large data sources. Data mining tasks are often composed by multiple stages that may be linked to each other to form various execution flows. Moreover, data mining tasks are often distributed because they involve data and tools located over geographically distributed environments. Therefore, it is fundamental to exploit effective paradigms, such as services and workflows, to model data mining tasks that are both multi-staged and distributed. This paper discusses data mining services and workflows for analyzing scientific data in high-performance distributed environments such as Grids and Clouds. We discuss how it is possible to define basic and complex services for supporting distributed data mining tasks in Grids. We also present a workflow formalism and a service-oriented programming framework, named DIS3GNO, for designing and running distributed knowledge discovery processes in the Knowledge Grid system. DIS3GNO supports all the phases of a knowledge discovery process, including composition, execution, and results visualization. After introducing DIS3GNO, some relevant use cases implemented by it and a performance evaluation of the system are discussed. Copyright (C) 2012 John Wiley & Sons, Ltd.
引用
收藏
页码:1482 / 1504
页数:23
相关论文
共 50 条
  • [21] Distributed Service-Oriented Robotics
    Remy, Sekou L.
    Blake, M. Brian
    IEEE INTERNET COMPUTING, 2011, 15 (02) : 70 - 74
  • [22] SORC: Service-Oriented Distributed Revision Control for Collaborative Web Programming
    Bin Sarib, Ahmad Sholehin
    Shen, Haifeng
    PROCEEDINGS OF THE 2014 IEEE 18TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2014, : 638 - 643
  • [23] Service-Oriented Knowledge Management
    Dai, Wei
    Rubin, Stuart H.
    2012 IEEE 13TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2012, : 556 - 563
  • [24] Automating Cataloging and Discovery of Services for Service-Oriented Robotic Systems
    Oliveira, Lucas Bueno R.
    Martins, Diogo Brandao
    Amaral, Felipe Augusto
    Oquendo, Flavio
    Nakagawa, Elisa Yumi
    2014 2ND BRAZILIAN ROBOTICS SYMPOSIUM (SBR) / 11TH LATIN AMERICAN ROBOTICS SYMPOSIUM (LARS) / 6TH ROBOCONTROL WORKSHOP ON APPLIED ROBOTICS AND AUTOMATION, 2014, : 151 - 156
  • [25] Worklets: A service-oriented implementation of dynamic flexibility in workflows
    Adams, Michael
    ter Hofstede, Arthur H. M.
    Edmond, David
    van der Aalst, Wil M. P.
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2006: COOPIS, DOA, GADA, AND ODBAS, PT 1, PROCEEDINGS, 2006, 4275 : 291 - 308
  • [26] Amadeus: A holistic service-oriented environment for grid workflows
    Brandic, Ivona
    Pllana, Sabri
    Benkner, Siegfried
    GCC 2006: FIFTH INTERNATIONAL CONFERENCE ON GRID AND COOPERATIVE COMPUTING WORKSHOPS, PROCEEDINGS, 2006, : 259 - +
  • [27] A Procedure for Modeling and Analysis of Service-Oriented and Distributed Productive Systems
    Garcia, Jose I.
    Junqueira, Fabricio
    Morales, Roy A.
    Miyagi, Paulo E.
    2008 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING, VOLS 1 AND 2, 2008, : 941 - +
  • [28] Introducing Trust in Service-oriented Distributed Systems through Blockchain
    Autili, Marco
    Gallo, Francesco
    Inverardi, Paola
    Pompilio, Claudio
    Tivoli, Massimo
    2019 IEEE 30TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSREW 2019), 2019, : 149 - 154
  • [29] Software brokers for quality of services in service-oriented distributed systems
    Xu, JY
    Du, WC
    SECOND ANNUAL CONFERENCE ON COMMUNICATION NETWORKS AND SERVICES RESEARCH, PROCEEDINGS, 2004, : 341 - 344
  • [30] Distributed policy specification and enforcement in service-oriented business systems
    Tsai, WT
    Liu, XX
    Chen, YO
    ICEBE 2005: IEEE INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING, PROCEEDINGS, 2005, : 10 - 17