Scientific workflow management and the Kepler system

Cited by: 832
Authors
Ludascher, Bertram [1]
Altintas, Ilkay
Berkley, Chad
Higgins, Dan
Jaeger, Efrat
Jones, Matthew
Lee, Edward A.
Tao, Jing
Zhao, Yang
Affiliations
[1] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA
[2] Univ Calif Davis, Genome Ctr, Davis, CA 95616 USA
[3] Univ Calif San Diego, San Diego Supercomp Ctr, San Diego, CA 92093 USA
[4] Univ Calif Santa Barbara, Natl Ctr Ecol Anal & Synth, Santa Barbara, CA 93101 USA
[5] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
Keywords
scientific workflows; Grid workflows; scientific data management; problem-solving environments; dataflow networks
DOI
10.1002/cpe.994
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Many scientific disciplines are now data and information driven, and new scientific knowledge is often gained by scientists putting together data analysis and knowledge discovery 'pipelines'. A related trend is that more and more scientific communities realize the benefits of sharing their data and computational services, and are thus contributing to a distributed data and computational community infrastructure (a.k.a. 'the Grid'). However, this infrastructure is only a means to an end and ideally scientists should not be too concerned with its existence. The goal is for scientists to focus on development and use of what we call scientific workflows. These are networks of analytical steps that may involve, e.g., database access and querying steps, data analysis and mining steps, and many other steps including computationally intensive jobs on high-performance cluster computers. In this paper we describe characteristics of and requirements for scientific workflows as identified in a number of our application projects. We then elaborate on Kepler, a particular scientific workflow system, currently under development across a number of scientific data management projects. We describe some key features of Kepler and its underlying Ptolemy II system, planned extensions, and areas of future research. Kepler is a community-driven, open source project, and we always welcome related projects and new contributors to join. Copyright (c) 2005 John Wiley & Sons, Ltd.
Pages: 1039-1065
Page count: 27
Related papers
50 in total
  • [21] Scientific Workflow Management - for Whom?
    Olabarriaga, S. D.
    Jaghoori, M.
    Taffoni, G.
    Castelli, G.
    Vuerli, C.
    Pierantoni, G.
    Carley, E.
    Korkhov, V.
    Sciacca, E.
    Becciani, U.
    Bentley, B.
    2014 IEEE 10TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE), VOL 1, 2014: 298 - 305
  • [22] Scientific Workflow Management in Proteomics
    de Bruin, Jeroen S.
    Deelder, Andre M.
    Palmblad, Magnus
    MOLECULAR & CELLULAR PROTEOMICS, 2012, 11 (07)
  • [23] Flexible IO services in the Kepler Grid workflow system
    Abramson, D
    Kommineni, J
    Altintas, I
    FIRST INTERNATIONAL CONFERENCE ON E-SCIENCE AND GRID COMPUTING, PROCEEDINGS, 2005: 255 - 262
  • [24] Kepler + CometCloud: Dynamic Scientific Workflow Execution on Federated Cloud Resources
    Wang, Jianwu
    AbdelBaky, Moustafa
    Diaz-Montes, Javier
    Purawat, Shweta
    Parashar, Manish
    Altintas, Ilkay
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 : 700 - 711
  • [25] Experiences in workflow management for scientific computing
    Jablonski, S
    Stein, K
    Teschke, M
    EIGHTH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 1997: 56 - 61
  • [26] A WORKFLOW MANAGEMENT ENGINE FOR SCIENTIFIC APPLICATIONS
    Costan, Alexandru
    Cristea, Valentin
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2011, 73 (02): 73 - 88
  • [27] A Kepler Scientific Workflow to Facilitate and Standardize Marine Monitoring Sensor Parsing and Dynamic Adaption
    Li, Xiu
    Song, Jingdong
    Huang, Rongsheng
    2014 5TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2014: 1023 - 1026
  • [28] Tools, methods and services enhancing the usage of the Kepler-based scientific workflow framework
    Plociennik, Marcin
    Winczewski, Szymon
    Ciecielag, Pawel
    Imbeaux, Frederic
    Guillerminet, Bernard
    Huynh, Philippe
    Owsiak, Michal
    Spyra, Piotr
    Aniel, Thierry
    Palak, Bartek
    Zok, Tomasz
    Pych, Wojciech
    Rybicki, Jaroslaw
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2014, 29 : 1733 - 1744
  • [29] Research and Realization on the University Scientific Research Information Management System Based on Workflow
    Hu, Haiyan
    Yan, Hui
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 2897 - 2900
  • [30] SW-ONTOLOGY A Proposal for Semantic Modeling of a Scientific Workflow Management System
    Gaspar, Wander
    Silva, Laryssa
    Braga, Regina
    Campos, Fernanda
    ICEIS 2010: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 1: DATABASES AND INFORMATION SYSTEMS INTEGRATION, 2010: 115 - 120