Scientific workflow management and the Kepler system

被引:832
|
作者
Ludascher, Bertram [1 ]
Altintas, Ilkay
Berkley, Chad
Higgins, Dan
Jaeger, Efrat
Jones, Matthew
Lee, Edward A.
Tao, Jing
Zhao, Yang
机构
[1] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA
[2] Univ Calif Davis, Genome Ctr, Davis, CA 95616 USA
[3] Univ Calif San Diego, San Diego Supercomp Ctr, San Diego, CA 92093 USA
[4] Univ Calif Santa Barbara, Natl Ctr Ecol Anal & Synth, Santa Barbara, CA 93101 USA
[5] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
来源
关键词
scientific workflows; Grid workflows; scientific data management; problem-solving environments; dataflow networks;
D O I
10.1002/cpe.994
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many scientific disciplines are now data and information driven, and new scientific knowledge is often gained by scientists putting together data analysis and knowledge discovery 'pipelines'. A related trend is that more and more scientific communities realize the benefits of sharing their data and computational services, and are thus contributing to a distributed data and computational community infrastructure (a.k.a. 'the Grid'). However, this infrastructure is only a means to an end and ideally scientists should not be too concerned with its existence. The goal is for scientists to focus on development and use of what we call scientific workflows. These are networks of analytical steps that may involve, e.g., database access and querying steps, data analysis and mining steps, and many other steps including computationally intensive jobs on high-performance cluster computers. In this paper we describe characteristics of and requirements for scientific workflows as identified in a number of our application projects. We then elaborate on Kepler, a particular scientific workflow system, currently under development across a number of scientific data management projects. We describe some key features of Kepler and its underlying Ptolemy II system, planned extensions, and areas of future research. Kepler is a community-driven, open source project, and we always welcome related projects and new contributors to join. Copyright (c) 2005 John Wiley & Sons, Ltd.
引用
收藏
页码:1039 / 1065
页数:27
相关论文
共 50 条
  • [41] Enabling scalable scientific workflow management in the Cloud
    Zhao, Yong
    Li, Youfu
    Raicu, Ioan
    Lu, Shiyong
    Tian, Wenhong
    Liu, Heng
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2015, 46 : 3 - 16
  • [42] Scientific workflow management in a distributed production environment
    Baker, N
    McClatchey, R
    LeGoff, JM
    FIRST INTERNATIONAL ENTERPRISE DISTRIBUTED OBJECT COMPUTING WORKSHOP, PROCEEDINGS, 1997, : 291 - 299
  • [43] Challenges of Provenance in Scientific Workflow Management Systems
    Alam, Khairul
    Roy, Banani
    2022 IEEE/ACM WORKSHOP ON WORKFLOWS IN SUPPORT OF LARGE-SCALE SCIENCE, WORKS, 2022, : 10 - 18
  • [44] Natural Language Processing using Kepler Workflow System: First Steps
    Goyal, Ankit
    Singh, Alok
    Bhargava, Shitij
    Crawl, Daniel
    Altintas, Ilkay
    Hsu, Chun-Nan
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 : 712 - 721
  • [45] Optimising workflow using a workflow management system
    Vaandering, A.
    Coevoet, M.
    RADIOTHERAPY AND ONCOLOGY, 2016, 119 : S287 - S287
  • [46] Kepler:: An extensible system for design and execution of scientific workflows
    Altintas, I
    Berkley, C
    Jaeger, E
    Jones, M
    Ludäscher, B
    Mock, S
    16TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2004, : 423 - 424
  • [47] Kepler-Based Collaborative Workflow System for Metabolic Syndrome Estimation
    Youn, Chan-Hyun
    Kim, Dong Hyun
    Lim, Soo
    Jang, Hak Chul
    Shim, Eun Bo
    Choi, Yeon Shik
    Park, Hyo-Derk
    Lee, Hong Kyu
    TENCON 2010: 2010 IEEE REGION 10 CONFERENCE, 2010, : 1481 - 1485
  • [48] Kepler: a scientific biography
    Gualandi, Andrea
    JOURNAL FOR THE HISTORY OF ASTRONOMY, 2009, 40 : 219 - 219
  • [49] Version management of workflow system
    Wu, Shaofei
    Liu, Bin
    ADVANCING SCIENCE THROUGH COMPUTATION, 2008, : 319 - 321
  • [50] Workflow Management System for DMS
    Nedic, Nemanja
    Svenda, Goran
    INFORMATION TECHNOLOGY AND CONTROL, 2013, 42 (04): : 380 - 392