Scaling up workflow-based applications

被引:31
|
作者
Callaghan, Scott [2 ]
Deelman, Ewa [1 ]
Gunter, Dan [5 ]
Juve, Gideon [2 ]
Maechling, Philip [2 ]
Brooks, Christopher [6 ]
Vahi, Karan [1 ]
Milner, Kevin [2 ]
Graves, Robert [3 ]
Field, Edward [4 ]
Okaya, David [2 ]
Jordan, Thomas [2 ]
机构
[1] USC Informat Sci Inst, Marina Del Rey, CA 90292 USA
[2] Univ So Calif, Los Angeles, CA 90089 USA
[3] URS Corp, Pasadena, CA 91101 USA
[4] US Geol Survey, Pasadena, CA 91106 USA
[5] Univ Calif Berkeley, Lawrence Berkeley Lab, Berkeley, CA 94720 USA
[6] Univ San Francisco, San Francisco, CA 94117 USA
基金
美国国家科学基金会;
关键词
Scientific workflows; Distributed applications; Workflow scalability;
D O I
10.1016/j.jcss.2009.11.005
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Scientific applications, often expressed as workflows are making use of large-scale national cyberinfrastructure to explore the behavior of systems, search for phenomena in large-scale data, and to conduct many other scientific endeavors As the complexity of the systems being studied grows and as the data set sizes Increase, the scale of the computational workflows increases as well. In some cases, workflows now have hundreds of thousands of individual tasks Managing such scale is difficult from the point of view of workflow description, execution, and analysis In this paper, we describe the challenges faced by workflow management and performance analysis systems when dealing with an earthquake science application. CyberShake, executing on the TeraGrid. The scientific goal of the SCEC CyberShake project is to calculate probabilistic seismic hazard curves for sites in Southern California. For each site of interest, the CyberShake platform includes two large-scale MPI calculations and approximately 840,000 embarrassingly parallel post-processing jobs. In this paper, we show how we approach the scalability challenges in our workflow management and log mining systems. (C) 2009 Elsevier Inc. All rights reserved.
引用
收藏
页码:428 / 446
页数:19
相关论文
共 50 条
  • [31] Workflow-Based Architecture for Collaborative Video Annotation
    Hofmann, Cristian
    Hollender, Nina
    Fellner, Dieter W.
    [J]. ONLINE COMMUNITIES AND SOCIAL COMPUTING, PROCEEDINGS, 2009, 5621 : 33 - +
  • [32] Workflow-based information system for furniture budgeting
    Vidal, JC
    Lama, M
    Bugarín, A
    Barro, S
    [J]. ETFA 2003: IEEE CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION, VOL 1, PROCEEDINGS, 2003, : 54 - 60
  • [33] Workflow-Based Internet Platform for Mass Supercomputing
    Biryal'tsev, E. V.
    Galimov, M. R.
    Elizarov, A. M.
    [J]. LOBACHEVSKII JOURNAL OF MATHEMATICS, 2018, 39 (05) : 647 - 654
  • [34] A Workflow-based Cooperative Project Management System
    Chen, Yi
    Hou, Kun
    Wang, Rui
    [J]. 2011 TENTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE (DCABES), 2011, : 69 - 73
  • [35] A workflow-based web service composition system
    Karakoc, E.
    Kardas, K.
    Senkul, P.
    [J]. 2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WORKSHOPS PROCEEDINGS, 2006, : 113 - +
  • [36] WorkFlow-Based New Development Tool for MIS
    Yang Guojun
    Zheng Ying
    Cheng Wenjing
    [J]. 2009 INTERNATIONAL FORUM ON INFORMATION TECHNOLOGY AND APPLICATIONS, VOL 2, PROCEEDINGS, 2009, : 715 - +
  • [37] Automated Analysis of Industrial Workflow-based Models
    Cortes-Cornax, Mario
    Krishna, Ajay
    Mos, Adrian
    Salaun, Gwen
    [J]. 33RD ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2018, : 120 - 127
  • [38] Workflow-based knowledge flow modeling and control
    Zhang, Xiao-Gang
    Li, Ming-Shu
    [J]. Ruan Jian Xue Bao/Journal of Software, 2005, 16 (02): : 184 - 193
  • [39] Ontological formalization for workflow-based computational experiments
    Smirnov, Pavel A.
    [J]. 4TH INTERNATIONAL YOUNG SCIENTIST CONFERENCE ON COMPUTATIONAL SCIENCE, 2015, 66 : 487 - 495
  • [40] Workflow-based grid portal for quantum mechanics
    Byun, SW
    Lee, YK
    Kwon, YW
    Ryu, SH
    Jeong, CS
    [J]. GRID AND COOPERATIVE COMPUTING GCC 2004 WORKSHOPS, PROCEEDINGS, 2004, 3252 : 625 - 632