Overhead analysis of scientific workflows in grid environments

被引:33
|
作者
Prodan, Radu [1 ]
Fahringer, Thomas [1 ]
机构
[1] Univ Innsbruck, Inst Comp Sci, A-6020 Innsbruck, Austria
关键词
distributed systems; distributed applications; distributed/Internet-based software engineering tools and techniques; performance measurements; monitors; performance evaluation; performance attributes;
D O I
10.1109/TPDS.2007.70734
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Scientific workflows are a topic of great interest in the Grid community that sees in the workflow model an attractive paradigm for programming distributed wide-area Grid infrastructures. Traditionally, the Grid workflow execution is approached as a pure best effort scheduling problem that maps the activities onto the Grid processors based on appropriate optimization or local matchmaking heuristics such that the overall execution time is minimized. Even though such heuristics often deliver effective results, the execution in dynamic and unpredictable Grid environments is prone to severe performance losses that must be understood for minimizing the completion time or for the efficient use of high-performance resources. In this paper, we propose a new systematic approach to help the scientists and middleware developers understand the most severe sources of performance losses that occur when executing scientific workflows in dynamic Grid environments. We introduce an ideal model for the lowest execution time that can be achieved by a workflow and explain the difference to the real measured Grid execution time based on a hierarchy of performance overheads for Grid computing. We describe how to systematically measure and compute the overheads from individual activities to larger workflow regions and adjust well-known parallel processing metrics to the scope of Grid computing, including speedup and efficiency. We present a distributed online tool for computing and analyzing the performance overheads in real time based on event correlation techniques and introduce several performance contracts as quality-of-service parameters to be enforced during the workflow execution beyond traditional best effort practices. We illustrate our method through postmortem and online performance analysis of two real-world workflow applications executed in the Austrian Grid environment.
引用
收藏
页码:378 / 393
页数:16
相关论文
共 50 条
  • [1] A Framework for the Flexible Deployment of Scientific Workflows in Grid Environments
    Fabra, Javier
    Hernandez, Sergio
    Alvarez, Pedro
    Ezpeleta, Joaquin
    [J]. THIRD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, GRIDS, AND VIRTUALIZATION (CLOUD COMPUTING 2012), 2012, : 43 - 50
  • [2] Formal Verification of Temporal Properties for Reduced Overhead in Grid Scientific Workflows
    Jun-Wei Cao
    Fan Zhang
    Ke Xu
    Lian-Chen Liu
    Cheng Wu
    [J]. Journal of Computer Science and Technology, 2011, 26 : 1017 - 1030
  • [3] Formal Verification of Temporal Properties for Reduced Overhead in Grid Scientific Workflows
    Cao, Jun-Wei
    Zhang, Fan
    Xu, Ke
    Liu, Lian-Chen
    Wu, Cheng
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2011, 26 (06) : 1017 - 1030
  • [4] Formal Verification of Temporal Properties for Reduced Overhead in Grid Scientific Workflows
    曹军威
    张帆
    许可
    刘连臣
    吴澄
    [J]. Journal of Computer Science & Technology, 2011, 26 (06) : 1017 - 1030
  • [5] Online performance monitoring and analysis of grid scientific workflows
    Truong, HL
    Fahringer, T
    [J]. ADVANCES IN GRID COMPUTING - EGC 2005, 2005, 3470 : 1154 - 1164
  • [6] Monitoring of Grid scientific workflows
    Balis, Bartosz
    Bubak, Marian
    Labno, Bartlomiej
    [J]. SCIENTIFIC PROGRAMMING, 2008, 16 (2-3) : 205 - 216
  • [7] Dynamic instrumentation, performance monitoring and analysis of Grid scientific workflows
    Truong H.-L.
    Fahringer T.
    Dustdar S.
    [J]. Journal of Grid Computing, 2005, 3 (1-2) : 1 - 18
  • [8] Execution of service workflows in grid environments
    Senna, C. R.
    Bittencourt, L. F.
    Madeira, E. R. M.
    [J]. INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS, 2010, 5 (1-2) : 88 - 108
  • [9] Monitoring Infrastructure for Grid Scientific Workflows
    Balis, Bartosz
    Bubak, Marian
    [J]. 2008 THIRD WORKSHOP ON WORKFLOWS IN SUPPORT OF LARGE-SCALE SCIENCE (WORKS 2008), 2008, : 1 - 10
  • [10] Automatically composed workflows for grid environments
    Blythe, J
    Deelman, E
    Gil, Y
    [J]. IEEE INTELLIGENT SYSTEMS, 2004, 19 (04) : 16 - 23