DEE: A distributed fault tolerant workflow enactment engine for Grid computing

Authors
Duan, RB [1]
Prodan, R [1]
Fahringer, T [1]
Affiliation
[1] Univ Innsbruck, Inst Comp Sci, A-6020 Innsbruck, Austria
Keywords
Grid computing; checkpointing; dependence analysis; distributed enactment engine; fault tolerance; overhead analysis
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
It is a complex task to design and implement a workflow management system that supports scalable execution of large-scale scientific workflows in dynamic and heterogeneous Grid environments. In this paper we describe the Distributed workflow Enactment Engine (DEE) of the ASKALON Grid application development environment. DEE proposes a decentralized architecture that simplifies the management of large workflows and reduces its overhead through partitioning, improved data locality, and reduced workflow-level checkpointing overhead. We report experimental results for a real-world material science workflow application.
Pages: 704-716
Page count: 13