Workflow resiliency for large-scale distributed applications

被引:5
|
作者
Toan Nguyen [1 ]
Desideri, Jean-Antoine [1 ]
Selmin, Vittorio [2 ]
机构
[1] INRIA, Ctr Rech Grenoble Rhone Alpes, FR-38334 Saint Ismier, France
[2] Alenia Aeronaut, I-10146 Turin, Italy
关键词
workflows; resiliency; distributed computing; parallel computing; large-scale applications;
D O I
10.1109/ADVCOMP.2009.9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Large-scale simulation and optimization are demanding applications that require high-performance computing platforms. Because their economic impact is fundamental to the industry, they also require robust, seamless and effective mechanisms to support dynamic user interactions, as well as fault-tolerance and resiliency on parallel computing platforms. Distributed workflows are considered here as a means to support large-scale dynamic and resilient multiphysics simulation and optimization applications, such as multiphysics aircraft simulation.
引用
收藏
页码:7 / +
页数:2
相关论文
共 50 条
  • [1] Distributed workflow management for large-scale grid environments
    Schneider, J
    Linnert, B
    Burchard, LO
    [J]. INTERNATIONAL SYMPOSIUM ON APPLICATIONS AND THE INTERNET , PROCEEDINGS, 2006, : 229 - +
  • [2] Monitoring Workflow Applications in Large Scale Distributed Systems
    Sbirlea, Dragos
    Simion, Alina
    Pop, Florin
    Cristea, Valentin
    [J]. 2009 INTERNATIONAL CONFERENCE ON INTELLIGENT NETWORKING AND COLLABORATIVE SYSTEMS (INCOS 2009), 2009, : 162 - 169
  • [3] A Workflow for Parallel and Distributed Computing of Large-Scale Genomic Data
    Choi, Hyun-Hwa
    Kim, Byoung-Seob
    Ahn, Shin-Young
    Bae, Seung-Jo
    [J]. 2013 8TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2013, : 215 - 218
  • [4] Different aspects of workflow scheduling in large-scale distributed systems
    Stavrinides, Georgios L.
    Rodrigo Duro, Francisco
    Karatza, Helen D.
    Garcia Blas, Javier
    Carretero, Jesus
    [J]. SIMULATION MODELLING PRACTICE AND THEORY, 2017, 70 : 120 - 134
  • [5] Automatic Generation of Optimized Workflow for Distributed Computations on Large-Scale Matrices
    Sabry, Farida
    Erradi, Abdelkarim
    Nassar, Mohamed
    Malluhi, Qutaibah M.
    [J]. SERVICE-ORIENTED COMPUTING, ICSOC 2014, 2014, 8831 : 79 - 92
  • [6] An autonomic operating environment for large-scale distributed applications
    Lehman, TJ
    Deen, RG
    Kaufman, JH
    [J]. INTEGRATED COMPUTER-AIDED ENGINEERING, 2006, 13 (01) : 81 - 99
  • [7] TRACES GENERATION TO SIMULATE LARGE-SCALE DISTRIBUTED APPLICATIONS
    Dalle, Olivier
    Mancini, Emilio P.
    [J]. PROCEEDINGS OF THE 2011 WINTER SIMULATION CONFERENCE (WSC), 2011, : 2993 - 3001
  • [8] Towards a common infrastructure for large-scale distributed applications
    Nikolaou, C
    Marazakis, M
    Papadakis, D
    Yeorgiannakis, Y
    Sairamesh, J
    [J]. RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, 1997, 1324 : 173 - 193
  • [9] Performance and Cost Optimization for Multiple Large-scale Grid Workflow Applications
    Duan, Rubing
    Prodan, Radu
    Fahringer, Thomas
    [J]. 2007 ACM/IEEE SC07 CONFERENCE, 2010, : 500 - 511
  • [10] Watchdog - a workflow management system for the distributed analysis of large-scale experimental data
    Kluge, Michael
    Friedel, Caroline C.
    [J]. BMC BIOINFORMATICS, 2018, 19