High performance fault-tolerance for clouds

被引:0
|
作者
Kyriazis, Dimosthenis [1 ]
Anagnostopoulos, Vasileios [1 ]
Arcangeli, Andrea [2 ]
Gilbert, David [2 ]
Kalogeras, Dimitrios [3 ]
Kat, Ronen [4 ]
Klein, Cristian [5 ]
Kokkinos, Panagiotis [3 ]
Kuperman, Yossi [4 ]
Nider, Joel [4 ]
Svard, Petter [5 ]
Tomas, Luis [5 ]
Varvarigos, Emmanuel [3 ]
Varvarigou, Theodora [1 ]
机构
[1] Natl Tech Univ Athens, Iroon Polytech 9, Athens, Greece
[2] Red Hat Ltd, Cork, Ireland
[3] Patras Univ Campus, Comp Technol Inst & Press Diophantus, Rion, Greece
[4] IBM Haifa Res Lab, Haifa, Israel
[5] Umea Univ, SE-90187 Umea, Sweden
关键词
cloud computing; fault-tolerance; high-performance; live-migration; resource consolidation;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Cloud computing and virtualized infrastructures are currently the baseline environments for the provision of services in different application domains. While the number of service consumers increasingly grows, service providers aim at exploiting infrastructures that enable non-disruptive service provisioning, thus minimizing or even eliminating downtime. Nonetheless, to achieve the latter current approaches are either application-specific or cost inefficient, requiring the use of dedicated hardware. In this paper we present the reference architecture of a fault-tolerance scheme, which not only enhances cloud environments with the aforementioned capabilities but also achieves high-performance as required by mission critical every day applications. To realize the proposed approach, a new paradigm for memory and I/O externalization and consolidation is introduced, while current implementation references are also provided.
引用
下载
收藏
页码:251 / 257
页数:7
相关论文
共 50 条
  • [1] FTS: A high-performance CORBA fault-tolerance service
    Friedman, R
    Hadad, E
    PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL WORKSHOP ON OBJECT-ORIENTED REAL-TIME DEPENDABLE SYSTEMS, 2002, : 61 - 68
  • [2] High speed dynamic fault-tolerance
    Sengupta, J
    Bansal, PK
    IEEE REGION 10 INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONIC TECHNOLOGY, VOLS 1 AND 2, 2001, : 669 - 675
  • [3] Quantitative Fault-Tolerance for Reliable Workflows on Heterogeneous IaaS Clouds
    Xie, Guoqi
    Zeng, Gang
    Li, Renfa
    Li, Keqin
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2020, 8 (04) : 1223 - 1236
  • [4] FAULT-TOLERANCE
    GROSSPIETSCH, KE
    MICROPROCESSING AND MICROPROGRAMMING, 1993, 38 (1-5): : 783 - 783
  • [5] Designing masking fault-tolerance via nonmasking fault-tolerance
    Arora, A
    Kulkarni, SS
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1998, 24 (06) : 435 - 450
  • [6] Scaleability, performance, and fault-tolerance of PACS architectures
    Blume, H
    Prior, F
    di Pierro, MC
    Goble, J
    Logdberg, J
    Kenney, RS
    Goeringer, F
    MEDICAL IMAGING 1998 - PACS DESIGN AND EVALUATION: ENGINEERING AND CLINICAL ISSUES, 1998, 3339 : 112 - 126
  • [7] PERFORMANCE AND FAULT-TOLERANCE OF NEURAL NETWORKS FOR OPTIMIZATION
    PROTZEL, PW
    PALUMBO, DL
    ARRAS, MK
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1993, 4 (04): : 600 - 614
  • [8] Resilience for Collaborative Applications on Clouds Fault-Tolerance for Distributed HPC Applications
    Toan Nguyen
    Desideri, Jean-Antoine
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2012, PT IV, 2012, 7336 : 418 - 433
  • [9] ON FAULT-TOLERANCE OF SYNTAX
    SLISSENKO, AO
    THEORETICAL COMPUTER SCIENCE, 1993, 119 (01) : 215 - 222
  • [10] A Two-Level Fault-Tolerance Technique for High Performance Computing Applications
    Aseeri, Aishah M.
    Fadel, Mai A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (12) : 46 - 54