Fault-Tolerance in the Scope of Cloud Computing

被引:9
|
作者
Rehman, A. U. [1 ]
Aguiar, Rui L.
Barraca, Joao Paulo
机构
[1] Inst Telecomunicacoes, P-3810193 Aveiro, Portugal
关键词
Cloud computing; Fault tolerant systems; Fault tolerance; Computer architecture; Computational modeling; Measurement; Servers; fault-tolerance; system-level metrics; component-level metrics; fault-tolerance frameworks; fog computing; 5G networks; edge computing; emerging cloud technologies; REAL-TIME; CHALLENGES; OPPORTUNITIES; TAXONOMY; NETWORK; ARCHITECTURE; RELIABILITY; FOOTPRINT; TRENDS; ISSUES;
D O I
10.1109/ACCESS.2022.3182211
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fault-tolerance methods are required to ensure high availability and high reliability in cloud computing environments. In this survey, we address fault-tolerance in the scope of cloud computing. Recently, cloud computing-based environments have presented new challenges to support fault-tolerance and opened new paths to develop novel strategies, architectures, and standards. We provide a detailed background of cloud computing to establish a comprehensive understanding of the subject, from basic to advanced. We then highlight fault-tolerance components and system-level metrics and identify the needs and applications of fault-tolerance in cloud computing. Furthermore, we discuss state-of-the-art proactive and reactive approaches to cloud computing fault-tolerance. We further structure and discuss current research efforts on cloud computing fault-tolerance architectures and frameworks. Finally, we conclude by enumerating future research directions specific to cloud computing fault-tolerance development.
引用
收藏
页码:63422 / 63441
页数:20
相关论文
共 50 条
  • [1] A Fault-Tolerance Shim for Serverless Computing
    Sreekanti, Vikram
    Wu, Chenggang
    Chhatrapati, Saurav
    Gonzalez, Joseph E.
    Hellerstein, Joseph M.
    Faleiro, Jose M.
    [J]. PROCEEDINGS OF THE FIFTEENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS (EUROSYS'20), 2020,
  • [2] A new fault-tolerance framework for grid computing
    Derbal, Youcef
    [J]. MULTIAGENT AND GRID SYSTEMS, 2006, 2 (02) : 115 - 133
  • [3] Fault-tolerance approaches for distributed and cloud computing environments: A systematic review, taxonomy and future directions
    Kirti, Medha
    Maurya, Ashish Kumar
    Yadav, Rama Shankar
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (13):
  • [4] Fault-Tolerance in the Scope of Software-Defined Networking (SDN)
    Rehman, A. U.
    Aguiar, Rui L.
    Barraca, Joao Paulo
    [J]. IEEE ACCESS, 2019, 7 : 124474 - 124490
  • [5] A lightweight software fault-tolerance system in the cloud environment
    Chen, Gang
    Jin, Hai
    Zou, Deqing
    Zhou, Bing Bing
    Qiang, Weizhong
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (12): : 2982 - 2998
  • [6] An Efficient Intermediate Data Fault-Tolerance Approach in the Cloud
    Song, Baoyan
    Ren, Cai
    Li, Xuecheng
    Ding, Linlin
    [J]. 2014 11TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA), 2014, : 203 - 206
  • [7] Toward a Smart Cloud: A Review of Fault-Tolerance Methods in Cloud Systems
    Mukwevho, Mukosi Abraham
    Celik, Turgay
    [J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2021, 14 (02) : 589 - 605
  • [8] A SCHEME OF DATA CONFIDENTIALITY AND FAULT-TOLERANCE IN CLOUD STORAGE
    Fu, Yongkang
    Sun, Bin
    [J]. 2012 IEEE 2ND INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENT SYSTEMS (CCIS) VOLS 1-3, 2012, : 228 - 233
  • [9] METHODS AND MODELS FOR COMPUTING SURVIVABILITY AND FAULT-TOLERANCE OF A NETWORK
    GAGIN, AA
    [J]. MICROELECTRONICS AND RELIABILITY, 1993, 33 (10): : 1533 - 1552
  • [10] FAULT-TOLERANCE
    GROSSPIETSCH, KE
    [J]. MICROPROCESSING AND MICROPROGRAMMING, 1993, 38 (1-5): : 783 - 783