Performance evaluation of automatic checkpoint-based fault tolerance for AMPI and Charm++

Cited by: 5
Institution
Department of Computer Science, University of Illinois at Urbana-Champaign
Source
Oper Syst Rev ACM, 2006, no. 2, pp. 90-99
Keywords
Fault tolerant computer systems
DOI
10.1145/1131322.1131340
Related papers
50 in total
  • [21] Research on Fault Tolerance Strategy Based on Two Level Checkpoint Server in Autonomous Vehicular Cloud
    Fan, Jun
    Li, Ru
    Zhang, Xin
    PROCEEDINGS OF 2017 IEEE 7TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC), 2017, : 381 - 384
  • [22] Performance and reliability evaluation of passive replication schemes in application level fault tolerance
    Garg, S
    Huang, YN
    Kintala, CMR
    Trivedi, KS
    Yajnik, S
    TWENTY-NINTH ANNUAL INTERNATIONAL SYMPOSIUM ON FAULT-TOLERANT COMPUTING, DIGEST OF PAPERS, 1999, : 322 - 329
  • [23] Performance Evaluation of Data Utility for a Differential Privacy Scheme Supporting Fault Tolerance
    Zhang, Lei
    Wang, Mingxiang
    Xiu, Jianxin
    SYMMETRY-BASEL, 2023, 15 (10):
  • [24] Performance evaluation of adaptive routing algorithms for achieving fault tolerance in NoC fabrics
    Zhu, Haibo
    Pande, Partha Pratim
    Grecu, Cristian
    2007 IEEE INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES, AND PROCESSORS, 2007, : 42 - +
  • [25] A decentralized fault tolerance model based on level of performance for grid environment
    Mohammed Rebbah
    Yahya Slimani
    Abdelkader Benyettou
    Lionel Brunie
    Cluster Computing, 2016, 19 : 13 - 27
  • [26] A decentralized fault tolerance model based on level of performance for grid environment
    Rebbah, Mohammed
    Slimani, Yahya
    Benyettou, Abdelkader
    Brunie, Lionel
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (01): : 13 - 27
  • [27] MEASUREMENT-BASED EVALUATION OF OPERATING SYSTEM FAULT-TOLERANCE
    LEE, I
    TANG, D
    IYER, RK
    HSUEH, MC
    IEEE TRANSACTIONS ON RELIABILITY, 1993, 42 (02) : 238 - 249
  • [28] Algorithm-based fault tolerance applied to high performance computing
    Bosilca, George
    Delmas, Remi
    Dongarra, Jack
    Langou, Julien
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2009, 69 (04) : 410 - 416
  • [29] Improving byzantine fault tolerance based on stake evaluation and consistent hashing
    Wu, Guangfu
    Lai, Xin
    He, Daojing
    Chan, Sammy
    Fu, Xiaoyan
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2024, 17 (04) : 1963 - 1975
  • [30] Performance-based fault detection and fault-tolerant control for automatic control systems
    Li, Linlin
    Luo, Hao
    Ding, Steven X.
    Yang, Ying
    Peng, Kaixiang
    AUTOMATICA, 2019, 99 : 308 - 316