Reliability of Centralized vs. Parallel Software Models for Composable Storage Systems

被引:1
|
作者
Blaum, Mario [1 ]
Muench, Paul [1 ]
机构
[1] IBM Res Div Almaden, San Jose, CA 95120 USA
关键词
Hyperconverged architectures; hyper-converged infrastructure (HCI); cloud applications; DIMM failure rate; metadata server; composable systems;
D O I
10.1109/QRS54544.2021.00064
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Modern storage systems consist of many hardware and software components. The core of these systems are server drawers containing data, where at least one of such drawers consists of parity (a special case is two mirrored drawers). We analyze the failure rate of two such systems both based on hyperconverged architectures: one centralized, in which the drawers share the metadata server, and one parallel, in which each drawer has its own metadata server. Inherently the parallel systems will have greater reliability. However, the new CXL and Gen-Z architectures are enabling a centralized approach where resources from multiple servers are combined to make a single virtual server. In this paper we analyze what techniques can make the probability of failure of the centralized approach approximate the probability of failure of the parallel approach. We identified the probability of Dual In-Line Memory Modules (DIMMs) failure as the key differentiator between the probability of failure of the centralized and parallel systems, and we suggest methods to compensate for DIMMs with high probability of failure.
引用
下载
收藏
页码:534 / 542
页数:9
相关论文
共 50 条
  • [21] Applying Software Reliability Growth Models to DOD Systems
    Long, E. Andrew
    Nikora, Allen P.
    23RD IEEE INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSRE 2012), 2012, : 27 - 36
  • [22] Individual storage vs. standard storage - Systematic comparison of four different storage systems
    Entzer, J.
    Muench-Berndl, K.
    Oppitz, U.
    STRAHLENTHERAPIE UND ONKOLOGIE, 2006, 182 : 105 - 105
  • [23] Student approaches to projects: Software engineering vs. information systems
    Scime, A
    INFORMATION TECHNOLOGY AND ORGANIZATIONS: TRENDS, ISSUES, CHALLENGES AND SOLUTIONS, VOLS 1 AND 2, 2003, : 584 - 586
  • [24] The parallel lives of Autonomous Systems: ASN Allocations vs. BGP
    Nemmi, Eugenio Nerio
    Sassi, Francesco
    La Morgia, Massimo
    Testart, Cecilia
    Mei, Alessandro
    Dainotti, Alberto
    PROCEEDINGS OF THE 2021 ACM INTERNET MEASUREMENT CONFERENCE, IMC 2021, 2021, : 593 - 611
  • [25] Reliability Analysis of Energy-Efficient Parallel Storage Systems
    Yin, Shu
    Li, Kenli
    Qin, Xiao
    Ruan, Xiaojun
    Chen, Haiquan
    Xie, Jiong
    Zhu, Xiaomin
    2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 1448 - 1455
  • [26] Meeting a demand vs. enhancing protections in homogeneous parallel systems
    Levitin, Gregory
    Hausken, Kjell
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2009, 94 (11) : 1711 - 1717
  • [27] ShardFS vs. IndexFS: Replication vs. Caching Strategies for Distributed Metadata Management in Cloud Storage Systems
    Xiao, Lin
    Ren, Kai
    Zheng, Qing
    Gibson, Garth A.
    ACM SOCC'15: PROCEEDINGS OF THE SIXTH ACM SYMPOSIUM ON CLOUD COMPUTING, 2015, : 236 - 249
  • [28] 4.1.0 Panel 4.1.0: Requirements Engineering for Software vs. Systems in General?
    Vienna University of Technology, ICT, Austria
    不详
    不详
    不详
    不详
    INCOSE Int. Sym., 2007, 1 (577-586):
  • [29] Performance evaluation of software RAID vs. hardware RAID for parallel virtual file system
    Hsieh, J
    Stanton, C
    Ali, R
    NINTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 2002, : 307 - 313
  • [30] Hotel reservation systems on the Internet -: Custom design vs. standard software
    Güler, S
    Klein, S
    INFORMATION AND COMMUNICATION TECHNOLOGIES IN TOURISM 1999, 1999, : 201 - 217