Reliability of Centralized vs. Parallel Software Models for Composable Storage Systems

被引:1
|
作者
Blaum, Mario [1 ]
Muench, Paul [1 ]
机构
[1] IBM Res Div Almaden, San Jose, CA 95120 USA
关键词
Hyperconverged architectures; hyper-converged infrastructure (HCI); cloud applications; DIMM failure rate; metadata server; composable systems;
D O I
10.1109/QRS54544.2021.00064
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Modern storage systems consist of many hardware and software components. The core of these systems are server drawers containing data, where at least one of such drawers consists of parity (a special case is two mirrored drawers). We analyze the failure rate of two such systems both based on hyperconverged architectures: one centralized, in which the drawers share the metadata server, and one parallel, in which each drawer has its own metadata server. Inherently the parallel systems will have greater reliability. However, the new CXL and Gen-Z architectures are enabling a centralized approach where resources from multiple servers are combined to make a single virtual server. In this paper we analyze what techniques can make the probability of failure of the centralized approach approximate the probability of failure of the parallel approach. We identified the probability of Dual In-Line Memory Modules (DIMMs) failure as the key differentiator between the probability of failure of the centralized and parallel systems, and we suggest methods to compensate for DIMMs with high probability of failure.
引用
下载
收藏
页码:534 / 542
页数:9
相关论文
共 50 条
  • [31] Data Center Investment vs. System Reliability in Power Distribution Systems
    Wiboonrat, Montri
    2019 14TH ANNUAL CONFERENCE SYSTEM OF SYSTEMS ENGINEERING (SOSE), 2019, : 146 - 151
  • [32] Analysis of Parallel Discrete Systems: Persistent Sets vs. Concurrent Simulation
    Karatkevich, Andrei
    PRZEGLAD ELEKTROTECHNICZNY, 2009, 85 (07): : 182 - 184
  • [33] Software Process Models vs. Descriptions: What do Practitioners Use and Need?
    Diebold, Philipp
    Scherr, Simon Andre
    2016 IEEE/ACM INTERNATIONAL CONFERENCE ON SOFTWARE AND SYSTEM PROCESSES (ICSSP), 2016, : 66 - 75
  • [34] Testing of Automotive Systems - Complex vs. Simple Environment Models
    Sobotka, Jan
    Krejci, Lukas
    2018 16TH BIENNIAL BALTIC ELECTRONICS CONFERENCE (BEC), 2018,
  • [35] Information-centric vs. storage/data-centric systems
    Milligan, Charles
    Halladay, Steve
    Hansen, Deren
    ICEIS 2006: PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS: INFORMATION SYSTEMS ANALYSIS AND SPECIFICATION, 2006, : 501 - +
  • [36] Residential vs. community battery storage systems Consumer preferences in Germany
    Kalkbrenner, Bernhard J.
    ENERGY POLICY, 2019, 129 : 1355 - 1363
  • [37] Predicted vs. Actual Bit Error Rates for MMS HPCA Memories and Their Impact on Software Reliability
    Wood, Paul
    Furman, Judith
    Monreal, Roberto
    2021 IEEE AEROSPACE CONFERENCE (AEROCONF 2021), 2021,
  • [38] Design patterns as components of functional models for analyzing the reliability of software systems
    Araujo, K
    Bowles, JB
    ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM, 2004 PROCEEDINGS, 2004, : 184 - 189
  • [39] Use of software reliability models for the maintenance of information systems: a case study
    Catelani, M.
    Salvaneschi, P.
    Zanobini, A.
    Mugnaini, M.
    Conference Record - IEEE Instrumentation and Measurement Technology Conference, 1994, 2 : 640 - 643
  • [40] Evaluation of Standard Reliability Growth Models in the Context of Automotive Software Systems
    Rana, Rakesh
    Staron, Miroslaw
    Mellegard, Niklas
    Berger, Christian
    Hansson, Jorgen
    Nilsson, Martin
    Toerner, Fredrik
    PRODUCT-FOCUSED SOFTWARE PROCESS IMPROVEMENT, 2013, 7983 : 324 - 329