Algorithms for optimal replica placement under correlated failure in hierarchical failure domains

Cited by: 0
Authors
Mills, K. Alex [1]
Chandrasekaran, R. [1]
Mittal, Neeraj [1]
Affiliations
[1] Univ Texas Dallas, 800 W Campbell Rd, Richardson, TX 75080 USA
Funding
U.S. National Science Foundation
Keywords
Replica placement; Correlated failure; Combinatorial optimization; Fault-tolerant storage; Data center management; Reliability
DOI
10.1016/j.tcs.2020.01.004
Chinese Library Classification number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
In data centers, data replication is the primary method used to ensure availability of customer data. To avoid correlated failure, cloud storage infrastructure providers model hierarchical failure domains using a tree, and avoid placing a large number of data replicas within the same failure domain (i.e., on the same branch of the tree). Typical best practices ensure that replicas are distributed across failure domains, but relatively little is known concerning optimization algorithms for distributing data replicas. Using a hierarchical model, we answer how to distribute replicas across failure domains optimally. We formulate a novel optimization problem for replica placement in data centers. As part of our problem, we formalize and present a new criterion for optimizing a replica placement. Our overall goal is to choose placements in which correlated failures disable as few replicas as possible. In this work, we provide two optimization algorithms for dependency models represented by trees. We first present an $O(n + \rho \log \rho)$ time dynamic programming algorithm for optimally placing $\rho$ replicas of a single block on the leaves (representing servers) of a tree with $n$ vertices. We next consider the problem of optimally placing replicas of multiple blocks of data, where every block may have a different replication factor. For this problem, we give a dynamic programming algorithm that runs in $O(n \rho_{\max}^{3} \delta^{2} m^{\mathrm{poly}(\delta)})$ time, where $m$ denotes the number of blocks, $\rho_{\max}$ denotes the maximum replication factor of a block, and $\delta$ denotes the maximum difference in the replication factors of any two blocks. The running time of the algorithm is polynomial when $\delta$, which we refer to as the skew, is a constant. (C) 2020 Elsevier B.V. All rights reserved.
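The tree model in the abstract can be illustrated with a small sketch. It is not the paper's algorithm or its optimization criterion; it only counts, for a given placement, how many replicas of one block sit inside each failure domain, which is the number lost if that whole domain fails. The topology, the node names, and the helpers `replicas_per_domain` and `worst_domain_loss` are hypothetical and chosen for illustration.

```python
from collections import defaultdict


def replicas_per_domain(parent, placement):
    """Count replicas inside every failure domain of a tree.

    parent:    dict mapping each node to its parent (the root maps to None).
    placement: dict mapping a leaf (server) to the number of replicas on it.
    Returns a dict mapping each node that contains at least one replica to
    the number of replicas in its subtree.
    """
    lost = defaultdict(int)
    for leaf, count in placement.items():
        node = leaf
        # Walk from the server up to the root, crediting every enclosing domain.
        while node is not None:
            lost[node] += count
            node = parent[node]
    return dict(lost)


def worst_domain_loss(parent, placement):
    """Largest replica count in any domain other than the root."""
    lost = replicas_per_domain(parent, placement)
    return max(n for node, n in lost.items() if parent[node] is not None)


# Hypothetical topology: one data center, two racks, two servers per rack.
parent = {
    "dc": None,
    "rack1": "dc", "rack2": "dc",
    "s1": "rack1", "s2": "rack1",
    "s3": "rack2", "s4": "rack2",
}

clustered = {"s1": 1, "s2": 1}  # both replicas in rack1
spread = {"s1": 1, "s3": 1}     # one replica per rack

print(worst_domain_loss(parent, clustered))
print(worst_domain_loss(parent, spread))
```

Under this assumed topology, the clustered placement puts both replicas under `rack1`, so a single rack failure disables both, while the spread placement loses at most one replica to any failure below the data-center level.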
Pages: 482-518
Page count: 37