Algorithms for optimal replica placement under correlated failure in hierarchical failure domains

被引：0

作者：

Mills, K. Alex ^{[1
]}

Chandrasekaran, R. ^{[1
]}

Mittal, Neeraj ^{[1
]}

机构：

[1] Univ Texas Dallas, 800 W Campbell Rd, Richardson, TX 75080 USA

来源：

THEORETICAL COMPUTER SCIENCE | 2020年 / 809卷

基金：

美国国家科学基金会;

关键词：

Replica placement; Correlated failure; Combinatorial optimization; Fault-tolerant storage; Data center management; RELIABILITY;

D O I：

10.1016/j.tcs.2020.01.004

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In data centers, data replication is the primary method used to ensure availability of customer data. To avoid correlated failure, cloud storage infrastructure providers model hierarchical failure domains using a tree, and avoid placing a large number of data replicas within the same failure domain (i.e. on the same branch of the tree). Typical best practices ensure that replicas are distributed across failure domains, but relatively little is known concerning optimization algorithms for distributing data replicas. Using a hierarchical model, we answer how to distribute replicas across failure domains optimally. We formulate a novel optimization problem for replica placement in data centers. As part of our problem, we formalize and present a new criterion for optimizing a replica placement. Our overall goal is to choose placements in which correlated failures disable as few replicas as possible. In this work, we provide two optimization algorithms for dependency models represented by trees. We first present an O(n + rho log rho) time dynamic programming algorithm for optimally placing rho replicas of a single block on the leaves (representing servers) of a tree with n vertices. We next consider the problem of optimally placing replicas of multiple blocks of data, where every block may have a different replication factor. For this problem, we give a dynamic programming algorithm that runs in O(n rho(3)(max)delta(2)m(poly(delta))), where m denotes the number of blocks, rho(max) denotes the maximum replication factor of a block, and delta denotes the maximum difference in the replication factors of any two blocks. The running time of the algorithm is polynomial when the delta, which we refer to as the skew, is a constant. (C) 2020 Elsevier B.V. All rights reserved.

引用

页码：482 / 518

页数：37

共 50 条

[21] Metaheuristic algorithms for capacitated controller placement in software defined networks considering failure resilience
Mohanty, Sagarika
Sahoo, Bibhudatta
[J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024,
[22] Failure mode analysis and validation of hierarchical sandwich structures under compression
Guo, Xinxun
Fang, Yaochu
Wu, Wentao
Luo, Yaoge
Zhang, Wenlong
Guo, Yongli
Sun, Bing
Zhu, Xiang
[J]. STRUCTURES, 2022, 45 : 787 - 798
[23] OPTIMAL REPLACEMENT UNDER ADDITIVE DAMAGE AND OTHER FAILURE MODELS
TAYLOR, HM
[J]. NAVAL RESEARCH LOGISTICS, 1975, 22 (01) : 1 - 18
[24] OPTIMAL MAINTENANCE POLICIES UNDER DETERMINISTIC DETERIORATION AND STOCHASTIC FAILURE
FEICHTINGER, G
[J]. OPTIMAL CONTROL APPLICATIONS & METHODS, 1983, 4 (02): : 153 - 162
[25] Optimal Model for Failure Foresight Capacitated Controller Placement in Software-Defined Networks
Killi, Bala Prakasa Rao
Rao, Seela Veerabhadreswara
[J]. IEEE COMMUNICATIONS LETTERS, 2016, 20 (06) : 1108 - 1111
[26] Reliability evaluation of phased mission system under multiple correlated failure modes
Du, Xuejiao
Zhou, Caihong
Gai, Jingbo
Yan, Ke
[J]. QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2023, 39 (08) : 3266 - 3284
[27] Parametric Synthesis of Failure-Free Control Algorithms under Uncertainty
V. N. Efanov
T. R. Suyargulov
[J]. Automation and Remote Control, 2001, 62 : 963 - 977
[28] Parametric synthesis of failure-free control algorithms under uncertainty
Efanov, VN
Suyargulov, TR
[J]. AUTOMATION AND REMOTE CONTROL, 2001, 62 (06) : 963 - 977
[29] A Scalable Decomposition Algorithm for PMU Placement Under Multiple-Failure Contingencies
Chen, Richard Li-Yang
Ruthruff, Joseph
[J]. 2014 IEEE PES GENERAL MEETING - CONFERENCE & EXPOSITION, 2014,
[30] HIERARCHICAL FAILURE ANALYSIS AND OPTIMAL TOUGHNESS DESIGN OF CARBON NANOTUBE-REINFORCED COMPOSITES
Chen, Y. L.
Liu, B.
Huang, Y.
Hwang, K. C.
[J]. ADVANCES IN HETEROGENEOUS MATERIAL MECHANICS 2011, 2011, : 335 - 335

← 1 2 3 4 5 →