DiBA: Distributed Power Budget Allocation for Large-Scale Computing Clusters

被引:4
|
作者
Badiei, Masoud [1 ]
Zhan, Xin [2 ]
Azimi, Reza [2 ]
Reda, Sherief [2 ]
Li, Na [1 ]
机构
[1] Harvard Univ, SEAS, Cambridge, MA 02138 USA
[2] Brown Univ, Sch Engn, Providence, RI 02912 USA
关键词
Distributed algorithm; Primal-dual algorithm; Power management; Consensus algorithm;
D O I
10.1109/CCGrid.2016.101
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Power management has become a central issue in large-scale computing clusters where a considerable amount of energy is consumed and a large operational cost is incurred annually. Traditional power management techniques have a centralized design that creates challenges for scalability of computing clusters. In this work, we develop a framework for distributed power budget allocation that maximizes the utility of computing nodes subject to a total power budget constraint. To eliminate the role of central coordinator in the primal-dual technique, we propose a distributed power budget allocation algorithm (DiBA) which maximizes the combined performance of a cluster subject to a power budget constraint in a distributed fashion. Specifically, DiBA is a consensus-based algorithm in which each server determines its optimal power consumption locally by communicating its state with neighbors (connected nodes) in a cluster. We characterize a synchronous primal-dual technique to obtain a benchmark for comparison with the distributed algorithm that we propose. We demonstrate numerically that DiBA is a scalable algorithm that outperforms the conventional primal-dual method on large scale clusters in terms of convergence time. Further, DiBA eliminates the communication bottleneck in the primal-dual method. We thoroughly evaluate the characteristics of DiBA through simulations of large-scale clusters. Furthermore, we provide results from a proof-of-concept implementation on a real experimental cluster.
引用
收藏
页码:70 / 79
页数:10
相关论文
共 50 条
  • [1] Optimal computing budget allocation to the differential evolution algorithm for large-scale portfolio optimization
    Liu, Wei-han
    [J]. JOURNAL OF SIMULATION, 2017, 11 (04) : 380 - 390
  • [2] Distributed Job Allocation for Large-Scale Manycores
    Ramachandran, Subramanian
    Mueller, Frank
    [J]. HIGH PERFORMANCE COMPUTING, 2016, 9697 : 404 - 425
  • [3] SCIDDLE - A TOOL FOR LARGE-SCALE DISTRIBUTED COMPUTING
    ARBENZ, P
    SPRENGER, C
    LUTHI, HP
    VOGEL, S
    [J]. CONCURRENCY-PRACTICE AND EXPERIENCE, 1995, 7 (02): : 121 - 146
  • [4] Adaptive Pilot Power Allocation for Distributed Antenna Systems with Large-scale CSI
    Zhang, Yingjie
    Feng, Wei
    Ge, Ning
    [J]. 2014 21ST INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS (ICT), 2014, : 246 - 251
  • [5] Large-scale scientific irregular computing on clusters and grids
    Brezany, P
    Bubak, M
    Malawski, M
    Zajac, K
    [J]. COMPUTATIONAL SCIENCE-ICCS 2002, PT I, PROCEEDINGS, 2002, 2329 : 484 - 493
  • [6] On the optimal computing budget allocation problem for large scale simulation optimization
    Al-Salem, Mohammed
    Almomani, Mohammad
    Alrefaei, Mahmoud
    Diabat, Ali
    [J]. SIMULATION MODELLING PRACTICE AND THEORY, 2017, 71 : 149 - 159
  • [7] Force allocation in a large-scale distributed active surface
    Fromherz, MPJ
    Jackson, WB
    [J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2003, 11 (05) : 641 - 655
  • [8] Dynamic Power Pricing using Distributed Resource Allocation for Large-Scale DSA Systems
    Khalfi, Bassem
    Ben Ghorbel, Mahdi
    Hamdaoui, Bechir
    Guizani, Mohsen
    [J]. 2015 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2015, : 1090 - 1094
  • [9] Large-Scale Feature Matching with Distributed and Heterogeneous Computing
    Mills, Steven
    Eyers, David
    Leung, Kai-Cheung
    Tang, Xiaoxin
    Huang, Zhiyi
    [J]. PROCEEDINGS OF 2013 28TH INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ 2013), 2013, : 208 - 213
  • [10] Distributed Domain Generation for Large-Scale Scientific Computing
    Ertl, Christoph
    Mundani, Ralf-Peter
    [J]. 2021 20TH INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC), 2021, : 105 - 113