RLP: Power Management Based on a Latency-Aware Roofline Model

被引:0
|
作者
Wang, Bo [1 ]
Kozhokanova, Anara [1 ]
Terboven, Christian [1 ]
Mueller, Matthias [1 ]
机构
[1] Rhein Westfal TH Aachen, IT Ctr, Aachen, Germany
关键词
power management; memory access latency; roofline model; PERFORMANCE;
D O I
10.1109/IPDPS54959.2023.00052
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The ever-growing power draw in high-performance computing (HPC) clusters and the rising energy costs enforce a pressing urge for energy-efficient computing. Consequently, advanced infrastructure orchestration is required to regulate power dissipation efficiently. In this work, we propose a novel approach for managing power consumption at runtime based on the well-known roofline model and call it Roofline Power (RLP) management. The RLP employs rigorously selected but generally available hardware performance events to construct rooflines, with minimal overheads. In particular, RLP extends the original roofline model to include the memory access latency metric for the first time. The extension identifies whether execution is bandwidth, latency, or compute-bound, and improves the modeling accuracy. We evaluated the RLP model on servergrade CPUs and a GPU with real-world HPC workloads in two scenarios: optimization with and without power capping. Compared to system default settings, RLP reduces the energyto-solution up to 22% with negligible performance degradation. The other scenario accelerates the execution up to 14.7% under power capping. In addition, RLP outperforms other state-of-the-art techniques in generality and effectiveness.
引用
收藏
页码:446 / 456
页数:11
相关论文
共 50 条
  • [21] A Low-Power Packet Memory Architecture with a Latency-Aware Packet Mapping Method
    Lee, Hyuk-Jun
    Kim, Seung-Chul
    Chung, Eui-Young
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (04): : 963 - 966
  • [22] PRESTO: a latency-aware power-capping orchestrator for cloud-native microservices
    Brondolin, Rolando
    Santambrogio, Marco D.
    2020 IEEE INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING AND SELF-ORGANIZING SYSTEMS (ACSOS 2020), 2020, : 11 - 20
  • [23] Latency-aware DVFS for efficient power state transitions on many-core architectures
    Zhiquan Lai
    King Tin Lam
    Cho-Li Wang
    Jinshu Su
    The Journal of Supercomputing, 2015, 71 : 2720 - 2747
  • [24] Latency-aware DVFS for efficient power state transitions on many-core architectures
    Lai, Zhiquan
    Lam, King Tin
    Wang, Cho-Li
    Su, Jinshu
    JOURNAL OF SUPERCOMPUTING, 2015, 71 (07): : 2720 - 2747
  • [25] Green latency-aware data placement in data centers
    Fan, Yuqi
    Ding, Hongli
    Wang, Lusheng
    Yuan, Xiaojing
    COMPUTER NETWORKS, 2016, 110 : 46 - 57
  • [26] Latency-Aware Industrial Fog Application Orchestration with Kubernetes
    Eidenbenz, Raphael
    Pignolet, Yvonne-Anne
    Ryser, Alain
    2020 FIFTH INTERNATIONAL CONFERENCE ON FOG AND MOBILE EDGE COMPUTING (FMEC), 2020, : 164 - 171
  • [27] Optimization of latency-aware flow allocation in NGFI networks
    Klinkowski, Miroslaw
    COMPUTER COMMUNICATIONS, 2020, 161 : 344 - 359
  • [28] Efficient cooperative cache management for latency-aware data intelligent processing in edge environment
    Li, Chunlin
    Liu, Jun
    Zhang, Qingchuan
    Luo, Youlong
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 123 : 48 - 67
  • [29] Latency-Aware Placement Heuristic in Fog Computing Environment
    Amira, Rayane Benamer
    Hana, Teyeb
    Ben Hadj-Alouane, Nejib
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS (OTM 2018), PT II, 2018, 11230 : 241 - 257
  • [30] The Design and Implementation of a Latency-Aware Packet Classification for OpenFlow Protocol based on FPGA
    Chiu, Yu-Kai
    Ruan, Shanq-Jang
    Shen, Chung-An
    Hung, Chun-Chi
    PROCEEDINGS OF 2018 VII INTERNATIONAL CONFERENCE ON NETWORK, COMMUNICATION AND COMPUTING (ICNCC 2018), 2018, : 64 - 69