RLP: Power Management Based on a Latency-Aware Roofline Model

被引：0

作者：

Wang, Bo ^{[1
]}

Kozhokanova, Anara ^{[1
]}

Terboven, Christian ^{[1
]}

Mueller, Matthias ^{[1
]}

机构：

[1] Rhein Westfal TH Aachen, IT Ctr, Aachen, Germany

来源：

2023 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS | 2023年

关键词：

power management; memory access latency; roofline model; PERFORMANCE;

D O I：

10.1109/IPDPS54959.2023.00052

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The ever-growing power draw in high-performance computing (HPC) clusters and the rising energy costs enforce a pressing urge for energy-efficient computing. Consequently, advanced infrastructure orchestration is required to regulate power dissipation efficiently. In this work, we propose a novel approach for managing power consumption at runtime based on the well-known roofline model and call it Roofline Power (RLP) management. The RLP employs rigorously selected but generally available hardware performance events to construct rooflines, with minimal overheads. In particular, RLP extends the original roofline model to include the memory access latency metric for the first time. The extension identifies whether execution is bandwidth, latency, or compute-bound, and improves the modeling accuracy. We evaluated the RLP model on servergrade CPUs and a GPU with real-world HPC workloads in two scenarios: optimization with and without power capping. Compared to system default settings, RLP reduces the energyto-solution up to 22% with negligible performance degradation. The other scenario accelerates the execution up to 14.7% under power capping. In addition, RLP outperforms other state-of-the-art techniques in generality and effectiveness.

引用

页码：446 / 456

页数：11

共 50 条

[21] A Low-Power Packet Memory Architecture with a Latency-Aware Packet Mapping Method
Lee, Hyuk-Jun
Kim, Seung-Chul
Chung, Eui-Young
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (04): : 963 - 966
[22] PRESTO: a latency-aware power-capping orchestrator for cloud-native microservices
Brondolin, Rolando
Santambrogio, Marco D.
2020 IEEE INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING AND SELF-ORGANIZING SYSTEMS (ACSOS 2020), 2020, : 11 - 20
[23] Latency-aware DVFS for efficient power state transitions on many-core architectures
Zhiquan Lai
King Tin Lam
Cho-Li Wang
Jinshu Su
The Journal of Supercomputing, 2015, 71 : 2720 - 2747
[24] Latency-aware DVFS for efficient power state transitions on many-core architectures
Lai, Zhiquan
Lam, King Tin
Wang, Cho-Li
Su, Jinshu
JOURNAL OF SUPERCOMPUTING, 2015, 71 (07): : 2720 - 2747
[25] Green latency-aware data placement in data centers
Fan, Yuqi
Ding, Hongli
Wang, Lusheng
Yuan, Xiaojing
COMPUTER NETWORKS, 2016, 110 : 46 - 57
[26] Latency-Aware Industrial Fog Application Orchestration with Kubernetes
Eidenbenz, Raphael
Pignolet, Yvonne-Anne
Ryser, Alain
2020 FIFTH INTERNATIONAL CONFERENCE ON FOG AND MOBILE EDGE COMPUTING (FMEC), 2020, : 164 - 171
[27] Optimization of latency-aware flow allocation in NGFI networks
Klinkowski, Miroslaw
COMPUTER COMMUNICATIONS, 2020, 161 : 344 - 359
[28] Efficient cooperative cache management for latency-aware data intelligent processing in edge environment
Li, Chunlin
Liu, Jun
Zhang, Qingchuan
Luo, Youlong
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 123 : 48 - 67
[29] Latency-Aware Placement Heuristic in Fog Computing Environment
Amira, Rayane Benamer
Hana, Teyeb
Ben Hadj-Alouane, Nejib
ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS (OTM 2018), PT II, 2018, 11230 : 241 - 257
[30] The Design and Implementation of a Latency-Aware Packet Classification for OpenFlow Protocol based on FPGA
Chiu, Yu-Kai
Ruan, Shanq-Jang
Shen, Chung-An
Hung, Chun-Chi
PROCEEDINGS OF 2018 VII INTERNATIONAL CONFERENCE ON NETWORK, COMMUNICATION AND COMPUTING (ICNCC 2018), 2018, : 64 - 69

← 1 2 3 4 5 →