RLP: Power Management Based on a Latency-Aware Roofline Model

被引：0

作者：

Wang, Bo ^{[1
]}

Kozhokanova, Anara ^{[1
]}

Terboven, Christian ^{[1
]}

Mueller, Matthias ^{[1
]}

机构：

[1] Rhein Westfal TH Aachen, IT Ctr, Aachen, Germany

来源：

2023 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS | 2023年

关键词：

power management; memory access latency; roofline model; PERFORMANCE;

D O I：

10.1109/IPDPS54959.2023.00052

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The ever-growing power draw in high-performance computing (HPC) clusters and the rising energy costs enforce a pressing urge for energy-efficient computing. Consequently, advanced infrastructure orchestration is required to regulate power dissipation efficiently. In this work, we propose a novel approach for managing power consumption at runtime based on the well-known roofline model and call it Roofline Power (RLP) management. The RLP employs rigorously selected but generally available hardware performance events to construct rooflines, with minimal overheads. In particular, RLP extends the original roofline model to include the memory access latency metric for the first time. The extension identifies whether execution is bandwidth, latency, or compute-bound, and improves the modeling accuracy. We evaluated the RLP model on servergrade CPUs and a GPU with real-world HPC workloads in two scenarios: optimization with and without power capping. Compared to system default settings, RLP reduces the energyto-solution up to 22% with negligible performance degradation. The other scenario accelerates the execution up to 14.7% under power capping. In addition, RLP outperforms other state-of-the-art techniques in generality and effectiveness.

引用

页码：446 / 456

页数：11

共 50 条

[1] Montgolfier: Latency-Aware Power Management System for Heterogeneous Servers
Cai, Haoran
Cao, Qiang
Sheng, Feng
Zhang, Manyi
Qi, Chuanyi
Yao, Jie
Xie, Changsheng
2016 IEEE 35TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2016,
[2] Latency-Aware Power Management in Software-Defined Radios
Malm, Nicolas
Ruttik, Kalle
Tirkkonen, Olav
JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2020, 2020
[3] Latency-aware decentralized resource management for IoT applications
Avasalcai, Cosmin
Dustdar, Schahram
PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON THE INTERNET OF THINGS (IOT'18), 2018,
[4] Autonomic and Latency-Aware Degree of Parallelism Management in SPar
Vogel, Adriano
Griebler, Dalvan
De Sensi, Daniele
Danelutto, Marco
Fernandes, Luiz Gustavo
EURO-PAR 2018: PARALLEL PROCESSING WORKSHOPS, 2019, 11339 : 28 - 39
[5] Latency-Aware Collaborative Perception
Lei, Zixing
Ren, Shunli
Hu, Yue
Zhang, Wenjun
Chen, Siheng
COMPUTER VISION - ECCV 2022, PT XXXII, 2022, 13692 : 316 - 332
[6] Latency-Aware Application Module Management for Fog Computing Environments
Mahmud, Redowan
Ramamohanarao, Kotagiri
Buyya, Rajkumar
ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2019, 19 (01)
[7] Roofline Model Based Performance-Aware Energy Management for Scientific Computing
Wang, Yunlan
Zhao, Tianhai
Li, Lu
Hou, Zhengxiong
Gu, Jianhua
2018 9TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP 2018), 2018, : 74 - 80
[8] Performance Analysis of Latency-Aware Data Management in Industrial IoT Networks
Raptis, Theofanis P.
Passarella, Andrea
Conti, Marco
SENSORS, 2018, 18 (08)
[9] Reliable Latency-Aware Routing for Clustered WSNs
Tufail, Ali
INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2012,
[10] PLATO: Predictive latency-aware total ordering
Balakrishnan, Mahesh
Birman, Ken
Phanishayee, Amar
SRDS 2006: 25TH IEEE SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 2006, : 175 - 185

← 1 2 3 4 5 →