RLP: Power Management Based on a Latency-Aware Roofline Model

被引:0
|
作者
Wang, Bo [1 ]
Kozhokanova, Anara [1 ]
Terboven, Christian [1 ]
Mueller, Matthias [1 ]
机构
[1] Rhein Westfal TH Aachen, IT Ctr, Aachen, Germany
关键词
power management; memory access latency; roofline model; PERFORMANCE;
D O I
10.1109/IPDPS54959.2023.00052
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The ever-growing power draw in high-performance computing (HPC) clusters and the rising energy costs enforce a pressing urge for energy-efficient computing. Consequently, advanced infrastructure orchestration is required to regulate power dissipation efficiently. In this work, we propose a novel approach for managing power consumption at runtime based on the well-known roofline model and call it Roofline Power (RLP) management. The RLP employs rigorously selected but generally available hardware performance events to construct rooflines, with minimal overheads. In particular, RLP extends the original roofline model to include the memory access latency metric for the first time. The extension identifies whether execution is bandwidth, latency, or compute-bound, and improves the modeling accuracy. We evaluated the RLP model on servergrade CPUs and a GPU with real-world HPC workloads in two scenarios: optimization with and without power capping. Compared to system default settings, RLP reduces the energyto-solution up to 22% with negligible performance degradation. The other scenario accelerates the execution up to 14.7% under power capping. In addition, RLP outperforms other state-of-the-art techniques in generality and effectiveness.
引用
收藏
页码:446 / 456
页数:11
相关论文
共 50 条
  • [1] Montgolfier: Latency-Aware Power Management System for Heterogeneous Servers
    Cai, Haoran
    Cao, Qiang
    Sheng, Feng
    Zhang, Manyi
    Qi, Chuanyi
    Yao, Jie
    Xie, Changsheng
    2016 IEEE 35TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2016,
  • [2] Latency-Aware Power Management in Software-Defined Radios
    Malm, Nicolas
    Ruttik, Kalle
    Tirkkonen, Olav
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2020, 2020
  • [3] Latency-aware decentralized resource management for IoT applications
    Avasalcai, Cosmin
    Dustdar, Schahram
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON THE INTERNET OF THINGS (IOT'18), 2018,
  • [4] Autonomic and Latency-Aware Degree of Parallelism Management in SPar
    Vogel, Adriano
    Griebler, Dalvan
    De Sensi, Daniele
    Danelutto, Marco
    Fernandes, Luiz Gustavo
    EURO-PAR 2018: PARALLEL PROCESSING WORKSHOPS, 2019, 11339 : 28 - 39
  • [5] Latency-Aware Collaborative Perception
    Lei, Zixing
    Ren, Shunli
    Hu, Yue
    Zhang, Wenjun
    Chen, Siheng
    COMPUTER VISION - ECCV 2022, PT XXXII, 2022, 13692 : 316 - 332
  • [6] Latency-Aware Application Module Management for Fog Computing Environments
    Mahmud, Redowan
    Ramamohanarao, Kotagiri
    Buyya, Rajkumar
    ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2019, 19 (01)
  • [7] Roofline Model Based Performance-Aware Energy Management for Scientific Computing
    Wang, Yunlan
    Zhao, Tianhai
    Li, Lu
    Hou, Zhengxiong
    Gu, Jianhua
    2018 9TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP 2018), 2018, : 74 - 80
  • [8] Performance Analysis of Latency-Aware Data Management in Industrial IoT Networks
    Raptis, Theofanis P.
    Passarella, Andrea
    Conti, Marco
    SENSORS, 2018, 18 (08)
  • [9] Reliable Latency-Aware Routing for Clustered WSNs
    Tufail, Ali
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2012,
  • [10] PLATO: Predictive latency-aware total ordering
    Balakrishnan, Mahesh
    Birman, Ken
    Phanishayee, Amar
    SRDS 2006: 25TH IEEE SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 2006, : 175 - 185