RLP: Power Management Based on a Latency-Aware Roofline Model

被引:0
|
作者
Wang, Bo [1 ]
Kozhokanova, Anara [1 ]
Terboven, Christian [1 ]
Mueller, Matthias [1 ]
机构
[1] Rhein Westfal TH Aachen, IT Ctr, Aachen, Germany
关键词
power management; memory access latency; roofline model; PERFORMANCE;
D O I
10.1109/IPDPS54959.2023.00052
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The ever-growing power draw in high-performance computing (HPC) clusters and the rising energy costs enforce a pressing urge for energy-efficient computing. Consequently, advanced infrastructure orchestration is required to regulate power dissipation efficiently. In this work, we propose a novel approach for managing power consumption at runtime based on the well-known roofline model and call it Roofline Power (RLP) management. The RLP employs rigorously selected but generally available hardware performance events to construct rooflines, with minimal overheads. In particular, RLP extends the original roofline model to include the memory access latency metric for the first time. The extension identifies whether execution is bandwidth, latency, or compute-bound, and improves the modeling accuracy. We evaluated the RLP model on servergrade CPUs and a GPU with real-world HPC workloads in two scenarios: optimization with and without power capping. Compared to system default settings, RLP reduces the energyto-solution up to 22% with negligible performance degradation. The other scenario accelerates the execution up to 14.7% under power capping. In addition, RLP outperforms other state-of-the-art techniques in generality and effectiveness.
引用
收藏
页码:446 / 456
页数:11
相关论文
共 50 条
  • [31] Latency-aware Scheduling in the Cloud-Edge Continuum
    Chiaro, Cristopher
    Monaco, Doriana
    Sacco, Alessio
    Casetti, Claudio
    Marchetto, Guido
    PROCEEDINGS OF 2024 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, NOMS 2024, 2024,
  • [32] Latency-Aware Offloading for Mobile Edge Computing Networks
    Feng, Wei
    Liu, Hao
    Yao, Yingbiao
    Cao, Diqiu
    Zhao, Mingxiong
    IEEE COMMUNICATIONS LETTERS, 2021, 25 (08) : 2673 - 2677
  • [33] Latency-aware virtual desktops optimization in distributed clouds
    Tian Guo
    Prashant Shenoy
    K. K. Ramakrishnan
    Vijay Gopalakrishnan
    Multimedia Systems, 2018, 24 : 73 - 94
  • [34] Energy and Latency-aware Resource Reconfiguration in Fog Environments
    Godinho, Noe
    Silva, Henrique
    Curado, Marilia
    Paquete, Luis
    2020 IEEE 19TH INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (NCA), 2020,
  • [35] Latency-Aware Accelerator of SIMECK Lightweight Block Cipher
    Alharbi, Adel R.
    Tariq, Hassan
    Aljaedi, Amer
    Aljuhni, Abdullah
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [36] Latency-Aware Kubernetes Scheduling for Microservices Orchestration at the Edge
    Centofanti, C.
    Tiberti, W.
    Marotta, A.
    Graziosi, F.
    Cassioli, D.
    2023 IEEE 9TH INTERNATIONAL CONFERENCE ON NETWORK SOFTWARIZATION, NETSOFT, 2023, : 426 - 431
  • [37] On Latency-aware Self-aligning Cloud Storage
    Haider, Syed Ali
    Kazi, Khurram
    Zaidi, S. M. H.
    Raja, M. Yasin Akhtar
    2013 10TH INTERNATIONAL CONFERENCE ON HIGH CAPACITY OPTICAL NETWORKS AND ENABLING TECHNOLOGIES (HONET-CNS), 2013, : 189 - 192
  • [38] Simulation Study on Latency-aware Network in Edge Computing
    Zheng, Qinling
    Ping, Zhan
    2019 6TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT 2019), 2019, : 150 - 155
  • [39] Latency-aware Traffic Provisioning for Content Delivery Networks
    Hei, Jinghao
    Than, Huiyou
    Zhang, Pengfei
    Tan, Haisheng
    2022 8TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS, BIGCOM, 2022, : 11 - 18
  • [40] Latency-aware virtual desktops optimization in distributed clouds
    Guo, Tian
    Shenoy, Prashant
    Ramakrishnan, K. K.
    Gopalakrishnan, Vijay
    MULTIMEDIA SYSTEMS, 2018, 24 (01) : 73 - 94