Runtime Energy Minimization of Distributed Many-Core Systems using Transfer Learning

被引:0
|
作者
Jenkus, Dainius [1 ]
Xia, Fei [1 ]
Shafik, Rishad [1 ]
Yakovlev, Alex [1 ]
机构
[1] Newcastle Univ, Sch Engn, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The heterogeneity of computing resources continues to permeate into many-core systems making energy-efficiency a challenging objective. Existing rule-based and model-driven methods return sub-optimal energy-efficiency and limited scalability as system complexity increases to the domain of distributed systems. This is exacerbated further by dynamic variations of workloads and quality-of-service (QoS) demands. This work presents a QoS-aware runtime management method for energy minimization using a transfer learning (TL) driven exploration strategy. It enhances standard Q-learning to improve both learning speed and operational optimality (i.e., QoS and energy). The core to our approach is a multi-dimensional knowledge transfer across a task's state-action space. It accelerates the learning of dynamic voltage/frequency scaling (DVFS) control actions for tuning power/performance trade-offs. Firstly, the method identifies and transfers already learned policies between explored and behaviorally similar states referred to as Intra-Task Learning Transfer (ITLT). Secondly, if no similar "expert" states are available, it accelerates exploration at a local state's level through what's known as Intra-State Learning Transfer (ISLT). A comparative evaluation of the approach indicates faster and more balanced exploration. This is shown through energy savings ranging from 7.30% to 18.06%, and improved QoS from 10.43% to 14.3%, when compared to existing exploration strategies. This method is demonstrated under WordPress and TensorFlow workloads on a server cluster.
引用
收藏
页码:1209 / 1214
页数:6
相关论文
共 50 条
  • [21] Defragmentation for Efficient Runtime Resource Management in NoC-Based Many-Core Systems
    Ng, Jim
    Wang, Xiaohang
    Singh, Amit Kumar
    Mak, Terrence
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2016, 24 (11) : 3359 - 3372
  • [22] On the Acceleration of Wavefront Applications using Distributed Many-Core Architectures
    Pennycook, S. J.
    Hammond, S. D.
    Mudalige, G. R.
    Wright, S. A.
    Jarvis, S. A.
    COMPUTER JOURNAL, 2012, 55 (02): : 138 - 153
  • [23] RMC: an Integrated Runtime System for Adaptive Many-Core Computing
    Park, Jinsu
    Cho, Eunbi
    Baek, Woongki
    2016 PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE (EMSOFT), 2016,
  • [24] An OpenCL Runtime System for a Heterogeneous Many-Core Virtual Platform
    Chen, Kuan-Chung
    Chen, Chung-Ho
    2014 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2014, : 2197 - 2200
  • [25] Fast power and energy management for future many-core systems
    Liu Y.
    Cox G.
    Deng Q.
    Draper S.C.
    Bianchini R.
    ACM Transactions on Modeling and Performance Evaluation of Computing Systems, 2017, 2 (03)
  • [26] Hierarchical energy monitoring for task mapping in many-core systems
    Castilhos, Guilherme
    Mandelli, Marcelo
    Ost, Luciano
    Moraes, Fernando Gehm
    JOURNAL OF SYSTEMS ARCHITECTURE, 2016, 63 : 80 - 92
  • [27] Demystifying the Cost of Task Migration in Distributed Memory Many-Core Systems
    Ruaro, Marcelo
    Moraes, Fernando G.
    2017 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2017, : 148 - 151
  • [28] DBP: Distributed Power Budgeting for Many-Core Systems in Dark Silicon
    Wang, Hai
    He, Wenjun
    Yang, Qinhui
    Peng, Xizhu
    Tang, He
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (12) : 5727 - 5731
  • [29] Runtime Thermal Management Using Software Agents for Multi- and Many-Core Architectures
    Al Faruque, Mohammad Abdullah
    Jahn, Janmartin
    Ebi, Thomas
    Henkel, Joerg
    IEEE DESIGN & TEST OF COMPUTERS, 2010, 27 (06): : 58 - 68
  • [30] QoS Manager for Energy Efficient Many-Core Operating Systems
    Holmbacka, Simon
    Agren, Dag
    Lafond, Sebastien
    Lilius, Johan
    PROCEEDINGS OF THE 2013 21ST EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING, 2013, : 318 - 322