Runtime Energy Minimization of Distributed Many-Core Systems using Transfer Learning

被引:0
|
作者
Jenkus, Dainius [1 ]
Xia, Fei [1 ]
Shafik, Rishad [1 ]
Yakovlev, Alex [1 ]
机构
[1] Newcastle Univ, Sch Engn, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The heterogeneity of computing resources continues to permeate into many-core systems making energy-efficiency a challenging objective. Existing rule-based and model-driven methods return sub-optimal energy-efficiency and limited scalability as system complexity increases to the domain of distributed systems. This is exacerbated further by dynamic variations of workloads and quality-of-service (QoS) demands. This work presents a QoS-aware runtime management method for energy minimization using a transfer learning (TL) driven exploration strategy. It enhances standard Q-learning to improve both learning speed and operational optimality (i.e., QoS and energy). The core to our approach is a multi-dimensional knowledge transfer across a task's state-action space. It accelerates the learning of dynamic voltage/frequency scaling (DVFS) control actions for tuning power/performance trade-offs. Firstly, the method identifies and transfers already learned policies between explored and behaviorally similar states referred to as Intra-Task Learning Transfer (ITLT). Secondly, if no similar "expert" states are available, it accelerates exploration at a local state's level through what's known as Intra-State Learning Transfer (ISLT). A comparative evaluation of the approach indicates faster and more balanced exploration. This is shown through energy savings ranging from 7.30% to 18.06%, and improved QoS from 10.43% to 14.3%, when compared to existing exploration strategies. This method is demonstrated under WordPress and TensorFlow workloads on a server cluster.
引用
收藏
页码:1209 / 1214
页数:6
相关论文
共 50 条
  • [31] Fast Energy Evaluation of Embedded Applications for Many-core Systems
    Rosa, Felipe
    Ost, Luciano
    Raupp, Thiago
    Moraes, Fernando
    Reis, Ricardo
    2014 24TH INTERNATIONAL WORKSHOP ON POWER AND TIMING MODELING, OPTIMIZATION AND SIMULATION (PATMOS), 2014,
  • [32] Modular and Distributed Management of Many-Core SoCs
    Ruaro, Marcelo
    Sant'ana, Anderson
    Jantsch, Axel
    Moraes, Fernando Gehm
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2021, 38 (1-2):
  • [33] DeFrag: Defragmentation for Efficient Runtime Resource Allocation in NoC-based Many-core Systems
    Ng, Jim
    Wang, Xiaohang
    Singh, Amit Kumar
    Mak, Terrence
    23RD EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2015), 2015, : 345 - 352
  • [34] A Lifetime-Aware Runtime Mapping Approach for Many-core Systems in the Dark Silicon Era
    Haghbayan, Mohammad-Hashem
    Miele, Antonio
    Rahmani, Amir M.
    Liljeberg, Pasi
    Tenhunen, Hannu
    PROCEEDINGS OF THE 2016 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2016, : 854 - 857
  • [35] An efficient runtime power allocation scheme for many-core systems inspired from auction theory
    Wang, Xiaohang
    Zhao, Baoxin
    Mak, Terrence
    Yang, Mei
    Jiang, Yingtao
    Daneshtalab, Masoud
    INTEGRATION-THE VLSI JOURNAL, 2015, 50 : 147 - 157
  • [36] Reliability-Aware Runtime Power Management for Many-Core Systems in the Dark Silicon Era
    Rahmani, Amir M.
    Haghbayan, Mohammad-Hashem
    Miele, Antonio
    Liljeberg, Pasi
    Jantsch, Axel
    Tenhunen, Hannu
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2017, 25 (02) : 427 - 440
  • [37] Skeletons for multi/many-core systems
    Aldinucci, Marco
    Danelutto, Marco
    Kilpatrick, Peter
    PARALLEL COMPUTING: FROM MULTICORES AND GPU'S TO PETASCALE, 2010, 19 : 265 - 272
  • [38] A Latency Simulator for Many-core Systems
    Kumar, Sunil
    Cucinotta, Tommaso
    Lipari, Giuseppe
    44TH ANNUAL SIMULATION SYMPOSIUM 2011 (ANSS 2011) - 2011 SPRING SIMULATION MULTICONFERENCE - BK 2 OF 8, 2011, : 151 - 158
  • [39] Distributed Reinforcement Learning for Power Limited Many-core System Performance Optimization
    Chen, Zhuo
    Marculescu, Diana
    2015 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2015, : 1521 - 1526
  • [40] Parallel Many-Core Avionics Systems
    Panic, Milos
    Quinones, Eduardo
    Zaykov, Pavel G.
    Hernandez, Carles
    Abella, Jaume
    Cazorla, Francisco J.
    2014 INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE (EMSOFT), 2014,