Runtime Energy Minimization of Distributed Many-Core Systems using Transfer Learning

被引:0
|
作者
Jenkus, Dainius [1 ]
Xia, Fei [1 ]
Shafik, Rishad [1 ]
Yakovlev, Alex [1 ]
机构
[1] Newcastle Univ, Sch Engn, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The heterogeneity of computing resources continues to permeate into many-core systems making energy-efficiency a challenging objective. Existing rule-based and model-driven methods return sub-optimal energy-efficiency and limited scalability as system complexity increases to the domain of distributed systems. This is exacerbated further by dynamic variations of workloads and quality-of-service (QoS) demands. This work presents a QoS-aware runtime management method for energy minimization using a transfer learning (TL) driven exploration strategy. It enhances standard Q-learning to improve both learning speed and operational optimality (i.e., QoS and energy). The core to our approach is a multi-dimensional knowledge transfer across a task's state-action space. It accelerates the learning of dynamic voltage/frequency scaling (DVFS) control actions for tuning power/performance trade-offs. Firstly, the method identifies and transfers already learned policies between explored and behaviorally similar states referred to as Intra-Task Learning Transfer (ITLT). Secondly, if no similar "expert" states are available, it accelerates exploration at a local state's level through what's known as Intra-State Learning Transfer (ISLT). A comparative evaluation of the approach indicates faster and more balanced exploration. This is shown through energy savings ranging from 7.30% to 18.06%, and improved QoS from 10.43% to 14.3%, when compared to existing exploration strategies. This method is demonstrated under WordPress and TensorFlow workloads on a server cluster.
引用
收藏
页码:1209 / 1214
页数:6
相关论文
共 50 条
  • [41] Highly Scalable Multiplication for Distributed Sparse Multivariate Polynomials on Many-Core Systems
    Gastineau, Mickael
    Laskar, Jacques
    COMPUTER ALGEBRA IN SCIENTIFIC COMPUTING, CASC 2013, 2013, 8136 : 100 - 115
  • [42] Resource-Aware MapReduce Runtime for Multi/Many-core Architectures
    Iliakis, Konstantinos
    Xydis, Sotirios
    Soudris, Dimitrios
    PROCEEDINGS OF THE 2020 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2020), 2020, : 897 - 902
  • [43] A Distributed Energy-aware Task Mapping to Achieve Thermal Balancing and Improve Reliability of Many-core Systems
    Mandelli, Marcelo
    Castilhos, Guilherme
    Sassatelli, Gilles
    Ost, Luciano
    Moraes, Fernando G.
    2015 28TH SYMPOSIUM ON INTEGRATED CIRCUITS AND SYSTEMS DESIGN (SBCCI), 2015,
  • [44] Speedup and Parallelization Models for Energy-Efficient Many-Core Systems Using Performance Counters
    Al-hayanni, Mohammed A. N.
    Shafik, Rishad
    Rafiev, Ashur
    Xia, Fei
    Yakovlev, Alex
    2017 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2017, : 410 - 417
  • [45] Distributed Peak Power Management for Many-core Architectures
    Sartori, John
    Kumar, Rakesh
    DATE: 2009 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, VOLS 1-3, 2009, : 1556 - 1559
  • [46] Acceleration of ensemble machine learning methods using many-core devices
    Tamerus, A.
    Washbrook, A.
    Wyeth, D.
    21ST INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP2015), PARTS 1-9, 2015, 664
  • [47] Runtime Adaptive Matrix Multiplication for the SW26010 Many-Core Processor
    Wu, Zheng
    Li, Mingfan
    Chi, Mengxian
    Xu, Le
    An, Hong
    IEEE ACCESS, 2020, 8 : 156915 - 156928
  • [48] Task Migration for Dynamic Power and Performance Characteristics on Many-Core Distributed Operating Systems
    Holmbacka, Simon
    Lund, Wictor
    Lafond, Sebastien
    Lilius, Johan
    PROCEEDINGS OF THE 2013 21ST EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING, 2013, : 310 - 317
  • [49] Runtime Performance and Power Optimization of Parallel Disparity Estimation on Many-Core Platforms
    Leech, Charles
    Kumar, Charan
    Acharyya, Amit
    Yang, Sheng
    Merrett, Geoff, V
    Al-Hashimi, Bashir M.
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2018, 17 (02)
  • [50] Synchronization Strategies on Many-Core SMT Systems
    Navarro-Torres, Agustin
    Alastruey-Benede, Jesus
    Ibanez-Marin, Pablo
    Carpen-Amarie, Maria
    2021 IEEE 33RD INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2021), 2021, : 54 - 63