L1-Bandwidth Aware Thread Allocation in Multicore SMT Processors

被引:0
|
作者
Feliu, Josue [1 ]
Sahuquillo, Julio [1 ]
Petit, Salvador [1 ]
Duato, Jose [1 ]
机构
[1] Univ Politecn Valencia, Dept Comp Engn DISCA, E-46071 Valencia, Spain
关键词
SMT; thread allocation; bandwidth-aware scheduling; MANAGEMENT;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Improving the utilization of shared resources is a key issue to increase performance in SMT processors. Recent work has focused on resource sharing policies to enhance the processor performance, but their proposals mainly concentrate on novel hardware mechanisms that adapt to the dynamic resource requirements of the running threads. This work addresses the L1 cache bandwidth problem in SMT processors experimentally on real hardware. Unlike previous work, this paper concentrates on thread allocation, by selecting the proper pair of co-runners to be launched to the same core. The relation between L1 bandwidth requirements of each benchmark and its performance (IPC) is analyzed. We found that for individual benchmarks, performance is strongly connected to L1 bandwidth consumption, and this observation remains valid when several co-runners are launched to the same SMT core. Based on these findings we propose two L1 bandwidth aware thread to core (t2c) allocation policies, namely Static and Dynamic t2c allocation, respectively. The aim of these policies is to properly balance L1 bandwidth requirements of the running threads among the processor cores. Experiments on a Xeon E5645 processor show that the proposed policies significantly improve the performance of the Linux OS kernel regardless the number of cores considered.
引用
收藏
页码:123 / 132
页数:10
相关论文
共 8 条
  • [1] Thread Isolation to Improve Symbiotic Scheduling on SMT Multicore Processors
    Feliu, Josue
    Sahuquillo, Julio
    Petit, Salvador
    Eeckhout, Lieven
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (02) : 359 - 373
  • [2] BALANCER: bandwidth allocation and cache partitioning for multicore processors
    Navarro-Torres, Agustin
    Alastruey-Benede, Jesus
    Ibanez, Pablo
    Vinals-Yufera, Victor
    [J]. JOURNAL OF SUPERCOMPUTING, 2023, 79 (09): : 10252 - 10276
  • [3] BALANCER: bandwidth allocation and cache partitioning for multicore processors
    Agustín Navarro-Torres
    Jesús Alastruey-Benedé
    Pablo Ibáñez
    Víctor Viñals-Yúfera
    [J]. The Journal of Supercomputing, 2023, 79 : 10252 - 10276
  • [4] Hardware-aware thread scheduling: the case of asymmetric multicore processors
    Peternier, Achille
    Ansaloni, Danilo
    Bonetta, Daniele
    Pautasso, Cesare
    Binder, Walter
    [J]. PROCEEDINGS OF THE 2012 IEEE 18TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2012), 2012, : 400 - 407
  • [5] Energy-aware thread co-location in heterogeneous multicore processors
    Nishtala, Rajiv
    Mosse, Daniel
    Petrucci, Vinicius
    [J]. 2013 PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE (EMSOFT), 2013,
  • [6] Improving execution unit occupancy on SMT-based processors through hardware-aware thread scheduling
    Peternier, Achille
    Ansaloni, Danilo
    Bonetta, Daniele
    Pautasso, Cesare
    Binder, Walter
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 30 : 229 - 241
  • [7] CATZ: Time-zone-aware bandwidth allocation in layer 1 VPNs
    Cavdar, Cicek
    Gencata, Aysegul
    Mukherjee, Biswanath
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2007, 45 (04) : 60 - 66
  • [8] Shrinking L1 Instruction Caches to Improve Energy-Delay in SMT Embedded Processors
    Ferreron-Labari, Alexandra
    Ortin-Obon, Marta
    Suarez-Gracia, Dario
    Alastruey-Benede, Jesus
    Vinals-Yufera, Victor
    [J]. ARCHITECTURE OF COMPUTING SYSTEMS - ARCS 2013, 2013, 7767 : 256 - 267