Task Scheduling on Manycore Processors with Home Caches

被引:0
|
作者
Muddukrishna, Ananya [1 ]
Podobas, Artur [1 ]
Brorsson, Mats [1 ]
Vlassov, Vladimir [1 ]
机构
[1] KTH Royal Inst Technol, Stockholm, Sweden
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Modern manycore processors feature a highly scalable and software-configurable cache hierarchy. For performance, manycore programmers will not only have to efficiently utilize the large number of cores but also understand and configure the cache hierarchy to suit the application. Relief from this manycore programming nightmare can be provided by task-based programming models where programmers parallelize using tasks and an architecture-specific runtime system maps tasks to cores and in addition configures the cache hierarchy. In this paper, we focus on the cache hierarchy of the Tilera TILEPro64 processor which features a software-configurable coherence waypoint called the home cache. We first show the runtime system performance bottleneck of scheduling tasks oblivious to the nature of home caches. We then demonstrate a technique in which the runtime system controls the assignment of home caches to memory blocks and schedules tasks to minimize home cache access penalties. Test results of our technique have shown a significant execution time performance improvement on selected benchmarks leading to the conclusion that by taking processor architecture features into account, task-based programming models can indeed provide continued performance and allow programmers to smoothly transit from the multicore to manycore era.
引用
收藏
页码:357 / 367
页数:11
相关论文
共 50 条
  • [1] Energy-Efficient Task Scheduling in Manycore Processors with Frequency Scaling Overhead
    Eitschberger, Patrick
    Keller, Joerg
    [J]. 23RD EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2015), 2015, : 541 - 548
  • [2] Locality-Aware Task Scheduling and Data Distribution for OpenMP Programs on NUMA Systems and Manycore Processors
    Muddukrishna, Ananya
    Jonsson, Peter A.
    Brorsson, Mats
    [J]. SCIENTIFIC PROGRAMMING, 2015, 2015
  • [3] Adaptive Task Scheduling on Multicore Processors
    Nour, Samar
    Mahmoud, Shahira
    Saleh, Mohamed
    [J]. INTERNATIONAL CONFERENCE ON ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS (AMLTA2018), 2018, 723 : 575 - 584
  • [4] Task scheduling algorithms for heterogeneous processors
    Topcuoglu, H
    Hariri, S
    Wu, MY
    [J]. (HCW '99) - EIGHTH HETEROGENEOUS COMPUTING WORKSHOP, PROCEEDINGS, 1999, : 3 - 14
  • [5] Hierarchical Scheduling of DAG Structured Computations on Manycore Processors with Dynamic Thread Grouping
    Xia, Yinglong
    Prasanna, Viktor K.
    Li, James
    [J]. JOB SCHEDULING STRATEGIES FOR PARALLEL PROCESSING, 2010, 6253 : 154 - +
  • [6] On Linear Learning with Manycore Processors
    Wszola, Eliza
    Mendler-Dunner, Celestine
    Jaggi, Martin
    Pueschel, Markus
    [J]. 2019 IEEE 26TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC), 2019, : 184 - 194
  • [7] Reinventing EDA with manycore processors
    Sapatnekar, Sachin
    Devgan, Anirudh
    Pryor, Duaine
    Haritan, Eshel
    Kirkpatrick, Desmond A.
    Keutzer, Kurt
    Meier, Stephen
    Spyrou, Tom
    [J]. 2008 45TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, VOLS 1 AND 2, 2008, : 126 - +
  • [8] Task scheduling on minimal processors with genetic algorithms
    Yao, WS
    You, JY
    [J]. PROCEEDINGS OF THE 6TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2002, : 210 - 214
  • [9] Rethinking Caches for Throughput Processors
    Keckler, Stephen W.
    [J]. COMMUNICATIONS OF THE ACM, 2014, 57 (12) : 90 - 90
  • [10] Prediction caches for superscalar processors
    Bennett, JE
    Flynn, MJ
    [J]. THIRTIETH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, PROCEEDINGS, 1997, : 81 - 90