Task Scheduling on Manycore Processors with Home Caches

Cited by: 0
Authors
Muddukrishna, Ananya [1]
Podobas, Artur [1]
Brorsson, Mats [1]
Vlassov, Vladimir [1]
Affiliations
[1] KTH Royal Inst Technol, Stockholm, Sweden
DOI
Not available
Chinese Library Classification number
TP31 [Computer Software]
Subject classification codes
081202; 0835
Abstract
Modern manycore processors feature a highly scalable, software-configurable cache hierarchy. To obtain good performance, manycore programmers must not only use the large number of cores efficiently but also understand and configure the cache hierarchy to suit the application. Task-based programming models can relieve programmers of this burden: programmers express parallelism as tasks, and an architecture-specific runtime system maps tasks to cores and also configures the cache hierarchy. In this paper, we focus on the cache hierarchy of the Tilera TILEPro64 processor, which features a software-configurable coherence waypoint called the home cache. We first show the performance bottleneck that arises when the runtime system schedules tasks without regard to the nature of home caches. We then demonstrate a technique in which the runtime system controls the assignment of home caches to memory blocks and schedules tasks to minimize home cache access penalties. Tests of our technique show a significant improvement in execution time on selected benchmarks. We conclude that, by taking processor architecture features into account, task-based programming models can provide continued performance and allow programmers to transition smoothly from the multicore to the manycore era.
Pages: 357-367
Number of pages: 11
Related papers
50 in total
  • [21] OPTIMAL TASK-SCHEDULING ON DISTRIBUTED PARALLEL PROCESSORS
    CHANG, CS
    NELSON, R
    YAO, DD
    [J]. PERFORMANCE EVALUATION, 1994, 20 (1-3) : 207 - 221
  • [22] Scheduling Task to Heterogeneous Processors by Modified ACO Algorithm
    Premkumar, M.
    Babu, V. Srikanth
    Somwya, R.
    [J]. SOFT COMPUTING IN DATA ANALYTICS, SCDA 2018, 2019, 758 : 565 - 576
  • [23] Efficient task scheduling with duplication for bounded number of processors
    Pasham, S
    Lin, WM
    [J]. 11th International Conference on Parallel and Distributed Systems, Vol I, Proceedings, 2005, : 543 - 549
  • [24] Dynamic on-line task scheduling on parallel processors
    Xia, CH
    Michailidis, G
    Bambos, N
    [J]. PERFORMANCE EVALUATION, 2001, 46 (2-3) : 219 - 233
  • [25] Task Scheduling for Many-Cores with S-NUCA Caches
    Pathania, Anuj
    Henkel, Joerg
    [J]. PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2018, : 557 - 562
  • [26] Scheduling parallel tasks with energy and time constraints on multiple manycore processors in a cloud computing environment
    Li, Keqin
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 82 : 591 - 605
  • [27] A profiling based task scheduling approach for multicore network processors
    Tang, Feilong
    You, Ilsun
    Tang, Can
    Yu, Shui
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (04) : 855 - 869
  • [28] Task scheduling algorithm for interconnection constrained network of heterogeneous processors
    Ilavarasan, E.
    Thambidurai, P.
    Punithavathi, N.
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2004, 3356 : 31 - 39
  • [29] Task scheduling by limited duplication on a bounded set of heterogeneous processors
    Yin, Fei
    Du, Xiaoli
    Jiang, Changjun
    Deng, Rong
    [J]. DCABES 2007 PROCEEDINGS, VOLS I AND II, 2007, : 454 - 458
  • [30] A cluster-based strategy for scheduling task on heterogeneous processors
    Boeres, C
    Viterbo, J
    Rebello, VEF
    [J]. 16TH SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 2004, : 214 - 221