Impact of memory contention on dynamic scheduling on NUMA multiprocessors

Cited by: 9
Authors
Durand, D
Montaut, T
Kervella, L
Jalby, W
Affiliations
[1] UNIV VERSAILLES,LAB MASI,F-78000 VERSAILLES,FRANCE
[2] INST RECH INFORMAT & SYST ALEATOIRES,F-35042 RENNES,FRANCE
Funding
National Science Foundation (USA);
Keywords
dynamic scheduling; load balancing; memory performance; NUMA multiprocessors; self-scheduling;
DOI
10.1109/71.544359
Chinese Library Classification number
TP301 [Theory, methods];
Discipline classification code
081202;
Abstract
Self-scheduling is a method for task scheduling in parallel programs, in which each processor acquires a new block of tasks for execution whenever it becomes idle. To get the best performance, the block size must be chosen to balance the scheduling overhead against the load imbalance. To determine the best block size, a better understanding of the role of load imbalance in self-scheduling performance is needed. In this paper we study the effect of memory contention on task duration distributions and, hence, load balancing in self-scheduling on a Nonuniform Memory Access (NUMA) machine. Experimental studies on a BBN TC2000 are used to reveal the strengths and weaknesses of analytical performance models to predict running time and optimal block size. The models are shown to be very accurate for small block sizes. However, the models fail when the block size is large due to a previously unrecognized source of load imbalance. We extend the analytical models to address this failure. The implications for the construction of compilers and runtime systems are discussed.
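The trade-off described in the abstract can be illustrated with a minimal sketch. This is not the paper's analytical model and it does not model memory contention: it assumes a fixed per-block scheduling overhead and task durations known in advance, and the names `simulate_self_scheduling`, `durations`, `block_size`, and `overhead` are hypothetical, chosen for illustration only.

```python
import heapq


def simulate_self_scheduling(durations, num_procs, block_size, overhead):
    """Return the finishing time of a simple self-scheduled run.

    Assumptions (illustrative, not taken from the paper):
    - tasks are handed out in order, block_size tasks at a time;
    - each block acquisition costs a fixed `overhead`;
    - task durations are fixed and unaffected by memory contention.
    """
    # Each heap entry is (time at which the processor becomes idle, processor id).
    idle_times = [(0.0, p) for p in range(num_procs)]
    heapq.heapify(idle_times)

    for start in range(0, len(durations), block_size):
        # The processor that becomes idle first acquires the next block.
        now, proc = heapq.heappop(idle_times)
        block = durations[start:start + block_size]
        finish = now + overhead + sum(block)
        heapq.heappush(idle_times, (finish, proc))

    # The run ends when the last processor finishes its last block.
    return max(t for t, _ in idle_times)


if __name__ == "__main__":
    tasks = [1.0] * 8
    for k in (1, 2, 4, 8):
        total = simulate_self_scheduling(tasks, num_procs=2, block_size=k, overhead=0.5)
        print(f"block size {k}: finishing time {total}")
```

In this sketch, small blocks pay the overhead many times, while a block as large as the whole task set leaves one processor idle; the paper's contribution concerns how memory contention changes the task duration distribution, which this toy model deliberately leaves out.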
Pages: 1201-1214
Page count: 14
Related papers
50 in total
  • [21] Home-cluster based processor scheduling with page placement for NUMA multiprocessors
    Koita, T
    Katayama, T
    Saisho, K
    Fukuda, A
    INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, PROCEEDINGS, 1999, : 539 - 545
  • [22] Kernel mechanisms with dynamic task-aware scheduling to reduce resource contention in NUMA multi-core systems
    Chiang, Mei-Ling
    Yang, Chieh-Jui
    Tu, Shu-Wei
    JOURNAL OF SYSTEMS AND SOFTWARE, 2016, 121 : 72 - 87
  • [23] Design and analysis of static memory management policies for CC-NUMA multiprocessors
    Iyer, R
    Wang, HJ
    Bhuyan, LN
    JOURNAL OF SYSTEMS ARCHITECTURE, 2002, 48 (1-3) : 59 - 80
  • [24] Scheduling parallel tasks onto NUMA multiprocessors with inter-processor communication overhead
    Lai, GJ
    Fang, JF
    Sung, PS
    Pean, DL
    PARALLEL AND DISTRIBUTED PROCESSING AND APPLICATIONS, PROCEEDINGS, 2003, 2745 : 65 - 75
  • [25] Accounting for memory bank contention and delay in high-bandwidth multiprocessors
    Blelloch, GE
    Gibbons, PB
    Matias, Y
    Zagha, M
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1997, 8 (09) : 943 - 958
  • [26] Fair scheduling of dynamic task systems on multiprocessors
    Srinivasan, A
    Anderson, JH
    JOURNAL OF SYSTEMS AND SOFTWARE, 2005, 77 (01) : 67 - 80
  • [27] PAGE PLACEMENT POLICIES FOR NUMA MULTIPROCESSORS
    LAROWE, RP
    ELLIS, CS
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1991, 11 (02) : 112 - 129
  • [28] Scheduling Support for Transactional Memory Contention Management
    Maldonado, Walther
    Marlier, Patrick
    Felber, Pascal
    Suissa, Adi
    Hendler, Danny
    Fedorova, Alexandra
    Lawall, Julia L.
    Muller, Gilles
    PPOPP 2010: PROCEEDINGS OF THE 2010 ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, 2010, : 79 - 89
  • [29] Scheduling Support for Transactional Memory Contention Management
    Maldonado, Walther
    Marlier, Patrick
    Felber, Pascal
    Suissa, Adi
    Hendler, Danny
    Fedorova, Alexandra
    Lawall, Julia L.
    Muller, Gilles
    ACM SIGPLAN NOTICES, 2010, 45 (05) : 79 - 89
  • [30] RESOURCE CONTENTION IN SHARED-MEMORY MULTIPROCESSORS - A PARAMETERIZED PERFORMANCE DEGRADATION MODEL
    NANDA, AK
    SHING, HD
    TZEN, TH
    NI, LM
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1991, 12 (04) : 313 - 328