Adaptive parallel job scheduling with flexible coscheduling

被引:23
|
作者
Frachtenberg, E
Feitelson, DG
Petrini, F
Fernández, J
机构
[1] Los Alamos Natl Lab, Modeling Algorithms & Informat Grp CCS3, Comp & Computat Sci Div, Los Alamos, NM 87545 USA
[2] Hebrew Univ Jerusalem, Sch Comp Sci & Engn, IL-91904 Jerusalem, Israel
[3] Univ Murcia, Fac Informat, Dept Ingn & Tecnol Comp, E-30071 Murcia, Spain
关键词
cluster computing; load balancing; job scheduling; gang scheduling; parallel architectures; flexible coscheduling;
D O I
10.1109/TPDS.2005.130
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many scientific and high-performance computing applications consist of multiple processes running on different processors that communicate frequently. Because of their synchronization needs, these applications can suffer severe performance penalties if their processes are not all coscheduled to run together. Two common approaches to coscheduling jobs are batch scheduling, wherein nodes are dedicated for the duration of the run, and gang scheduling, wherein time slicing is coordinated across processors. Both work well when jobs are load-balanced and make use of the entire parallel machine. However, these conditions are rarely met and most realistic workloads consequently suffer from both internal and external fragmentation, in which resources and processors are left idle because jobs cannot be packed with perfect efficiency. This situation leads to reduced utilization and suboptimal performance. Flexible CoScheduling (FCS) addresses this problem by monitoring each job's computation granularity and communication pattern and scheduling jobs based on their synchronization and load-balancing requirements. In particular, jobs that do not require stringent synchronization are identified, and are not coscheduled; instead, these processes are used to reduce fragmentation. FCS has been fully implemented on top of the STORM resource manager on a 256-processor Alpha cluster and compared to batch, gang, and implicit coscheduling algorithms. This paper describes in detail the implementation of FCS and its performance evaluation with a variety of workloads, including large-scale benchmarks, scientific applications, and dynamic workloads. The experimental results show that FCS saturates at higher loads than other algorithms (up to 54 percent higher in some cases), and displays lower response times and slowdown than the other algorithms in nearly all scenarios.
引用
收藏
页码:1066 / 1077
页数:12
相关论文
共 50 条
  • [1] Efficient algorithms for flexible job shop scheduling with parallel machines
    Kubiak, Wieslaw
    Feng, Yanling
    Li, Guo
    Sethi, Suresh P.
    Sriskandarajah, Chelliah
    [J]. NAVAL RESEARCH LOGISTICS, 2020, 67 (04) : 272 - 288
  • [2] Quantum algorithms for process parallel flexible job shop scheduling
    Denkena, Berend
    Schinkel, Fritz
    Pirnay, Jonathan
    Wilmsmeier, Soren
    [J]. CIRP JOURNAL OF MANUFACTURING SCIENCE AND TECHNOLOGY, 2021, 33 : 100 - 114
  • [3] Batch division and parallel scheduling optimization of flexible job shop
    Xu B.
    Fei X.
    Zhang X.
    [J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2016, 22 (08): : 1953 - 1964
  • [4] An Extended Flexible Job Shop Scheduling Problem with Parallel Operations
    Lunardi, Willian Tessaro
    Voos, Holger
    [J]. APPLIED COMPUTING REVIEW, 2018, 18 (02): : 46 - 56
  • [5] Adaptive Representation for Flexible Job-Shop Scheduling and Rescheduling
    Unachak, Prakarn
    Goodman, Erik
    [J]. WORLD SUMMIT ON GENETIC AND EVOLUTIONARY COMPUTATION (GEC 09), 2009, : 511 - 516
  • [6] Adaptive multimeme algorithm for flexible job shop scheduling problem
    Zuo, Yi
    Gong, Maoguo
    Jiao, Licheng
    [J]. NATURAL COMPUTING, 2017, 16 (04) : 677 - 698
  • [7] A kind of adaptive negotiation mechanism for flexible job shop scheduling
    Lv, Cixing
    Sheng, Zhongqi
    Li, Sufen
    Yin, Chaowan
    Zhu, Yunlong
    [J]. COMPUTATIONAL INTELLIGENCE, PT 2, PROCEEDINGS, 2006, 4114 : 324 - 333
  • [8] A Dynamic Adaptive Firefly Algorithm for Flexible Job Shop Scheduling
    Devi, K. Gayathri
    Mishra, R. S.
    Madan, A. K.
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 31 (01): : 429 - 448
  • [9] Adaptive multimeme algorithm for flexible job shop scheduling problem
    Yi Zuo
    Maoguo Gong
    Licheng Jiao
    [J]. Natural Computing, 2017, 16 : 677 - 698
  • [10] An Improved Adaptive Genetic Algorithm in Flexible Job Shop Scheduling
    Huang Ming
    Wang Lu-ming
    Liang Xu
    [J]. PROCEEDINGS OF 2016 2ND INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTERNET OF THINGS (CCIOT), 2016, : 177 - 184