Data-Driven Locality-Aware Batch Scheduling

被引:0
|
作者
Gonthier, Maxime [1 ,2 ,3 ]
Larsson, Elisabeth [4 ]
Marchal, Loris [5 ]
Nettelblad, Carl [4 ]
Thibault, Samuel [6 ]
机构
[1] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
[2] ENS Lyon, Lyon, France
[3] INRIA, Le Chesnay Rocquencourt, France
[4] Uppsala Univ, Dept Informat Technol, Uppsala, Sweden
[5] ENS Lyon, CNRS, Lyon, France
[6] Univ Bordeaux, LaBRI, Talence, France
关键词
Batch scheduling; Job input sharing; Data aware; Job scheduling; High Performance Data Analytics; SLURM;
D O I
10.1109/IPDPSW63119.2024.00058
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Clusters employ workload schedulers such as the Sturm Workload Manager to allocate computing jobs onto nodes. These schedulers usually aim at a good trade-off between increasing resource utilization and user satisfaction (decreasing job waiting time). However, these schedulers are typically unaware of jobs sharing large input files, which may happen in data intensive scenarios. The same input files may end up being loaded several times, leading to a waste of resources. We study how to design a data-aware job scheduler that is able to keep large input files on the computing nodes, without impacting other memory needs, and can benefit from previously-loaded tiles to decrease data transfers in order to reduce the waiting times ofjobs. We present three schedulers capable of distributing the load between the computing nodes as well as re-using input files already loaded in the memory of some node as much as possible. We perform simulations with single node jobs using traces of real HPC-cluster usage, to compare them to classical job schedulers. The results show that keeping data in local memory between successive jobs and using data -locality information to schedule jobs improves performance compared to a widely -used scheduler (FCFS, with and without backfilling): a reduction in job waiting time (a 7.5% improvement in stretch), and a decrease in the amount of data transfers (7%).
引用
收藏
页码:202 / 211
页数:10
相关论文
共 50 条
  • [1] Taming Big Data SVM with Locality-Aware Scheduling
    Ye, Mao
    Wang, Jun
    Yin, Jiangling
    Han, Dezhi
    [J]. 2016 FOURTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD 2016), 2016, : 37 - 44
  • [2] Locality-Aware Mapping and Scheduling for Multicores
    Ding, Wei
    Zhang, Yuanrui
    Kandemir, Mahmut
    Srinivas, Jithendra
    Yedlapalli, Praveen
    [J]. PROCEEDINGS OF THE 2013 IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO), 2013, : 335 - 346
  • [3] Data-Driven Batch Scheduling
    Bent, John
    Denehy, Timothy E.
    Livny, Miron
    Arpaci-Dusseau, Andrea C.
    Arpaci-Dusseau, Remzi H.
    [J]. DADC 2009: SECOND INTERNATIONAL WORKSHOP ON DATA AWARE DISTRIBUTED COMPUTING, 2009, : 1 - 10
  • [4] Locality-aware predictive scheduling of network processors
    Wolf, T
    Franklin, MA
    [J]. ISPASS: 2001 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, 2001, : 152 - 159
  • [5] Locality-aware process scheduling for embedded MPSoCs
    Kandemir, M
    Chen, GL
    [J]. DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 870 - 875
  • [6] Locality-Aware Scheduling for Scalable Heterogeneous Environments
    Kamatar, Alok, V
    Friese, Ryan D.
    Gioiosa, Roberto
    [J]. PROCEEDINGS OF 2020 10TH IEEE/ACM INTERNATIONAL WORKSHOP ON RUNTIME AND OPERATING SYSTEMS FOR SUPERCOMPUTERS (ROSS 2020), 2020, : 50 - 58
  • [7] Locality-Aware Scheduling for Containers in Cloud Computing
    Zhao, Dongfang
    Mohamed, Mohamed
    Ludwig, Heiko
    [J]. IEEE TRANSACTIONS ON CLOUD COMPUTING, 2020, 8 (02) : 635 - 646
  • [8] Locality-Aware CTA Scheduling for Gaming Applications
    Ukarande, Aditya
    Patidar, Suryakant
    Rangan, Ram
    [J]. ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2022, 19 (01)
  • [9] Locality-Aware Scheduling for Containers in Cloud Computing
    Babu, G. Charles
    Hanuman, A. Sai
    Kiran, J. Sasi
    Babu, B. Sankara
    [J]. INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES, ICICCT 2019, 2020, 89 : 177 - 185
  • [10] Locality-Aware Dynamic Task Graph Scheduling
    Maglalang, Jordyn
    Krishnamoorthy, Sriram
    Agrawal, Kunal
    [J]. 2017 46TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2017, : 70 - 80