Data-Driven Locality-Aware Batch Scheduling

被引:0
|
作者
Gonthier, Maxime [1 ,2 ,3 ]
Larsson, Elisabeth [4 ]
Marchal, Loris [5 ]
Nettelblad, Carl [4 ]
Thibault, Samuel [6 ]
机构
[1] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
[2] ENS Lyon, Lyon, France
[3] INRIA, Le Chesnay Rocquencourt, France
[4] Uppsala Univ, Dept Informat Technol, Uppsala, Sweden
[5] ENS Lyon, CNRS, Lyon, France
[6] Univ Bordeaux, LaBRI, Talence, France
关键词
Batch scheduling; Job input sharing; Data aware; Job scheduling; High Performance Data Analytics; SLURM;
D O I
10.1109/IPDPSW63119.2024.00058
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Clusters employ workload schedulers such as the Sturm Workload Manager to allocate computing jobs onto nodes. These schedulers usually aim at a good trade-off between increasing resource utilization and user satisfaction (decreasing job waiting time). However, these schedulers are typically unaware of jobs sharing large input files, which may happen in data intensive scenarios. The same input files may end up being loaded several times, leading to a waste of resources. We study how to design a data-aware job scheduler that is able to keep large input files on the computing nodes, without impacting other memory needs, and can benefit from previously-loaded tiles to decrease data transfers in order to reduce the waiting times ofjobs. We present three schedulers capable of distributing the load between the computing nodes as well as re-using input files already loaded in the memory of some node as much as possible. We perform simulations with single node jobs using traces of real HPC-cluster usage, to compare them to classical job schedulers. The results show that keeping data in local memory between successive jobs and using data -locality information to schedule jobs improves performance compared to a widely -used scheduler (FCFS, with and without backfilling): a reduction in job waiting time (a 7.5% improvement in stretch), and a decrease in the amount of data transfers (7%).
引用
收藏
页码:202 / 211
页数:10
相关论文
共 50 条
  • [41] Data-driven Spatial Locality
    Miucin, Svetozar
    Fedorova, Alexandra
    [J]. PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS (MEMSYS 2018), 2018, : 243 - 253
  • [42] LAPT: A locality-aware page table for thread and data mapping
    Cruz, Eduardo H. M.
    Diener, Matthias
    Alves, Marco A. Z.
    Pilla, Laercio L.
    Navaux, Philippe O. A.
    [J]. PARALLEL COMPUTING, 2016, 54 : 59 - 71
  • [43] Zeus: Locality-aware Distributed Transactions
    Katsarakis, Antonios
    Ma, Yijun
    Tan, Zhaowei
    Bainbridge, Andrew
    Balkwill, Matthew
    Dragojevic, Aleksandar
    Grot, Boris
    Radunovic, Bozidar
    Zhang, Yongguang
    [J]. PROCEEDINGS OF THE SIXTEENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS (EUROSYS '21), 2021, : 145 - 161
  • [44] Extending smart containers for data locality-aware skeleton programming
    Ernstsson, August
    Kessler, Christoph
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (05):
  • [45] Locality-Aware Data Replication in the Last-Level Cache
    Kurian, George
    Devadas, Srinivas
    Khan, Omer
    [J]. 2014 20TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA-20), 2014, : 1 - 12
  • [46] Locality-aware policies to improve job scheduling on 3D tori
    Pascual, Jose A.
    Miguel-Alonso, Jose
    Lozano, Jose A.
    [J]. JOURNAL OF SUPERCOMPUTING, 2015, 71 (03): : 966 - 994
  • [47] Locality-aware policies to improve job scheduling on 3D tori
    Jose A. Pascual
    Jose Miguel-Alonso
    Jose A. Lozano
    [J]. The Journal of Supercomputing, 2015, 71 : 966 - 994
  • [48] An Locality-Aware Scheduling Based on a Novel Scheduling Model to Improve System Throughput of MapReduce Cluster
    Zhao, Hui
    Yang, Shuqiang
    Chen, Zhikun
    Yin, Hong
    Jin, Songchang
    [J]. PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 111 - 115
  • [49] BOLAS: Bipartite-graph Oriented Locality-Aware Scheduling for MapReduce Tasks
    Xue, Ruini
    Gao, Shengli
    Ao, Lixiang
    Guan, Zhongyang
    [J]. 2015 14TH INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC), 2015, : 37 - 45
  • [50] Locality-Aware Laplacian Mesh Smoothing
    Aupy, Guillaume
    Park, JeongHyung
    Raghavan, Padma
    [J]. PROCEEDINGS 45TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING - ICPP 2016, 2016, : 588 - 597