Data-Driven Job Dispatching in HPC Systems

Cited by: 12
Authors
Galleguillos, Cristian [1, 2]
Sirbu, Alina [3]
Kiziltan, Zeynep [1]
Babaoglu, Ozalp [1]
Borghesi, Andrea [1]
Bridi, Thomas [1]
Affiliations
[1] Univ Bologna, Dept Comp Sci & Engn, Bologna, Italy
[2] Pontificia Univ Catolica Valparaiso, Escuela Ingn Informat, Valparaiso, Chile
[3] Univ Pisa, Dept Comp Sci, Pisa, Italy
DOI
10.1007/978-3-319-72926-8_37
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
As High Performance Computing (HPC) systems get closer to exascale performance, job dispatching strategies become critical for keeping system utilization high while keeping waiting times low for jobs competing for HPC system resources. In this paper, we take a data-driven approach and investigate whether better dispatching decisions can be made by transforming the log data produced by an HPC system into useful knowledge about its workload. In particular, we focus on job duration, develop a data-driven approach to job duration prediction, and analyze the effect of different prediction approaches in making dispatching decisions using a real workload dataset collected from Eurora, a hybrid HPC system. Experiments on various dispatching methods show promising results.
Pages: 449-461
Page count: 13