Planning for distributed workflows: constraint-based coscheduling of computational jobs and data placement in distributed environments

被引:0
|
作者
Makatun, Dzmitry [1 ,3 ]
Lauret, Jerome [2 ]
Rudova, Hana [4 ]
Sumbera, Michal [3 ]
机构
[1] Czech Tech Univ, Fac Nucl Phys & Phys Engn, CR-16635 Prague, Czech Republic
[2] Brookhaven Natl Lab, STAR, Upton, NY 11973 USA
[3] Acad Sci Czech Republ, Nucl Phys Inst, Prague, Czech Republic
[4] Masaryk Univ, CS-60177 Brno, Czech Republic
关键词
D O I
10.1088/1742-6596/608/1/012028A
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
When running data intensive applications on distributed computational resources long I/O overheads may be observed as access to remotely stored data is performed. Latencies and bandwidth can become the major limiting factor for the overall computation performance and can reduce the CPU/WallTime ratio to excessive TO wait. Reusing the knowledge of our previous research, we propose a constraint programming based planner that schedules computational jobs and data placements (transfers) in a distributed environment in order to optimize resource utilization and reduce the overall processing completion time. The optimization is achieved by ensuring that none of the resources (network links, data storages and CPUs) are oversaturated at any moment of time and either (a) that the data is pre-placed at the site where the job runs or (b) that the jobs are scheduled where the data is already present. Such an approach eliminates the idle CPU cycles occurring when the job is waiting for the I/O from a remote site and would have wide application in the community. Our planner was evaluated and simulated based on data extracted from log files of batch and data management systems of the STAR experiment. The results of evaluation and estimation of performance improvements are discussed in this paper.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] A distributed constraint-based scheduler
    Lamma, E
    Mello, P
    Milano, M
    ARTIFICIAL INTELLIGENCE IN ENGINEERING, 1997, 11 (02): : 91 - 105
  • [2] Constraint-Based Distributed Planning as an Enabler for Self-Management
    Ghamri-Doudane, Samir
    Fabre, Eric
    Ciavaglia, Laurent
    BELL LABS TECHNICAL JOURNAL, 2010, 15 (03) : 193 - 198
  • [3] Distributed constraint-based local search
    Michel, Laurent
    See, Andrew
    Van Hentenryck, Pascal
    PRINCIPLES AND PRACTICE OF CONSTRAINT PROGRAMMING - CP 2006, 2006, 4204 : 344 - 358
  • [4] Distributed constraint-based railway simulation
    Schlenker, H
    APPLICATIONS OF DECLARATIVE PROGRAMMING AND KNOWLEDGE MANAGEMENT, 2005, 3392 : 215 - 226
  • [5] A Constraint-Based Approach to Automatic Data Partitioning for Distributed Memory Execution
    Lee, Wonchan
    Papadakis, Manolis
    Slaughter, Elliott
    Aiken, Alex
    PROCEEDINGS OF SC19: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2019,
  • [6] Constraint-Based Oracles for Timed Distributed Systems
    Benharrat, Nassim
    Gaston, Christophe
    Hierons, Robert M.
    Lapitre, Arnault
    Le Gall, Pascale
    TESTING SOFTWARE AND SYSTEMS (ICTSS 2017), 2017, 10533 : 276 - 292
  • [7] A constraint-based approach for distributed decision support
    Erschler, J
    Huguet, MJ
    COOP '96 - SECOND INTERNATIONAL WORKSHOP ON THE DESIGN OF COOPERATIVE SYSTEMS, 1996, : 587 - 603
  • [8] Constraint-based protocols for distributed problem solving
    Borghoff, UM
    Pareschi, R
    Arcelli, F
    Formato, F
    SCIENCE OF COMPUTER PROGRAMMING, 1998, 30 (1-2) : 201 - 225
  • [9] Constraint-based deployment of distributed components in a dynamic network
    Hoareau, D
    Mahéo, Y
    ARCHITECTURE OF COMPUTING SYSTEMS - ARCS 2006, PROCEEDINGS, 2006, 3894 : 450 - 464
  • [10] Data placement for scientific applications in distributed environments
    Chervenak, Ann
    Deelman, Ewa
    Livny, Miron
    Su, Mei-Hui
    Schuler, Rob
    Bharathi, Shishir
    Mehta, Gaurang
    Vahi, Karan
    2007 8TH IEEE/ACM INTERNATIONAL CONFERENCE ON GRID COMPUTING, 2007, : 146 - +