Multicore job scheduling in the Worldwide LHC Computing Grid

被引:1
|
作者
Forti, A. [1 ]
Perez-Calero Yzquierdo, A. [2 ,3 ]
Hartmann, T. [4 ]
Alef, M. [4 ]
Lahiff, A. [5 ]
Templon, J. [6 ]
Dal Pra, S. [7 ]
Gila, M. [8 ]
Skipsey, S. [9 ]
Acosta-Silva, C. [2 ,10 ]
Filipcic, A. [11 ]
Walker, R. [12 ]
Walker, C. J. [13 ]
Traynor, D.
Gadrat, S. [14 ]
机构
[1] Univ Manchester, Sch Phys & Astron, Oxford Rd, Manchester M13 9PL, Lancs, England
[2] Univ Autonoma Barcelona, PIC, E-08193 Barcelona, Spain
[3] CIEMAT, Ctr Invest Energet Medioamb & Tecnol, E-28040 Madrid, Spain
[4] Karlsruhe Inst Technol, Steinbuch Ctr Comp, D-76021 Karlsruhe, Germany
[5] Rutherford Appleton Lab, Didcot OX11 0QX, Oxon, England
[6] Natl Inst Subatom Phys, NL-1098 XG Amsterdam, Netherlands
[7] INFN CNAF, I-40127 Bologna, Italy
[8] ETH Zentrum, RZ, Swiss Ctr Sci Comp, CH-8092 Zurich, Switzerland
[9] Univ Glasgow, Sch Phys & Astron, Glasgow G12 8QQ, Lanark, Scotland
[10] Univ Autonoma Barcelona, IFAE, E-08193 Barcelona, Spain
[11] Jozef Stefan Inst, Ljubljana 1000, Slovenia
[12] Univ Munich, Fak Phys, D-80799 Munich, Germany
[13] Queen Mary Univ London, Sch Phys & Astron, London E1 4NS, England
[14] Ctr Calcul IN2P3, F-69622 Lyon, France
关键词
D O I
10.1088/1742-6596/664/6/062016
中图分类号
O57 [原子核物理学、高能物理学];
学科分类号
070202 ;
摘要
After the successful first run of the LHC, data taking is scheduled to restart in Summer 2015 with experimental conditions leading to increased data volumes and event complexity. In order to process the data generated in such scenario and exploit the multicore architectures of current CPUs, the LHC experiments have developed parallelized software for data reconstruction and simulation. However, a good fraction of their computing effort is still expected to be executed as single-core tasks. Therefore, jobs with diverse resources requirements will be distributed across the Worldwide LHC Computing Grid (WLCG), making workload scheduling a complex problem in itself. In response to this challenge, the WLCG Multicore Deployment Task Force has been created in order to coordinate the joint effort from experiments and WLCG sites. The main objective is to ensure the convergence of approaches from the different LHC Virtual Organizations (VOs) to make the best use of the shared resources in order to satisfy their new computing needs, minimizing any inefficiency originated from the scheduling mechanisms, and without imposing unnecessary complexities in the way sites manage their resources. This paper describes the activities and progress of the Task Force related to the aforementioned topics, including experiences from key sites on how to best use different batch system technologies, the evolution of workload submission tools by the experiments and the knowledge gained from scale tests of the different proposed job submission strategies.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] The LHC computing grid project at CERN
    Lamanna, M
    [J]. NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2004, 534 (1-2): : 1 - 6
  • [42] Job Scheduling in a Grid Cluster
    Skenteridou, Kyriaki
    Karatza, Helen D.
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTER, INFORMATION AND TELECOMMUNICATION SYSTEMS (CITS), 2015,
  • [43] Job scheduling for a computing system
    Komandrovskii, VG
    [J]. AUTOMATION AND REMOTE CONTROL, 2005, 66 (12) : 1929 - 1936
  • [44] Job Scheduling for a Computing System
    V. G. Komandrovskii
    [J]. Automation and Remote Control, 2005, 66 : 1929 - 1936
  • [45] An enhanced meta-scheduling system for grid computing that considers the job type and priority
    Al-Khateeb, Asef
    Rashid, Nur'Aini Abdul
    Abdullah, Rosni
    [J]. COMPUTING, 2012, 94 (05) : 389 - 410
  • [46] Strategic Oscillation for Exploitation and Exploration of ACS Algorithm for Job Scheduling in Static Grid Computing
    Alobaedy, Mustafa Muwafak
    Ku-Mahamud, Ku Ruhana
    [J]. 2015 SECOND INTERNATIONAL CONFERENCE ON COMPUTING TECHNOLOGY AND INFORMATION MANAGEMENT (ICCTIM), 2015, : 87 - 92
  • [47] A novel algorithm for fault tolerant job Scheduling and load balancing in grid computing environment
    Naik, K. Jairam
    Jagan, A.
    Narayana, N. Satya
    [J]. 2015 INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND INTERNET OF THINGS (ICGCIOT), 2015, : 1113 - 1118
  • [48] A novel multi-agent reinforcement learning approach for job scheduling in Grid computing
    Wu, Jun
    Xu, Xin
    Zhang, Pengcheng
    Liu, Chunming
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2011, 27 (05): : 430 - 439
  • [49] An analysis of MIPS group based job scheduling algorithm with other algorithms in grid computing
    Gomathi, S.
    Manimegalai, D.
    [J]. International Journal of Computer Science Issues, 2011, 8 (6 6-3): : 335 - 340
  • [50] An enhanced meta-scheduling system for grid computing that considers the job type and priority
    Asef Al-Khateeb
    Nur’Aini Abdul Rashid
    Rosni Abdullah
    [J]. Computing, 2012, 94 : 389 - 410