Multicore job scheduling in the Worldwide LHC Computing Grid

Cited by: 1

Authors:

Forti, A. ^{[1]}

Perez-Calero Yzquierdo, A. ^{[2,3]}

Hartmann, T. ^{[4]}

Alef, M. ^{[4]}

Lahiff, A. ^{[5]}

Templon, J. ^{[6]}

Dal Pra, S. ^{[7]}

Gila, M. ^{[8]}

Skipsey, S. ^{[9]}

Acosta-Silva, C. ^{[2,10]}

Filipcic, A. ^{[11]}

Walker, R. ^{[12]}

Walker, C. J. ^{[13]}

Traynor, D.

Gadrat, S. ^{[14]}

Affiliations:

[1] Univ Manchester, Sch Phys & Astron, Oxford Rd, Manchester M13 9PL, Lancs, England

[2] Univ Autonoma Barcelona, PIC, E-08193 Barcelona, Spain

[3] CIEMAT, Ctr Invest Energet Medioamb & Tecnol, E-28040 Madrid, Spain

[4] Karlsruhe Inst Technol, Steinbuch Ctr Comp, D-76021 Karlsruhe, Germany

[5] Rutherford Appleton Lab, Didcot OX11 0QX, Oxon, England

[6] Natl Inst Subatom Phys, NL-1098 XG Amsterdam, Netherlands

[7] INFN CNAF, I-40127 Bologna, Italy

[8] ETH Zentrum, RZ, Swiss Ctr Sci Comp, CH-8092 Zurich, Switzerland

[9] Univ Glasgow, Sch Phys & Astron, Glasgow G12 8QQ, Lanark, Scotland

[10] Univ Autonoma Barcelona, IFAE, E-08193 Barcelona, Spain

[11] Jozef Stefan Inst, Ljubljana 1000, Slovenia

[12] Univ Munich, Fak Phys, D-80799 Munich, Germany

[13] Queen Mary Univ London, Sch Phys & Astron, London E1 4NS, England

[14] Ctr Calcul IN2P3, F-69622 Lyon, France

Source:

21ST INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP2015), PARTS 1-9 | 2015 / Vol. 664

Keywords:

DOI:

10.1088/1742-6596/664/6/062016

Chinese Library Classification (CLC) number:

O57 [Nuclear physics, high-energy physics];

Discipline classification code:

070202;

Abstract:

After the successful first run of the LHC, data taking is scheduled to restart in Summer 2015 with experimental conditions leading to increased data volumes and event complexity. In order to process the data generated in such a scenario and exploit the multicore architectures of current CPUs, the LHC experiments have developed parallelized software for data reconstruction and simulation. However, a good fraction of their computing effort is still expected to be executed as single-core tasks. Therefore, jobs with diverse resource requirements will be distributed across the Worldwide LHC Computing Grid (WLCG), making workload scheduling a complex problem in itself. In response to this challenge, the WLCG Multicore Deployment Task Force has been created to coordinate the joint effort from experiments and WLCG sites. The main objective is to ensure the convergence of approaches from the different LHC Virtual Organizations (VOs) to make the best use of the shared resources in order to satisfy their new computing needs, minimizing any inefficiency originating from the scheduling mechanisms, and without imposing unnecessary complexities on the way sites manage their resources. This paper describes the activities and progress of the Task Force related to the aforementioned topics, including experiences from key sites on how to best use different batch system technologies, the evolution of workload submission tools by the experiments, and the knowledge gained from scale tests of the different proposed job submission strategies.


Pages: 8
