Next Generation Workload Management System For Big Data on Heterogeneous Distributed Computing

被引:17
|
作者
Klimentov, A. [1 ]
Buncic, P. [2 ]
De, K. [3 ,4 ]
Jha, S.
Maeno, T. [1 ]
Mount, R. [5 ]
Nilsson, P. [1 ]
Oleynik, D. [3 ,4 ]
Panitkin, S. [1 ]
Petrosyan, A. [3 ,4 ]
Porter, R. J. [6 ]
Read, K. F. [7 ]
Vaniachine, A. [8 ]
Wells, J. C. [7 ]
Wenaus, T. [1 ]
机构
[1] Brookhaven Natl Lab, Upton, NY 11973 USA
[2] CERN, Geneva, Switzerland
[3] Univ Texas Arlington, Arlington, TX 76019 USA
[4] Rutgers State Univ, Piscataway, NJ USA
[5] SLAC Natl Accelerator Lab, Menlo Pk, CA USA
[6] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
[7] Oak Ridge Natl Lab, Oak Ridge, TN USA
[8] Argonne Natl Lab Lemont, Argonne, IL USA
关键词
D O I
10.1088/1742-6596/608/1/012040
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The Large Hadron Collider (LHC), operating at the international CERN Laboratory in Geneva, Switzerland, is leading Big Data driven scientific explorations. Experiments at the LHC explore the fundamental nature of matter and the basic forces that shape our universe, and were recently credited for the discovery of a Higgs boson. ATLAS and ALICE are the largest collaborations ever assembled in the sciences and are at the forefront of research at the LHC. To address an unprecedented multi-petabyte data processing challenge, both experiments rely on a heterogeneous distributed computational infrastructure. The ATLAS experiment uses PanDA (Production and Data Analysis) Workload Management System (WMS) for managing the workflow for all data processing on hundreds of data centers. Through PanDA, ATLAS physicists see a single computing facility that enables rapid scientific breakthroughs for the experiment, even though the data centers are physically scattered all over the world. The scale is demonstrated by the following numbers: PanDA manages O(10(2)) sites, O(10(5)) cores, O(10(8)) jobs per year, O(10(3)) users, and ATLAS data volume is O(10(17)) bytes. In 2013 we started an ambitious program to expand PanDA to all available computing resources, including opportunistic use of commercial and academic clouds and Leadership Computing Facilities (LCF). The project titled 'Next Generation Workload Management and Analysis System for Big Data' (BigPanDA) is funded by DOE ASCR and HEP. Extending PanDA to clouds and LCF presents new challenges in managing heterogeneity and supporting workflow. The BigPanDA project is underway to setup and tailor PanDA at the Oak Ridge Leadership Computing Facility (OLCF) and at the National Research Center "Kurchatov Institute" together with ALICE distributed computing and ORNL computing professionals. Our approach to integration of HPC platforms at the OLCF and elsewhere is to reuse, as much as possible, existing components of the PanDA system. We will present our current accomplishments with running the PanDA WMS at OLCF and other supercomputers and demonstrate our ability to use PanDA as a portal independent of the computing facilities infrastructure for High Energy and Nuclear Physics as well as other data-intensive science applications.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] MANAGEMENT SYSTEM PROTOTYPE FOR INTELLIGENT MOBILE CLOUD COMPUTING FOR BIG DATA
    Hussien, Nur Syahela
    Sulaiman, Sarina
    Shamsuddin, Siti Mariyam
    JURNAL TEKNOLOGI, 2016, 78 (12-2): : 19 - 28
  • [42] The Big Data Analysis of the Next Generation Video Surveillance System for Public Security
    Xu, Zheng
    Yan, Zhiguo
    Mei, Lin
    Zhang, Hui
    E-LIFE: WEB-ENABLED CONVERGENCE OF COMMERCE, WORK, AND SOCIAL LIFE, WEB 2015, 2016, 258 : 171 - 175
  • [43] E-commerce big data computing platform system based on distributed computing logistics information
    Junmin Hu
    Cluster Computing, 2019, 22 : 13693 - 13702
  • [44] E-commerce big data computing platform system based on distributed computing logistics information
    Hu, Junmin
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 6): : 13693 - 13702
  • [45] Rucio, the next-generation Data Management system in ATLAS
    Serfon, C.
    Barisits, M.
    Beermann, T.
    Garonne, V.
    Goossens, L.
    Lassnig, M.
    Nairz, A.
    Vigne, R.
    NUCLEAR AND PARTICLE PHYSICS PROCEEDINGS, 2016, 273 : 969 - 975
  • [46] A holistic approach for high-level programming of next-generation data-intensive applications targeting distributed heterogeneous computing environment
    Carlini, Emanuele
    Dazzi, Patrizio
    Mordacchini, Matteo
    2ND INTERNATIONAL CONFERENCE ON CLOUD FORWARD: FROM DISTRIBUTED TO COMPLETE COMPUTING, 2016, 97 : 131 - 134
  • [47] Workload Management for Power Efficiency in Heterogeneous Data Centers
    Ruiu, Pietro
    Scionti, Alberto
    Nider, Joel
    Rapoport, Mike
    PROCEEDINGS OF 2016 10TH INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT, AND SOFTWARE INTENSIVE SYSTEMS (CISIS), 2016, : 23 - 30
  • [48] Telescoping Architectures: Evaluating Next-Generation Heterogeneous Computing
    Krommydas, Konstantinos
    Feng, Wu-Chun
    PROCEEDINGS OF 2016 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2016, : 162 - 171
  • [49] Heterogeneous Internet of Things Big Data Analysis System Based on Mobile Edge Computing
    Yang, Lin
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2024,