Federated data storage and management infrastructure

被引:4
|
作者
Zarochentsev, A. [1 ]
Kiryanov, A. [2 ,3 ]
Klimentov, A. [3 ,4 ]
Krasnopevtsev, D. [3 ,5 ]
Hristov, P. [6 ]
机构
[1] St Petersburg State Univ, St Petersburg, Russia
[2] Petersburg Nucl Phys Inst, Gatchina, Leningrad Oblas, Russia
[3] Natl Res Ctr, Kurchatov Inst, Moscow, Russia
[4] Brookhaven Natl Lab, Upton, NY 11973 USA
[5] Natl Res Nucl Univ MEPhI, Moscow, Russia
[6] CERN, European Ctr Nucl Res, Geneva, Switzerland
关键词
D O I
10.1088/1742-6596/762/1/012016
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The Large Hadron Collider (LHC), operating at the international CERN Laboratory in Geneva, Switzerland, is leading Big Data driven scientific explorations. Experiments at the LHC explore the fundamental nature of matter and the basic forces that shape our universe. Computing models for the High Luminosity LHC era anticipate a growth of storage needs of at least orders of magnitude; it will require new approaches in data storage organization and data handling. In our project we address the fundamental problem of designing of architecture to integrate a distributed heterogeneous disk resources for LHC experiments and other data intensive science applications and to provide access to data from heterogeneous computing facilities. We have prototyped a federated storage for Russian T1 and T2 centers located in Moscow, St.-Petersburg and Gatchina, as well as Russian / CERN federation. We have conducted extensive tests of underlying network infrastructure and storage endpoints with synthetic performance measurement tools as well as with HENP-specific workloads, including the ones running on supercomputing platform, cloud computing and Grid for ALICE and ATLAS experiments. We will present our current accomplishments with running LHC data analysis remotely and locally to demonstrate our ability to efficiently use federated data storage experiment wide within National Academic facilities for High Energy and Nuclear Physics as well as for other data-intensive science applications, such as bio-infomatics.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Web Infrastructure for Data Management, Storage and Computation
    Brosset, Serge
    Dumont, Maxime
    Cevidanes, Lucia
    Soroushmehr, Reza
    Bianchi, Jonas
    Gurgel, Marcela
    Deleat-Besson, Romain
    Le, Celia
    Ruellas, Antonio
    Yatabe, Marilia
    Chaves Junior, Cauby
    Gomes, Liliane
    Goncalves, Joao
    Najarian, Kayvan
    Gryak, Jonathan
    Styner, Martin
    Paniagua, Beatriz
    Prieto, Juan Carlos
    MEDICAL IMAGING 2021: BIOMEDICAL APPLICATIONS IN MOLECULAR, STRUCTURAL, AND FUNCTIONAL IMAGING, 2021, 11600
  • [2] A Federated Infrastructure for European Data Spaces
    Otto, Boris
    COMMUNICATIONS OF THE ACM, 2022, 65 (04) : 44 - 45
  • [3] Energy Efficient Big Data Infrastructure Management in Geo-Federated Cloud Data Centers
    Subbiah, Sankari
    Varalakshmi, Perumal
    Prarthana, R.
    Devi, Renuka C.
    SECOND INTERNATIONAL SYMPOSIUM ON COMPUTER VISION AND THE INTERNET (VISIONNET'15), 2015, 58 : 151 - 157
  • [4] Efficient Attribute Management in a Federated Identity Management Infrastructure
    Berbecaru, Diana
    Lioy, Antonio
    2016 24TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP), 2016, : 590 - 595
  • [5] Towards a Federated Infrastructure for the Global Data Pipeline
    Hofman, Wout
    OPEN AND BIG DATA MANAGEMENT AND INNOVATION, I3E 2015, 2015, 9373 : 479 - 490
  • [6] Towards a Semantic Data Harmonization Federated Infrastructure
    Martinez-Costa, Catalina
    Abad-Navarro, Francisco
    PUBLIC HEALTH AND INFORMATICS, PROCEEDINGS OF MIE 2021, 2021, 281 : 38 - 42
  • [7] The Fermilab data storage infrastructure
    Bakken, J
    Berman, E
    Huang, CH
    Moibenko, A
    Petravick, D
    Zalokar, M
    20TH IEEE/11TH NASA GODDARD CONFERENCE ON MASS STORAGE AND TECHNOLOGIES (MSST 2003), PROCEEDINGS, 2003, : 101 - 104
  • [8] Data Management for Federated Biobanks
    Eder, Johann
    Dabringer, Claus
    Schicho, Michaela
    Stark, Konrad
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2009, 5690 : 184 - +
  • [9] Optimizing Federated Reinforcement Learning Algorithm for Data Management of Distributed Energy Storage Network
    Li, Yuan
    Li, Yuancheng
    RECENT ADVANCES IN ELECTRICAL & ELECTRONIC ENGINEERING, 2024,
  • [10] Data On-boarding in Federated Storage Clouds
    Vernik, Gil
    Shulman-Peleg, Alexandra
    Dippl, Sebastian
    Formisano, Ciro
    Jaeger, Michael C.
    Kolodner, Elliot K.
    Villari, Massimo
    2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2013), 2013, : 244 - 251