Data transfer for STAR grid jobs

被引:0
|
作者
Chakaberia, Irakli [1 ]
Lauret, Jerome [2 ]
Poat, Michael [2 ]
Porter, Jefferson [1 ]
机构
[1] Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
[2] Brookhaven Natl Lab, Upton, NY USA
关键词
D O I
10.1088/1742-6596/2438/1/012022
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The Solenoidal Tracker at RHIC (STAR) is a multipurpose experiment at the Relativistic Heavy Ion Collider (RHIC) with the primary goal to study the formation and properties of the quark-gluon plasma. STAR is an international collaboration of member institutions and laboratories from around the world. Yearly data-taking period produces PBytes of raw data collected by the experiment. STAR primarily uses its dedicated facility at BNL to process this data, but has routinely leveraged distributed systems, both high throughput (HTC) and high performance (HPC) computing clusters, to significantly augment the processing capacity available to the experiment. The ability to automate the efficient transfer of large data sets on reliable, scalable, and secure infrastructure is critical for any large-scale distributed processing campaign. For more than a decade, STAR computing has relied upon GridFTP with its x509-based authentication to build such data transfer systems and integrate them into its larger production workflow. The end of support by the community for both GridFTP and the x509 standard requires STAR to investigate other approaches to meet its distributed processing needs. In this study we investigate two multi-purpose data distribution systems, Globus.org and XRootD, as alternatives to GridFTP. We compare both their performance and the ease by which each service is integrated into the type of secure and automated data transfer systems STAR has previously built using GridFTP. The presented approach and study may be applicable to other distributed data processing use cases beyond STAR.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Management of grid jobs and data within SAMGrid
    Baranovski, A
    Garzoglio, G
    Terekhov, I
    Roy, A
    Tannenbaum, T
    2004 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, 2004, : 353 - 359
  • [2] Remote data access in computational jobs on the ATLAS data grid
    Begy, Volodimir
    Barisits, Martin
    Lassnig, Mario
    Schikuta, Erich
    23RD INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP 2018), 2019, 214
  • [3] GSWAP: A DATA EXCHANGING PARTITION FOR THE EXECUTION OF GRID JOBS
    Hu, Liang
    Lin, Lin
    Che, Xilong
    Li, Changwu
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (09): : 6271 - 6282
  • [4] RUN-TIME ADAPTATION OF GRID DATA PLACEMENT JOBS
    Kola, G.
    Kosar, T.
    Livny, M.
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2005, 6 (03): : 33 - 43
  • [5] Jobs masonry in LHCb with elastic Grid Jobs
    Stagni, F.
    Charpentier, Ph
    21ST INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP2015), PARTS 1-9, 2015, 664
  • [6] Research and Implementation of Data Transfer in Grid
    Zhao, Guanghui
    Wang, Chunli
    Liu, Dan
    Zou, Chengming
    2009 ETP INTERNATIONAL CONFERENCE ON FUTURE COMPUTER AND COMMUNICATION (FCC 2009), 2009, : 12 - 15
  • [7] Scheduling jobs on Grid processors
    Boyar, Joan
    Favrholdt, Lene M.
    ALGORITHM THEORY - SWAT 2006, PROCEEDINGS, 2006, 4059 : 17 - 28
  • [8] Scheduling Jobs on Grid Processors
    Boyar, Joan
    Favrholdt, Lene M.
    ALGORITHMICA, 2010, 57 (04) : 819 - 847
  • [9] Scheduling Jobs on Grid Processors
    Joan Boyar
    Lene M. Favrholdt
    Algorithmica, 2010, 57 : 819 - 847
  • [10] Backfilling a cloud with grid jobs
    Fayer, Simon
    Whitehouse, Dan
    26TH INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS, CHEP 2023, 2024, 295