End-to-End Analysis Automation over Distributed Resources with Luigi Analysis Workflows

Cited by: 0
Author
Rieger, Marcel [1]
Affiliation
[1] Hamburg Univ, Inst Expt Phys, Luruper Chaussee 149, D-22761 Hamburg, Germany
Keywords
DOI
10.1051/epjconf/202429505012
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
In particle physics, workflow management systems are primarily used as tailored solutions in dedicated areas such as Monte Carlo production. However, physicists performing data analyses usually have to steer their individual, complex workflows manually, which frequently involves submitting jobs in several stages and interacting with distributed storage systems by hand. This process is not only time-consuming and error-prone, but also leads to undocumented relations between particular workloads, rendering the steering of an analysis a serious challenge. This article presents the Luigi Analysis Workflow (Law) Python package, which is based on the open-source pipelining tool Luigi, originally developed by Spotify. It establishes a generic design pattern for analyses of arbitrary scale and complexity, and shifts the focus from executing to defining the analysis logic. Law provides the building blocks to integrate seamlessly with interchangeable remote resources without limiting itself to a specific choice of infrastructure. In particular, it introduces the concept of complete separation between analysis algorithms on the one hand, and run locations, storage locations, and software environments on the other. To cope with the sophisticated demands of end-to-end HEP analyses, Law supports job execution on WLCG infrastructure (ARC, gLite, CMS-CRAB) as well as on local computing clusters (HTCondor, Slurm, LSF), remote file access via various protocols using the Grid File Access Library (GFAL2), and an environment sandboxing mechanism with support for sub-shells and virtual environments, as well as Docker and Singularity containers. Moreover, the novel approach ultimately aims for analysis preservation out of the box. Law is developed as open source, independently of any experiment and of the language of the executed code, and its user base has grown steadily over the past years.
Pages: 8
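The design pattern summarized in the abstract (tasks that declare their requirements and outputs, with the analysis logic kept apart from where results are stored) can be illustrated with a minimal sketch. This is not the Law or Luigi API: the classes `LocalTarget`, `Task`, `SelectEvents`, and `CountEvents`, the `build` function, and the toy event numbers are all hypothetical names and data chosen for illustration, using only the Python standard library.

```python
import os
import tempfile


class LocalTarget:
    """Hypothetical target: a local file standing in for any storage location."""

    def __init__(self, path):
        self.path = path

    def exists(self):
        return os.path.exists(self.path)

    def write(self, text):
        with open(self.path, "w") as f:
            f.write(text)

    def read(self):
        with open(self.path) as f:
            return f.read()


class Task:
    """Hypothetical base class: subclasses declare requirements, output, logic."""

    def __init__(self, workdir):
        self.workdir = workdir

    def requires(self):
        return []

    def output(self):
        raise NotImplementedError

    def run(self):
        raise NotImplementedError

    def complete(self):
        # A task counts as done once its output target exists.
        return self.output().exists()


class SelectEvents(Task):
    def output(self):
        return LocalTarget(os.path.join(self.workdir, "selected.txt"))

    def run(self):
        events = [3, 8, 15, 4, 23]  # toy data, purely illustrative
        selected = [e for e in events if e > 5]
        self.output().write(",".join(map(str, selected)))


class CountEvents(Task):
    def requires(self):
        return [SelectEvents(self.workdir)]

    def output(self):
        return LocalTarget(os.path.join(self.workdir, "count.txt"))

    def run(self):
        selected = self.requires()[0].output().read().split(",")
        self.output().write(str(len(selected)))


def build(task, executed=None):
    """Run requirements first; skip any task whose output already exists."""
    executed = [] if executed is None else executed
    if task.complete():
        return executed
    for dep in task.requires():
        build(dep, executed)
    task.run()
    executed.append(type(task).__name__)
    return executed


def demo():
    with tempfile.TemporaryDirectory() as workdir:
        first = build(CountEvents(workdir))   # runs both tasks in order
        second = build(CountEvents(workdir))  # nothing left to do
        count = CountEvents(workdir).output().read()
    return first, second, count
```

Because the task bodies only talk to the target object, replacing `LocalTarget` with a class that reads and writes a remote store would, in this sketch, leave `SelectEvents` and `CountEvents` unchanged; that is the kind of separation between algorithm and storage location the abstract describes.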
Related papers
50 in total
  • [21] Data-Driven End-to-End Lighting Automation Based on Human Residential Trajectory Analysis
    Zhu, Jack
    Tan, Jingwen
    Wu, Wencen
    2024 INTERNATIONAL CONFERENCE ON SMART APPLICATIONS, COMMUNICATIONS AND NETWORKING, SMARTNETS-2024, 2024,
  • [22] End-to-end delay analysis of videoconferencing over packet-switched networks
    Baldi, M
    Ofek, Y
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2000, 8 (04) : 479 - 492
  • [23] On a Multi-objective Evolutionary Algorithm for Optimizing End-to-end Performance of Scientific Workflows in Distributed Environments
    Gu, Yi
    Shenq, Shwu-Ling
    Wu, Qishi
    Dasgupta, Dipankar
    45TH ANNUAL SIMULATION SYMPOSIUM 2012 (ANSS 2012), 2012, 44 (02): 69 - 77
  • [24] Distributed End-to-End testing management
    Bai, XY
    Tsai, WT
    Paul, R
    Shen, TC
    Li, B
    FIFTH IEEE INTERNATIONAL ENTERPRISE DISTRIBUTED OBJECT COMPUTING CONFERENCE, PROCEEDINGS, 2001, : 140 - 151
  • [25] End-to-End Timing Analysis of Sporadic Cause-Effect Chains in Distributed Systems
    Duerr, Marco
    Von Der Brueggen, Georg
    Chen, Kuan-Hsun
    Chen, Jian-Jia
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2019, 18 (05)
  • [26] End-to-end Distributed Video Coding
    Zhou, Junwei
    Lv, Ting
    Yi, XiangBo
    DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 496 - 496
  • [27] IMF end-to-end workflows in media asset management systems
    Fernandez-Campon J.
    2018, Society of Motion Picture and Television Engineers (127): 44 - 52
  • [28] End-to-end delay analysis for networked systems
    Shen, Jie
    He, Wen-bo
    Liu, Xue
    Wang, Zhi-bo
    Wang, Zhi
    Yao, Jian-guo
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2015, 16 (09) : 732 - 743
  • [29] An end-to-end administrative document analysis system
    Hamza, Hatem
    Belaid, Yolande
    Belaid, Abdel
    Chaudhuri, Bidyut B.
    PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, : 175 - 182
  • [30] Analysis of end-to-end QoS control on the Internet
    Bian, Yucun
    Gu, Junzhong
    Jisuanji Gongcheng/Computer Engineering, 2001, 27 (11):