End-to-End Analysis Automation over Distributed Resources with Luigi Analysis Workflows

Author
Rieger, Marcel [1]
Affiliation
[1] Hamburg Univ, Inst Expt Phys, Luruper Chaussee 149, D-22761 Hamburg, Germany
DOI
10.1051/epjconf/202429505012
Chinese Library Classification
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
In particle physics, workflow management systems are primarily used as tailored solutions in dedicated areas such as Monte Carlo production. However, physicists performing data analyses are usually required to steer their individual, complex workflows manually, frequently involving job submission in several stages and interaction with distributed storage systems by hand. This process is not only time-consuming and error-prone, but also leads to undocumented relations between particular workloads, rendering the steering of an analysis a serious challenge. This article presents the Luigi Analysis Workflow (Law) Python package, which is based on the open-source pipelining tool Luigi, originally developed by Spotify. It establishes a generic design pattern for analyses of arbitrary scale and complexity, and shifts the focus from executing to defining the analysis logic. Law provides the building blocks to seamlessly integrate with interchangeable remote resources without, however, limiting itself to a specific choice of infrastructure. In particular, it introduces the concept of complete separation between analysis algorithms on the one hand, and run locations, storage locations, and software environments on the other hand. To cope with the sophisticated demands of end-to-end HEP analyses, Law supports job execution on WLCG infrastructure (ARC, gLite, CMS-CRAB) as well as on local computing clusters (HTCondor, Slurm, LSF), remote file access via various protocols using the Grid File Access Library (GFAL2), and an environment sandboxing mechanism with support for sub-shells and virtual environments, as well as Docker and Singularity containers. Moreover, the novel approach ultimately aims for analysis preservation out of the box. Law is developed as open-source software, independent of any experiment and of the language of the executed code, and its user base has grown steadily over the past years.
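The design pattern described in the abstract, in which tasks declare their dependencies and outputs while the storage location is supplied from outside, can be illustrated with a minimal standard-library sketch. This is not Law's or Luigi's API: the classes `Task`, `Select`, and `Summarize` and the functions `build` and `demo` are hypothetical names chosen for illustration, and only the method names `requires`, `output`, and `run` mirror Luigi's task interface.

```python
import tempfile
from pathlib import Path


class Task:
    """Hypothetical base class: a unit of work with declared inputs and one output."""

    def __init__(self, store: Path):
        # The storage location is injected, so the algorithm never hard-codes it.
        self.store = store

    def requires(self):
        return []

    def output(self) -> Path:
        raise NotImplementedError

    def run(self):
        raise NotImplementedError

    def complete(self) -> bool:
        # A task counts as done once its output exists.
        return self.output().exists()


class Select(Task):
    def output(self):
        return self.store / "selected.txt"

    def run(self):
        events = [5, 12, 7, 30]  # toy stand-in for event data
        selected = [e for e in events if e > 6]
        self.output().write_text(" ".join(map(str, selected)))


class Summarize(Task):
    def requires(self):
        return [Select(self.store)]

    def output(self):
        return self.store / "summary.txt"

    def run(self):
        text = self.requires()[0].output().read_text()
        self.output().write_text(str(sum(int(v) for v in text.split())))


def build(task, log):
    """Run missing dependencies first, then the task itself; skip completed tasks."""
    if task.complete():
        return
    for dep in task.requires():
        build(dep, log)
    task.run()
    log.append(type(task).__name__)


def demo():
    with tempfile.TemporaryDirectory() as tmp:
        store = Path(tmp)
        first, second = [], []
        build(Summarize(store), first)
        build(Summarize(store), second)  # outputs already exist, nothing reruns
        return first, second, (store / "summary.txt").read_text()
```

Calling `demo()` runs `Select` before `Summarize` on the first build and runs nothing on the second, because completeness is derived from the existence of outputs. Passing a different `store` moves all outputs without touching the task logic, which is the kind of separation between algorithm and storage location the abstract refers to.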
Pages: 8