Reproducibility Analysis of Scientific Workflows

被引:4
|
作者
Banati, Anna [3 ]
Kacsuk, Peter [1 ,2 ]
Kozlovszky, Miklos [1 ,3 ]
机构
[1] MTA SZTAKI, Pf 63, H-1518 Budapest, Hungary
[2] Univ Westminster, 115 New Cavendish St, London W1W 6UW, England
[3] Obuda Univ, John von Neumann Fac Informat, Becsi Ut 96-B, H-1034 Budapest, Hungary
关键词
scientific workflows; reproducibility; analytical model; provenance; evaluation; gUSE;
D O I
10.12700/APH.14.2.2017.2.11
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Scientific workflows are efficient tools for specifying and automating compute and data intensive in-silico experiments. An important challenge related to their usage is their reproducibility. In order to make it reproducible, many factors have to be investigated which can influence and even prevent this process: the missing descriptions and samples; the missing provenance data about the environmental parameters and the data dependencies; the dependencies of executions which are based on special hardware, changing or volatile third party services or random generated values. Some of these factors (called dependencies) can be eliminated by careful design or by huge resource usage but most of them cannot be bypassed. Our investigation deals with the critical dependencies of execution. In this paper we set up a mathematical model to evaluate the results of the workflow in addition we provide a mechanism to make the workflow reproducible based on provenance data and statistical tools.
引用
收藏
页码:201 / 217
页数:17
相关论文
共 50 条
  • [21] Scientific workflows
    Anna-Lena Lamprecht
    Kenneth J. Turner
    International Journal on Software Tools for Technology Transfer, 2016, 18 : 575 - 580
  • [22] Overhead analysis of scientific workflows in grid environments
    Prodan, Radu
    Fahringer, Thomas
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2008, 19 (03) : 378 - 393
  • [23] Common Motifs in Scientific Workflows: An Empirical Analysis
    Garijo, Daniel
    Alper, Pinar
    Belhajjame, Khalid
    Corcho, Oscar
    Gil, Yolanda
    Goble, Carole
    2012 IEEE 8TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE), 2012,
  • [24] Network Analysis of Scientific Workflows: A Gateway to Reuse
    Tan, Wei
    Zhang, Jia
    Foster, Ian
    COMPUTER, 2010, 43 (09) : 54 - 61
  • [25] Common motifs in scientific workflows: An empirical analysis
    Garijo, Daniel
    Alper, Pinar
    Belhajjame, Khalid
    Corcho, Oscar
    Gil, Yolanda
    Goble, Carole
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 36 : 338 - 351
  • [26] Bridging the Gaps in Meta-Omic Analysis: Workflows and Reproducibility
    Cavalcante, Joao Vitor Ferreira
    de Souza, Iara Dantas
    Morais, Diego Arthur de Azevedo
    Dalmolin, Rodrigo Juliani Siqueira
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2023, 27 (12) : 547 - 549
  • [27] A Semantic-Based Approach to Attain Reproducibility of Computational Environments in Scientific Workflows: A Case Study
    Santana-Perez, Idafen
    da Silva, Rafael Ferreira
    Rynge, Mats
    Deelman, Ewa
    Perez-Hernandez, Maria S.
    Corcho, Oscar
    EURO-PAR 2014: PARALLEL PROCESSING WORKSHOPS, PT I, 2014, 8805 : 452 - 463
  • [28] Online performance monitoring and analysis of grid scientific workflows
    Truong, HL
    Fahringer, T
    ADVANCES IN GRID COMPUTING - EGC 2005, 2005, 3470 : 1154 - 1164
  • [29] Scalable Composition and Analysis Techniques for Massive Scientific Workflows
    Ahn, Dong H.
    Zhang, Xiaohua
    Mast, Jeffrey
    Herbein, Stephen
    Di Natale, Francesco
    Kirshner, Dan
    Jacobs, Sam Ade
    Karlin, Ian
    Milroy, Daniel J.
    De Supinski, Bronis
    Van Essen, Brian
    Allen, Jonathan
    Lightstone, Felice C.
    2022 IEEE 18TH INTERNATIONAL CONFERENCE ON E-SCIENCE (ESCIENCE 2022), 2022, : 32 - 43
  • [30] OpenAlea: Scientific Workflows Combining Data Analysis and Simulation
    Pradal, Christophe
    Fournier, Christian
    Valduriez, Patrick
    Cohen-Boulakia, Sarah
    PROCEEDINGS OF THE 27TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, 2015,