Towards Integrating Workflow and Database Provenance

Authors
Chirigati, Fernando [1]
Freire, Juliana [1]
Affiliations
[1] NYU, Polytech Inst, Comp Sci & Engn Dept, New York, NY 10003 USA
Keywords
Workflow Provenance; Database Provenance; Reproducibility
DOI
Not available
Chinese Library Classification
TP [Automation and Computer Technology]
Discipline classification code
0812
Abstract
While there has been substantial work on both database and workflow provenance, the two problems have only been examined in isolation. It is widely accepted that the existing models are incompatible. Database provenance is fine-grained and captures changes to tuples in a database. In contrast, workflow provenance is represented at a coarser level and reflects the functional model of workflow systems, which is stateless: each computational step derives a new artifact. In this paper, we propose a new approach to combine database and workflow provenance. We address the mismatch between the two kinds of provenance by using a temporal model that explicitly represents the database states as updates are applied. We discuss how, under this model, reproducibility is obtained for workflows that manipulate databases, and how queries that straddle the two provenance traces can be evaluated. We also describe a proof-of-concept implementation that integrates a workflow system and a commercial relational database.
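The idea of a temporal model that keeps every database state as updates are applied can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not describe the system at this level, and all names here (`TemporalTable`, `StepRecord`, `run_step`, `replay`) are hypothetical. The sketch assumes a toy in-memory key-value table in place of a relational database, and shows how a workflow step that records the state version it read can later be replayed against that same state.

```python
from dataclasses import dataclass, field


@dataclass
class TemporalTable:
    """Toy temporal table (hypothetical): each update appends a new state."""
    # states[i] holds the full table content (key -> value) at version i.
    states: list = field(default_factory=lambda: [{}])

    @property
    def version(self) -> int:
        return len(self.states) - 1

    def update(self, key, value) -> int:
        """Apply an update and return the version it produced."""
        new_state = dict(self.states[-1])
        new_state[key] = value
        self.states.append(new_state)
        return self.version

    def as_of(self, version: int) -> dict:
        """Return a copy of the table as it was at the given version."""
        return dict(self.states[version])


@dataclass
class StepRecord:
    """Workflow-side provenance for one step, linked to database versions."""
    step: str
    read_version: int
    written_version: int


def run_step(table, trace, name, key, fn):
    """Run a step that reads the table, derives a value, and writes it back."""
    read_version = table.version
    value = fn(table.as_of(read_version))
    written_version = table.update(key, value)
    trace.append(StepRecord(name, read_version, written_version))
    return value


def replay(table, record, fn):
    """Re-run a recorded step against the state it originally read."""
    return fn(table.as_of(record.read_version))


def total(state):
    return sum(state.values())


table = TemporalTable()
trace = []
table.update("a", 2)                                   # produces version 1
first = run_step(table, trace, "sum", "total", total)  # reads v1, writes v2
table.update("a", 10)                                  # produces version 3

# Replaying against the recorded version reproduces the original result,
# even though the current state of the table has since changed.
assert replay(table, trace[0], total) == first
```

Storing a full copy of the table per version keeps the sketch short; a real temporal database would typically record row-level validity intervals instead.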
Pages: 11-23
Page count: 13