Towards Integrating Workflow and Database Provenance

Authors
Chirigati, Fernando [1]
Freire, Juliana [1]
Affiliations
[1] NYU, Polytech Inst, Comp Sci & Engn Dept, New York, NY 10003 USA
Keywords
Workflow Provenance; Database Provenance; Reproducibility
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
While there has been substantial work on both database and workflow provenance, the two problems have only been examined in isolation. It is widely accepted that the existing models are incompatible. Database provenance is fine-grained and captures changes to tuples in a database. In contrast, workflow provenance is represented at a coarser level and reflects the functional model of workflow systems, which is stateless: each computational step derives a new artifact. In this paper, we propose a new approach to combine database and workflow provenance. We address the mismatch between the two kinds of provenance by using a temporal model that explicitly represents the database states as updates are applied. We discuss how, under this model, reproducibility is obtained for workflows that manipulate databases, and how queries that straddle the two provenance traces can be evaluated. We also describe a proof-of-concept implementation that integrates a workflow system and a commercial relational database.
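The abstract does not show the model itself, so the following is only a hypothetical sketch of the general idea it describes: each update produces a new, numbered database state instead of overwriting tuples, and each workflow step records which state it read and which it produced. The names `VersionedTable`, `WorkflowTrace`, and `producer` are illustrative assumptions and are not taken from the paper or its implementation.

```python
from dataclasses import dataclass, field


@dataclass
class VersionedTable:
    """Hypothetical append-only table: every update yields a new state id."""
    # Each row version is (key, value, valid_from, valid_to);
    # valid_to is None while the version is still current.
    rows: list = field(default_factory=list)
    state: int = 0  # id of the latest database state

    def apply_update(self, changes: dict) -> int:
        """Apply {key: new_value} and return the id of the resulting state."""
        self.state += 1
        for key, value in changes.items():
            for i, (k, v, start, end) in enumerate(self.rows):
                if k == key and end is None:
                    # Close the old version rather than overwriting it.
                    self.rows[i] = (k, v, start, self.state)
            self.rows.append((key, value, self.state, None))
        return self.state

    def snapshot(self, state: int) -> dict:
        """Reconstruct the table contents as of a given state."""
        return {k: v for (k, v, start, end) in self.rows
                if start <= state and (end is None or state < end)}


@dataclass
class WorkflowTrace:
    """Hypothetical coarse-grained trace: (module, state_in, state_out)."""
    steps: list = field(default_factory=list)

    def record(self, module: str, state_in: int, state_out: int) -> None:
        self.steps.append((module, state_in, state_out))


def producer(table, trace, key, state):
    """Query across both traces: which step wrote the version of `key`
    that is visible in `state`? Returns None if there is no such step."""
    for (k, v, start, end) in table.rows:
        if k == key and start <= state and (end is None or state < end):
            for module, s_in, s_out in trace.steps:
                if s_out == start:
                    return module
    return None


# Two workflow steps, each moving the database to a new state.
table, trace = VersionedTable(), WorkflowTrace()
s1 = table.apply_update({"a": 1, "b": 2})
trace.record("load", 0, s1)
s2 = table.apply_update({"a": 10})
trace.record("clean", s1, s2)
```

Because old row versions are closed rather than deleted, `table.snapshot(s1)` still reconstructs the input that the `clean` step saw. Under these assumptions, that preserved input is what makes re-running a database-manipulating step reproducible.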
Pages: 11-23
Number of pages: 13