Provenance Analysis: Towards Quality Provenance

被引:0
|
作者
Cheah, You-Wei [1 ]
Plale, Beth [1 ]
机构
[1] Indiana Univ, Sch Informat & Comp, Bloomington, IN 47405 USA
关键词
Data Provenance; Provenance Quality; Scientific Workflows; Provenance Analysis;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Data provenance, a key piece of metadata that describes the lifecycle of a data product, is crucial in aiding scientists to better understand and facilitate reproducibility and reuse of scientific results. Provenance collection systems often capture provenance on the fly and the protocol between application and provenance tool may not be reliable. As a result, data provenance can become ambiguous or simply inaccurate. In this paper, we identify likely quality issues in data provenance. We also establish crucial quality dimensions that are especially critical for the evaluation of provenance quality. We analyze synthetic and real-world provenance based on these quality dimensions and summarize our contributions to provenance quality.
引用
下载
收藏
页数:8
相关论文
共 50 条
  • [1] Towards Unified Provenance Granularities
    Lebo, Timothy
    Wang, Ping
    Graves, Alvaro
    McGuinness, Deborah L.
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, IPAW 2012, 2012, 7525 : 39 - 51
  • [2] Provenance-Aware Entity Resolution: Leveraging Provenance to Improve Quality
    Wang, Qing
    Schewe, Klaus-Dieter
    Wang, Woods
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT1, 2015, 9049 : 474 - 490
  • [3] Exploratory Analysis of Provenance Data Using R and the Provenance Package
    Vermeesch, Pieter
    MINERALS, 2019, 9 (03)
  • [4] Towards Integrating Workflow and Database Provenance
    Chirigati, Fernando
    Freire, Juliana
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, IPAW 2012, 2012, 7525 : 11 - 23
  • [5] Towards Provenance-Enabling ParaView
    Callahan, Steven P.
    Freire, Juliana
    Scheidegger, Carlos E.
    Silva, Claudio T.
    Vo, Huy T.
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, 2008, 5272 : 120 - 127
  • [6] Provenance as dependency analysis
    Cheney, James
    Ahmed, Amal
    Acar, Umut A.
    MATHEMATICAL STRUCTURES IN COMPUTER SCIENCE, 2011, 21 (06) : 1301 - 1337
  • [7] From Scripts Towards Provenance Inference
    Huq, Mohammad Rezwanul
    Apers, Peter M. G.
    Wombacher, Andreas
    Wada, Yoshihide
    van Beek, Ludovicus P. H.
    2012 IEEE 8TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE), 2012,
  • [8] Towards Provenance and Traceability in CRISTAL for HEP
    Shamdasani, Jetendr
    Branson, Andrew
    McClatchey, Richard
    20TH INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP2013), PARTS 1-6, 2014, 513
  • [9] Towards Secure Provenance in the Cloud: A Survey
    Lee, Brian
    Awad, Abir
    Awad, Mirna
    2015 IEEE/ACM 8TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2015, : 577 - 582
  • [10] Provenance as dependency analysis
    Cheney, James
    Ahmed, Amal
    Acar, Umut A.
    DATABASE PROGRAMMING LANGUAGES, 2007, 4797 : 138 - +