LabelFlow: Exploiting Workflow Provenance to Surface Scientific Data Provenance

被引:5
|
作者
Alper, Pinar [1 ]
Belhajjame, Khalid [2 ]
Goble, Carole A. [1 ]
Karagoz, Pinar [3 ]
机构
[1] Univ Manchester, Sch Comp Sci, Manchester, Lancs, England
[2] Univ Paris 09, Paris, France
[3] Middle E Tech Univ, Dept Comp Engn, TR-06531 Ankara, Turkey
基金
英国工程与自然科学研究理事会;
关键词
Provenance; Annotation; Scientific workflows; SEMANTIC PROVENANCE; WEB;
D O I
10.1007/978-3-319-16462-5_7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Provenance traces captured by scientific workflows can be useful for designing, debugging and maintenance. However, our experience suggests that they are of limited use for reporting results, in part because traces do not comprise domain-specific annotations needed for explaining results, and the black-box nature of some workflow activities. We show that by basic mark-up of the data processing within activities and using a set of domain specific label generation functions, standard workflow provenance can be utilised as a platform for the labelling of data artefacts. These labels can in turn aid selection of data subsets and proxy for data descriptors for shared datasets.
引用
收藏
页码:84 / 96
页数:13
相关论文
共 50 条
  • [31] Temporal Representation for Scientific Data Provenance
    Chen, Peng
    Plale, Beth
    Aktas, Mehmet S.
    2012 IEEE 8TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE), 2012,
  • [32] Provenance and credibility in scientific data repositories
    Fear, Kathleen
    Donaldson, Devan Ray
    ARCHIVAL SCIENCE, 2012, 12 (03) : 319 - 339
  • [33] A Tool for Scientific Provenance of Data and Software
    Ceguerra, Anna V.
    Liddicoat, Peter V.
    Ringer, Simon P.
    Goscinski, Wojtek J.
    Androulakis, Steve
    2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013), 2013, : 561 - 565
  • [34] The Provenance of Workflow Upgrades
    Koop, David
    Scheidegger, Carlos E.
    Freire, Juliana
    Silva, Claudio T.
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, 2010, 6378 : 2 - +
  • [35] Facilitating Asynchronous Collaboration in Scientific Workflow Composition Using Provenance
    Abediniala M.
    Roy B.
    Proc. ACM Hum. Comput. Interact., 2022, EICS
  • [36] Modeling and Querying Scientific Workflow Provenance in the D-OPM
    Cuevas-Vicenttin, Victor
    Dey, Saumen
    Wang, Michael Li Yuan
    Song, Tianhong
    Ludaescher, Bertram
    2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 119 - 128
  • [37] Scientific Workflow Repeatability through Cloud-Aware Provenance
    Hasham, Khawar
    Munir, Kamran
    Shamdasani, Jetendr
    McClatchey, Richard
    2014 IEEE/ACM 7TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2014, : 951 - 956
  • [38] Quality Analysis for Scientific Workflow Provenance Access Control Policies
    Bhuyan, Fahima Amin
    Lu, Shiyong
    Reynolds, Robert
    Ahmed, Ishtiaq
    Zhang, Jia
    2018 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (IEEE SCC 2018), 2018, : 261 - 264
  • [39] A Security Framework for Scientific Workflow Provenance Access Control Policies
    Bhuyan, Fahima Amin
    Lu, Shiyong
    Reynolds, Robert
    Zhang, Jia
    Ahmed, Ishtiaq
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (01) : 97 - 109
  • [40] Storing and querying scientific workflow provenance metadata using an RDBMS
    Chebotko, Artem
    Fei, Xubo
    Lin, Cui
    Lu, Shiyong
    Fotouhi, Farshad
    E-SCIENCE 2007: THIRD IEEE INTERNATIONAL CONFERENCE ON E-SCIENCE AND GRID COMPUTING, PROCEEDINGS, 2007, : 611 - 618