LabelFlow: Exploiting Workflow Provenance to Surface Scientific Data Provenance

Cited by: 5
Authors
Alper, Pinar [1]
Belhajjame, Khalid [2]
Goble, Carole A. [1]
Karagoz, Pinar [3]
Affiliations
[1] Univ Manchester, Sch Comp Sci, Manchester, Lancs, England
[2] Univ Paris 09, Paris, France
[3] Middle E Tech Univ, Dept Comp Engn, TR-06531 Ankara, Turkey
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
Provenance; Annotation; Scientific workflows; Semantic provenance; Web
DOI
10.1007/978-3-319-16462-5_7
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Provenance traces captured by scientific workflows can be useful for designing, debugging and maintenance. However, our experience suggests that they are of limited use for reporting results, in part because traces do not comprise the domain-specific annotations needed for explaining results, and in part because of the black-box nature of some workflow activities. We show that by basic mark-up of the data processing within activities and using a set of domain-specific label generation functions, standard workflow provenance can be utilised as a platform for the labelling of data artefacts. These labels can in turn aid selection of data subsets and proxy for data descriptors for shared datasets.
Pages: 84-96
Page count: 13
Related papers
  • [1] LabelFlow Framework for Annotating Workflow Provenance
    Alper, Pinar
    Belhajjame, Khalid
    Curcin, Vasa
    Goble, Carole A.
    INFORMATICS-BASEL, 2018, 5 (01):
  • [2] A survey of provenance in scientific workflow
    Lin, Songhai
    Xiao, Hong
    Jiang, Wenchao
    Li, Dafeng
    Liang, Jiaben
    Li, Zelin
    JOURNAL OF HIGH SPEED NETWORKS, 2023, 29 (02) : 129 - 145
  • [3] Enabling Data Recommendation in Scientific Workflow based on Provenance
    Huang, Xing
    Lu, Tun
    Ding, Xianghua
    Gu, Ning
    2013 8TH CHINAGRID ANNUAL CONFERENCE (CHINAGRID), 2013, : 117 - 122
  • [4] Scientific Workflow, Provenance, and Data Modeling Challenges and Approaches
    Bowers, Shawn
    JOURNAL ON DATA SEMANTICS, 2012, 1 (01) : 19 - 30
  • [5] Provenance Browser: Displaying and Querying Scientific Workflow Provenance Graphs
    Anand, Manish Kumar
    Bowers, Shawn
    Ludaescher, Bertram
    26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, : 1201 - 1204
  • [6] Provenance-based Scientific Workflow Search
    Abu Jabal, Amani
    Bertino, Elisa
    de Mel, Geeth
    2017 IEEE 13TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE), 2017, : 119 - 127
  • [7] A scientific workflow framework integrated with object deputy model for data provenance
    Wang, Liwei
    Peng, Zhiyong
    Luo, Min
    Ji, Wenhao
    Huang, Zeqian
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2006, 4016 : 569 - 580
  • [8] Workflow provenance in the lifecycle of scientific machine learning
    Souza, Renan
    Azevedo, Leonardo G.
    Lourenco, Vitor
    Soares, Elton
    Thiago, Raphael
    Brandao, Rafael
    Civitarese, Daniel
    Brazil, Emilio Vital
    Moreno, Marcio
    Valduriez, Patrick
    Mattoso, Marta
    Cerqueira, Renato
    Netto, Marco A. S.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (14):
  • [9] Challenges of Provenance in Scientific Workflow Management Systems
    Alam, Khairul
    Roy, Banani
    2022 IEEE/ACM WORKSHOP ON WORKFLOWS IN SUPPORT OF LARGE-SCALE SCIENCE, WORKS, 2022, : 10 - 18
  • [10] Mechanisms for provenance collection in scientific workflow systems
    Mehdi Sarikhani
    Andrew Wendelborn
    Computing, 2018, 100 : 439 - 472