LabelFlow: Exploiting Workflow Provenance to Surface Scientific Data Provenance

被引:5
|
作者
Alper, Pinar [1 ]
Belhajjame, Khalid [2 ]
Goble, Carole A. [1 ]
Karagoz, Pinar [3 ]
机构
[1] Univ Manchester, Sch Comp Sci, Manchester, Lancs, England
[2] Univ Paris 09, Paris, France
[3] Middle E Tech Univ, Dept Comp Engn, TR-06531 Ankara, Turkey
基金
英国工程与自然科学研究理事会;
关键词
Provenance; Annotation; Scientific workflows; SEMANTIC PROVENANCE; WEB;
D O I
10.1007/978-3-319-16462-5_7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Provenance traces captured by scientific workflows can be useful for designing, debugging and maintenance. However, our experience suggests that they are of limited use for reporting results, in part because traces do not comprise domain-specific annotations needed for explaining results, and the black-box nature of some workflow activities. We show that by basic mark-up of the data processing within activities and using a set of domain specific label generation functions, standard workflow provenance can be utilised as a platform for the labelling of data artefacts. These labels can in turn aid selection of data subsets and proxy for data descriptors for shared datasets.
引用
收藏
页码:84 / 96
页数:13
相关论文
共 50 条
  • [21] Database Support for Exploring Scientific Workflow Provenance Graphs
    Anand, Manish Kumar
    Bowers, Shawn
    Ludascher, Bertram
    SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, SSDBM 2012, 2012, 7338 : 343 - 360
  • [22] OPQL: Querying scientific workflow provenance at the graph level
    Lim, Chunhyeok
    Lu, Shiyong
    Chebotko, Artem
    Fotouhi, Farshad
    Kashlev, Andrey
    DATA & KNOWLEDGE ENGINEERING, 2013, 88 : 37 - 59
  • [23] Secure Abstraction Views for Scientific Workflow Provenance Querying
    Chebotko, Artem
    Lu, Shiyong
    Chang, Seunghan
    Fotouhi, Farshad
    Yang, Ping
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2010, 3 (04) : 322 - 337
  • [24] Towards a Taxonomy of Provenance in Scientific Workflow Management Systems
    Serra da Cruz, Sergio Manuel
    Campos, Maria Luiza M.
    Mattoso, Marta
    2009 IEEE CONGRESS ON SERVICES (SERVICES-1 2009), VOLS 1 AND 2, 2009, : 259 - +
  • [25] Trustworthy Provenance Framework for Document Workflow Provenance
    Rupasinghe, P. L.
    Weerasena, H. H.
    Murray, I.
    2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL TECHNIQUES IN INFORMATION AND COMMUNICATION TECHNOLOGIES (ICCTICT), 2016,
  • [26] Abstract Provenance Graphs: Anticipating and Exploiting Schema-Level Data Provenance
    Zinn, Daniel
    Ludaescher, Bertram
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, 2010, 6378 : 206 - 215
  • [27] Connecting scientific data to scientific experiments with provenance
    Miles, Simon
    Deelman, Ewa
    Groth, Paul
    Vahi, Karan
    Mehta, Gaurang
    Moreau, Luc
    E-SCIENCE 2007: THIRD IEEE INTERNATIONAL CONFERENCE ON E-SCIENCE AND GRID COMPUTING, PROCEEDINGS, 2007, : 179 - +
  • [28] Provenance aware workflow for data quality management and improvement for large continuous scientific data streams
    Kumar, Jitendra
    Crow, Michael C.
    Devarakonda, Ranjeet
    Giansiracusa, Michael
    Guntupally, Kavya
    Olatt, Joseph V.
    Price, Zach
    Shanafield, Harold A., III
    Singh, Alka
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 3260 - 3266
  • [29] A workflow modeling system for capturing data provenance
    Joglekar, Girish S.
    Giridhar, Arun
    Reklaitis, Gintaras
    COMPUTERS & CHEMICAL ENGINEERING, 2014, 67 : 148 - 158
  • [30] Provenance and data differencing for workflow reproducibility analysis
    Missier, Paolo
    Woodman, Simon
    Hiden, Hugo
    Watson, Paul
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (04): : 995 - 1015