Lineage retrieval for scientific data processing: A survey

被引:164
|
作者
Bose, R [1 ]
Frew, J [1 ]
机构
[1] Univ Calif Santa Barbara, Bren Sch Environm Sci & Management, Santa Barbara, CA 93106 USA
关键词
design; documentation; experimentation; management; data lineage; data provenance; scientific data; scientific workflow; audit;
D O I
10.1145/1057977.1057978
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Scientific research relies as much on the dissemination and exchange of data sets as on the publication of conclusions. Accurately tracking the lineage (origin and subsequent processing history) of scientific data sets is thus imperative for the complete documentation of scientific work. Researchers are effectively prevented from determining, preserving, or providing the lineage of the computational data products they use and create, however, because of the lack of a definitive model for lineage retrieval and a poor fit between current data management tools and scientific software. Based on a comprehensive survey of lineage research and previous prototypes, we present a metamodel to help identify and assess the basic components of systems that provide lineage retrieval for scientific data products.
引用
收藏
页码:1 / 28
页数:28
相关论文
共 50 条
  • [1] Astro-WISE: Tracing and Using Lineage for Scientific Data Processing
    Mwebaze, Johnson
    Boxhoorn, Danny
    Valentijn, Edwin
    [J]. 2009 INTERNATIONAL CONFERENCE ON NETWORK-BASED INFORMATION SYSTEMS, 2009, : 475 - 480
  • [2] RETRIEVAL OF SCIENTIFIC AND TECHNICAL DATA
    ABELSON, PH
    [J]. SCIENCE, 1989, 245 (4913) : 9 - 9
  • [3] RETRIEVAL AND ANALYSIS OF SCIENTIFIC AND NON-SCIENTIFIC DATA
    CHURCHOU.RF
    [J]. COMPUTER BULLETIN, 1971, 15 (03): : 102 - &
  • [4] Configurable distributed retrieval of scientific data
    Silva, DM
    Schwan, K
    Eisenhauer, G
    [J]. FOURTH INTERNATIONAL CONFERENCE ON CONFIGURABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 1998, : 120 - 127
  • [5] MEASURING DATA RETRIEVAL AND PROCESSING
    PETERS, P
    [J]. ZUCKER, 1970, 23 (08): : 220 - &
  • [6] PROCESSING, STORAGE AND RETRIEVAL OF DATA
    EHRENGRUBER, H
    [J]. BULLETIN DER SCHWEIZERISCHEN AKADEMIE DER MEDIZINISCHEN WISSENSCHAFTEN, 1972, 28 (3-4): : 195 - +
  • [7] Data processing on scientific visualization
    Bose, SK
    [J]. SOLID STATE PHYSICS, VOL 41, 1998, 1999, : 88 - 90
  • [8] A conceptual framework for composing and managing scientific data lineage
    Bose, R
    [J]. 14TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2002, : 15 - 19
  • [9] A service based interface for scientific data retrieval
    Bluhm, Torsten
    Jacob, Sven
    Werner, Andreas
    Heimann, Peter
    Hennig, Christine
    Kuehner, Georg
    Kroiss, Hugo
    Laqua, Heike
    Lewerentz, Marc
    Maier, Josef
    Riemann, Heike
    Schacht, Joerg
    Spring, Anett
    Zilker, Manfred
    [J]. FUSION ENGINEERING AND DESIGN, 2010, 85 (3-4) : 579 - 582
  • [10] The Cognitive Enhancement Process of Scientific Data Retrieval
    Liu, Jianping
    Wang, Jian
    Zhou, Guomin
    Zhang, Guilan
    Cui, Yunpeng
    Liu, Juan
    [J]. PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2019), 2019,