Tracking provenance of earth science data

被引:15
|
作者
Tilmes, Curt [1 ]
Yesha, Yelena [2 ]
Halem, Milton [2 ]
机构
[1] NASA, Goddard Space Flight Ctr, Greenbelt, MD 20771 USA
[2] Univ Maryland, Baltimore, MD 21250 USA
基金
美国国家科学基金会;
关键词
Data processing; Provenance;
D O I
10.1007/s12145-010-0046-3
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Tremendous volumes of data have been captured, archived and analyzed. Sensors, algorithms and processing systems for transforming and analyzing the data are evolving over time. Web Portals and Services can create transient data sets on-demand. Data are transferred from organization to organization with additional transformations at every stage. Provenance in this context refers to the source of data and a record of the process that led to its current state. It encompasses the documentation of a variety of artifacts related to particular data. Provenance is important for understanding and using scientific datasets, and critical for independent confirmation of scientific results. Managing provenance throughout scientific data processing has gained interest lately and there are a variety of approaches. Large scale scientific datasets consisting of thousands to millions of individual data files and processes offer particular challenges. This paper uses the analogy of art history provenance to explore some of the concerns of applying provenance tracking to earth science data. It also illustrates some of the provenance issues with examples drawn from the Ozone Monitoring Instrument (OMI) Data Processing System (OMIDAPS) (Tilmes et al. 2004) run at NASA's Goddard Space Flight Center by the first author.
引用
收藏
页码:59 / 65
页数:7
相关论文
共 50 条
  • [1] Tracking provenance of earth science data
    Curt Tilmes
    Yelena Yesha
    Milton Halem
    [J]. Earth Science Informatics, 2010, 3 : 59 - 65
  • [2] Provenance Tracking in an Earth Science Data Processing System
    Tilmes, Curt
    Fleig, Albert J.
    [J]. PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, 2008, 5272 : 221 - +
  • [3] Distinguishing Provenance Equivalence of Earth Science Data
    Tilmes, C.
    Yesha, Ye.
    Halem, M.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS), 2011, 4 : 548 - 557
  • [4] A mathematical framework for earth science data provenance tracing
    Barkstrom, Bruce R.
    [J]. EARTH SCIENCE INFORMATICS, 2010, 3 (03) : 167 - 196
  • [5] A mathematical framework for earth science data provenance tracing
    Bruce R. Barkstrom
    [J]. Earth Science Informatics, 2010, 3 : 167 - 196
  • [6] Vamsa: Automated Provenance Tracking in Data Science Scripts
    Namaki, Mohammad Hossein
    Floratou, Avrilia
    Psallidas, Fotis
    Krishnan, Subru
    Agrawal, Ashvin
    Wu, Yinghui
    Zhu, Yiwen
    Weimer, Markus
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1542 - 1551
  • [7] Tracking and Establishing Provenance of Earth Science Datasets: A NASA-Based Example
    Ramapriyan, Hampapuram K.
    Goldstein, Justin C.
    Hua, Hook
    Wolfe, Robert E.
    [J]. Provenance and Annotation of Data and Processes, IPAW 2016, 2016, 9672 : 226 - 229
  • [8] Data Provenance Tracking for Concurrent Programs
    Lucia, Brandon
    Ceze, Luis
    [J]. 2015 IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO), 2015, : 146 - 156
  • [9] Tracking provenance in a virtual data grid
    Clifford, Ben
    Foster, Ian
    Voeckler, Jens-S.
    Wilder, Michael
    Zhao, Yong
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2008, 20 (05): : 565 - 575
  • [10] A Provenance Tracking Model for Data Updates
    Ciobanu, Gabriel
    Horne, Ross
    [J]. ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2012, (91): : 31 - 44