Tracking provenance of earth science data

被引:15
|
作者
Tilmes, Curt [1 ]
Yesha, Yelena [2 ]
Halem, Milton [2 ]
机构
[1] NASA, Goddard Space Flight Ctr, Greenbelt, MD 20771 USA
[2] Univ Maryland, Baltimore, MD 21250 USA
基金
美国国家科学基金会;
关键词
Data processing; Provenance;
D O I
10.1007/s12145-010-0046-3
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Tremendous volumes of data have been captured, archived and analyzed. Sensors, algorithms and processing systems for transforming and analyzing the data are evolving over time. Web Portals and Services can create transient data sets on-demand. Data are transferred from organization to organization with additional transformations at every stage. Provenance in this context refers to the source of data and a record of the process that led to its current state. It encompasses the documentation of a variety of artifacts related to particular data. Provenance is important for understanding and using scientific datasets, and critical for independent confirmation of scientific results. Managing provenance throughout scientific data processing has gained interest lately and there are a variety of approaches. Large scale scientific datasets consisting of thousands to millions of individual data files and processes offer particular challenges. This paper uses the analogy of art history provenance to explore some of the concerns of applying provenance tracking to earth science data. It also illustrates some of the provenance issues with examples drawn from the Ozone Monitoring Instrument (OMI) Data Processing System (OMIDAPS) (Tilmes et al. 2004) run at NASA's Goddard Space Flight Center by the first author.
引用
收藏
页码:59 / 65
页数:7
相关论文
共 50 条
  • [21] SCIENCE DATA INFRASTRUCTURE FOR PRESERVATION - EARTH SCIENCE
    Albani, Mirko
    Marelli, Fulvio
    Giaretta, David
    Shaon, Arif
    2012 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2012, : 5270 - 5273
  • [22] Design and Development of a Provenance Capture Platform for Data Science
    Gregori, Luca
    Missier, Paolo
    Stidolph, Matthew
    Torlone, Riccardo
    Wood, Alessandro
    2024 IEEE 40TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, ICDEW, 2024, : 285 - 290
  • [23] Milieu: Lightweight and Configurable Big Data Provenance for Science
    Cheah, You-Wei
    Canon, Richard
    Plale, Beth
    Ramakrishnan, Lavanya
    2013 IEEE INTERNATIONAL CONGRESS ON BIG DATA, 2013, : 46 - 53
  • [24] Earth Observation Data Provenance: A Blockchain-Based Solution
    Zhang, Feng
    Wang, Zihao
    Guo, Ruixin
    Qu, Guangzhi
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (07) : 9548 - 9556
  • [25] Research Workflows - Towards reproducible science via detailed provenance tracking in Open Science Chain
    Nandigam, Viswanath
    Lin, Kai
    Shantharam, Manu
    Sakai, Scott
    Sivagnanam, Subhashini
    PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING 2020, PEARC 2020, 2020, : 484 - 486
  • [26] A Demonstration of TripleProv: Tracking and Querying Provenance over Web Data
    Wylot, Marcin
    Cudre-Mauroux, Philippe
    Groth, Paul
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 8 (12): : 1993 - 1996
  • [27] A Blockchain-based Approach for Data Accountability and Provenance Tracking
    Neisse, Ricardo
    Steri, Gary
    Nai-Fovino, Igor
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY (ARES 2017), 2017,
  • [28] Tracking Data Provenance of Archaeological Temporal Information in Presence of Uncertainty
    Migliorini, Sara
    Quintarelli, Elisa
    Belussi, Alberto
    ACM JOURNAL ON COMPUTING AND CULTURAL HERITAGE, 2022, 15 (02):
  • [29] Earth system science workbench: A data management infrastructure for earth science products
    Frew, J
    Bose, R
    THIRTEENTH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2001, : 180 - 189
  • [30] Earth science instruction with digital data
    Hays, JD
    Pfirman, S
    Blumenthal, B
    Kastens, K
    Menke, W
    COMPUTERS & GEOSCIENCES, 2000, 26 (06) : 657 - 668