Data management and archiving for the Pacific 2001 Air Quality Study

被引:1
|
作者
Sukloff, WB [1 ]
Vet, RJ [1 ]
Li, SM [1 ]
机构
[1] Environm Canada, Toronto, ON M3H 5T4, Canada
关键词
data quality assurance; data management; data archive; data exchange standard; metadata;
D O I
10.1016/j.atmosenv.2005.11.060
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The data management and archiving activities of the Pacific 2001 Air Quality Study were handled by the Pacific 2001 Data Centre which was run by the National Atmospheric Chemistry (NAtChem) Database and Analysis Facility of Environment Canada. To ensure that the Pacific 2001 Air Quality Study data were archived in a common way, the NARSTO Data Exchange Standard (DES) was used as the mandatory format for the data files, partially because it allowed for the inclusion of metadata within the data files and partially because it provided the necessary flexibility for handling the many measurement types used in the study. Described in detail in the paper, the DES is now readily available to the scientific community. After each DES data file was submitted to the Data Centre, a read-and-verify program was run to check its conformity to the DES and to detect incorrect and problematic data. The errors detected by the read-and-verify program were automatically documented and an error report was sent to the data originators for data correction and resubmission. Statistical summaries and data plots were created for all data files and subsequently sent to the data originators for review and further error detection. Of the 125 data files submitted to the Data Centre, only 5 were error-free upon first submission. A test of 17 randomly selected files determined that all but two required at least four iterations of the submission-error checking-resubmission cycle in order to produce final error-free files. It was therefore concluded that both data originators and data centres alike should assume that errors exist in all submitted data files until proven differently by a set of automated error-checking programs. It was also concluded that data visualization plots and statistical summaries are highly effective tools for detecting errors in data files. Metadata associated with the measurement data were documented in Quality Assurance Project Plans that were archived in the Data Centre with the DES data files. (c) 2006 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2783 / 2795
页数:13
相关论文
共 50 条
  • [1] Meteorological analysis of the Pacific 2001 air quality field study
    Snyder, B
    Strawbridge, KB
    [J]. ATMOSPHERIC ENVIRONMENT, 2004, 38 (34) : 5733 - 5743
  • [2] Introduction to the special issue on Pacific 2001 air quality study
    Thomson, B
    Li, SM
    Belzer, W
    [J]. ATMOSPHERIC ENVIRONMENT, 2004, 38 (34) : 5717 - 5717
  • [3] The Pacific 2001 Air Quality Study - synthesis of findings and policy implications
    Vingarzan, R
    Li, SM
    [J]. ATMOSPHERIC ENVIRONMENT, 2006, 40 (15) : 2637 - 2649
  • [4] Airborne and scanning lidar results obtained during the pacific 2001 air quality field study
    Strawbridge, KB
    [J]. 22ND INTERNATIONAL LASER RADAR CONFERENCE (ILRC 2004), VOLS 1 AND 2, 2004, 561 : 751 - 754
  • [5] Data Quality in Web Archiving
    Spaniol, Marc
    Denev, Dimitar
    Mazeika, Arturas
    Weikum, Gerhard
    Senellart, Pierre
    [J]. WICOW 09, 2009, : 19 - 26
  • [6] Air quality 2001
    Anon
    [J]. Israel Environment Bulletin, 2002, 25 (03):
  • [7] A concerted effort to understand the ambient particulate matter in the Lower Fraser Valley: the Pacific 2001 Air Quality Study
    Li, SM
    [J]. ATMOSPHERIC ENVIRONMENT, 2004, 38 (34) : 5719 - 5731
  • [8] ARCHIVING AND QUALITY-CONTROL OF CLIMATOLOGICAL DATA
    BRYANT, GW
    [J]. METEOROLOGICAL MAGAZINE, 1979, 108 (1287): : 309 - 315
  • [9] The SHARC framework for data quality in Web archiving
    Dimitar Denev
    Arturas Mazeika
    Marc Spaniol
    Gerhard Weikum
    [J]. The VLDB Journal, 2011, 20 : 183 - 207
  • [10] The SHARC framework for data quality in Web archiving
    Denev, Dimitar
    Mazeika, Arturas
    Spaniol, Marc
    Weikum, Gerhard
    [J]. VLDB JOURNAL, 2011, 20 (02): : 183 - 207