Reconstructing missing data by comparing interpolation techniques: Applications for long-term water quality data

被引:10
|
作者
Larson, Danelle M. [1 ]
Bungula, Wako [2 ]
Lee, Amber [3 ]
Stockdill, Alaina [3 ]
McKean, Casey [3 ]
Miller, Frederick Forrest [3 ]
Davis, Killian [3 ]
Erickson, Richard A. [1 ]
Hlavacek, Enrika [1 ]
机构
[1] US Geol Survey, Upper Midwest Environm Sci Ctr, La Crosse, WI 54603 USA
[2] Univ Wisconsin La Crosse, Dept Math & Stat, La Crosse, WI USA
[3] Univ Wisconsin La Crosse, Res Experience Undergraduates Program, La Crosse, WI USA
来源
LIMNOLOGY AND OCEANOGRAPHY-METHODS | 2023年 / 21卷 / 07期
关键词
MACHINE LEARNING-METHODS; SPATIAL INTERPOLATION; RIVER;
D O I
10.1002/lom3.10556
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Missing data are typical yet must be addressed for proper inferences or expanding datasets to guide our limnological understanding and management of aquatic systems. Interpolation methods (i.e., estimating missing values using known values within the dataset) can alleviate data gaps and common problems. We compared seven popular interpolation methods for predicting substantial missingness in a long-term water quality dataset from the Upper Mississippi River, U.S.A. The dataset included 80,000 sampling sites collected over 30 yr that had substantial missingness for total nitrogen (TN), total phosphorus (TP), and water velocity. For all three interpolated water quality variables, random forests had very high prediction accuracy and outperformed the methods of ordinary kriging, polynomial regressions, regression trees, and inverse distance weighting. TP had a mean absolute error (MAE) of 0.03 mg (L-TP)(-1), TN had a MAE of 0.39 mg (L-TN)(-1), and water velocity had a MAE of 0.10 m s(-1). The random forests' error rates were mapped and showed low spatiotemporal variability across the riverscape, indicating high model performance across many habitat types and large spatial scales. In the current era of "big data," interpolation becomes an imperative step prior to ecological analyses yet remains unfamiliar and underutilized. Our research briefly describes the importance of addressing missingness and provides a roadmap to conduct model intercomparisons of other big datasets. We also share adaptable data analysis scripts, which allows others to readily conduct interpolation comparisons for many limnology applications and contexts.
引用
收藏
页码:435 / 449
页数:15
相关论文
共 50 条
  • [31] Long-Term Data Opportunities
    不详
    MANUFACTURING ENGINEERING, 2019, 163 (06): : 32 - 33
  • [32] Long-term data on tisagenlecleucel
    David Killock
    Nature Reviews Clinical Oncology, 2021, 18 : 676 - 676
  • [33] A new method for interpolation of missing air quality data at monitor stations
    Xu, Chengdong
    Wang, Jinfeng
    Hu, Maogui
    Wang, Wei
    ENVIRONMENT INTERNATIONAL, 2022, 169
  • [34] Long-term biomonitoring with lichens: Comparing data from different sampling procedures
    Frati, Luisa
    Brunialti, Giorgio
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2006, 119 (1-3) : 391 - 404
  • [35] Long-Term Biomonitoring with Lichens: Comparing Data from Different Sampling Procedures
    Luisa Frati
    Giorgio Brunialti
    Environmental Monitoring and Assessment, 2006, 119 : 391 - 404
  • [36] ISSUES WITH MISSING DATA IN EXERCISE INTERVENTIONS FOR ELDERS WITH LONG-TERM FOLLOW-UP
    Peterson, M.
    Pieper, C. F.
    Sloane, R.
    Morey, M. C.
    GERONTOLOGIST, 2009, 49 : 213 - 213
  • [37] ESTIMATION OF SOLAR-RADIATION DATA MISSING FROM LONG-TERM METEOROLOGICAL RECORDS
    HOOK, JE
    MCCLENDON, RW
    AGRONOMY JOURNAL, 1992, 84 (04) : 739 - 742
  • [38] Techniques for handling missing data: Applications to online condition monitoring
    Nelwamondo, Fulufhelo Vincent
    Marwala, Tshilidzi
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2008, 4 (06): : 1507 - 1526
  • [39] THE QUALITY-CONTROL OF LONG-TERM CLIMATOLOGICAL DATA USING OBJECTIVE DATA-ANALYSIS
    EISCHEID, JK
    BAKER, CB
    KARL, TR
    DIAZ, HF
    JOURNAL OF APPLIED METEOROLOGY, 1995, 34 (12): : 2787 - 2795
  • [40] Preface: CJRS Special Issue: Long-Term Satellite Data and Applications
    Trishchenko, Alexander P.
    Wang, Shusen
    CANADIAN JOURNAL OF REMOTE SENSING, 2016, 42 (03) : 145 - 146