Toward Large-Scale Riverine Phosphorus Estimation Using Remote Sensing and Machine Learning

被引:0
|
作者
Ramtel, Pradeep [1 ]
Feng, Dongmei [1 ]
Gardner, John [2 ]
机构
[1] Univ Cincinnati, Dept Chem & Environm Engn, Cincinnati, OH 45221 USA
[2] Univ Pittsburgh, Dept Geol & Environm Sci, Pittsburgh, PA USA
关键词
total phosphorus; remote sensing; machine learning model; US Rivers; CHLOROPHYLL-A; WATER; QUALITY; SEDIMENT; EUTROPHICATION; SELECTION; IMPACT;
D O I
10.1029/2024JG008121
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Phosphorus pollution is a major water quality issue impacting the environment and human health. Traditional methods limit the frequency and extent of total phosphorus (TP) measurements across many rivers. However, remote sensing can accurately estimate riverine TP; nevertheless, no large-scale assessment of riverine TP using remote sensing exists. Large-scale models using remote sensing can provide a fast and consistent method for TP measurement, important for data generalization and accessing extensive spatial-temporal change in TP. Our study uses remote sensing and machine learning to estimate the TP in rivers in the contiguous United States (CONUS). Initially, we developed a national scale matchup data set for Landsat detectable rivers (river width >30 m) using in situ TP and surface reflectance. We used in situ data from the Water Quality Portal (WQP), alongside water surface reflectance data from Landsat 5, 7, and 8 spanning from 1984 to 2021. Then, we used this data set to develop a machine learning (ML) model using different preprocessing methods and algorithms. We found that using high-level vegetation in the clustering approach and over-sampling or under-sampling our training data in the sampling approach improved our model estimation accuracy. We compared XGBLinear, XGBTree, Regularized Random Forest (RRF), and K-Nearest neighbors ML algorithms and selected XGBLinear as the best model with an R-2 of 0.604, RMSE of 0.103 mg/L, mean average error of 0.83, and NSE of 0.602. Finally, we identified human footprint, elevation, river area, and soil erosion as the main attributes influencing the accuracy of estimated TP from the ML model.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Toward Large-Scale Vulnerability Discovery using Machine Learning
    Grieco, Gustavo
    Grinblat, Guillermo Luis
    Uzal, Lucas
    Rawat, Sanjay
    Feist, Josselin
    Mounier, Laurent
    [J]. CODASPY'16: PROCEEDINGS OF THE SIXTH ACM CONFERENCE ON DATA AND APPLICATION SECURITY AND PRIVACY, 2016, : 85 - 96
  • [2] Large-scale estimation of forest canopy opening using remote sensing in Central Africa
    Bourbier, Lucas
    Cornu, Guillaume
    Pennec, Alexandre
    Brognoli, Christine
    Gond, Valery
    [J]. BOIS ET FORETS DES TROPIQUES, 2013, (315) : 3 - 9
  • [3] Large-Scale Automated Sustainability Assessment of Infrastructure Projects Using Machine Learning Algorithms with Multisource Remote Sensing Data
    Shamshirgaran, Amiradel
    Nourzad, Seyed Hossein Hosseini
    Keshtkar, Hamidreza
    Golabchi, Mahmood
    Sadeghi, Mehrdad
    [J]. JOURNAL OF INFRASTRUCTURE SYSTEMS, 2022, 28 (04)
  • [4] Incremental Learning for Semantic Segmentation of Large-Scale Remote Sensing Data
    Tasar, Onur
    Tarabalka, Yuliya
    Alliez, Pierre
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2019, 12 (09) : 3524 - 3537
  • [5] Large-scale estimation of xylem phenology in black spruce through remote sensing
    Antonucci, Serena
    Rossi, Sergio
    Deslauriers, Annie
    Morin, Hubert
    Lombardi, Fabio
    Marchetti, Marco
    Tognetti, Roberto
    [J]. AGRICULTURAL AND FOREST METEOROLOGY, 2017, 233 : 92 - 100
  • [6] Responding to Large-Scale Forest Damage in an Alpine Environment with Remote Sensing, Machine Learning, and Web-GIS
    Piragnolo, Marco
    Pirotti, Francesco
    Zanrosso, Carlo
    Lingua, Emanuele
    Grigolato, Stefano
    [J]. REMOTE SENSING, 2021, 13 (08)
  • [7] Toward Large-Scale Learning Design
    Davis, Dan
    Seaton, Daniel
    Hauff, Claudia
    Houben, Geert-Jan
    [J]. PROCEEDINGS OF THE FIFTH ANNUAL ACM CONFERENCE ON LEARNING AT SCALE (L@S'18), 2018,
  • [8] Multiple Feature Hashing Learning for Large-Scale Remote Sensing Image Retrieval
    Ye, Dongjie
    Li, Yansheng
    Tao, Chao
    Xie, Xunwei
    Wang, Xiang
    [J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2017, 6 (11)
  • [9] A Survey on Large-Scale Machine Learning
    Wang, Meng
    Fu, Weijie
    He, Xiangnan
    Hao, Shijie
    Wu, Xindong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (06) : 2574 - 2594
  • [10] Toward Robust Anxiety Biomarkers: A Machine Learning Approach in a Large-Scale Sample
    Boeke, Emily A.
    Holmes, Avram J.
    Phelps, Elizabeth A.
    [J]. BIOLOGICAL PSYCHIATRY-COGNITIVE NEUROSCIENCE AND NEUROIMAGING, 2020, 5 (08) : 799 - 807