A machine learning model for estimating daily maximum 8-hour average ozone concentrations using OMI and MODIS products

被引:0
|
作者
Jung, Chau-Ren [1 ,2 ]
Chen, Wei [3 ]
Chen, Wei-Ting [4 ]
Su, Shih-Hao [5 ]
Chen, Bo-Ting [3 ]
Chang, Ling [3 ]
Hwang, Bing-Fang [3 ,6 ]
机构
[1] Department of Public Health, China Medical University, Taiwan
[2] Japan Environment and Children's Study Programme Office, National Institute for Environmental Studies, Tsukuba, Japan
[3] Department of Occupational Safety and Health, College of Public Health, China Medical University, Taichung, Taiwan
[4] Department of Atmospheric Sciences, National Taiwan University, Taipei, Taiwan
[5] Department of Atmospheric Sciences, Chinese Culture University, Taipei, Taiwan
[6] Department of Occupational Therapy, College of Medical and Health Science, Asia University, Taichung, Taiwan
基金
美国国家航空航天局;
关键词
Aerosols - Air pollution - Boundary layer flow - Boundary layers - Climate change - Forestry - Land use - Machine learning - Mean square error - Nitrogen oxides - Remote sensing;
D O I
10.1016/j.atmosenv.2024.120587
中图分类号
学科分类号
摘要
Tropospheric ozone (O3) is a criteria air pollutants posing risks to organisms, and is expected to enhance formation due to climate change. Satellite-based measurements provide a promising approach to estimate ground-level air pollution on large scale. However, most applications of satellite-based measurements have been used for fine particulate matter and nitrogen dioxide, while only a few have been used for O3. In this study, we incorporated satellite-based measurements from the Ozone Monitoring Instrument (OMI) and MOderate-resolution Imaging Spectroradiometer (MODIS) with meteorological variables and land-use data to estimate daily maximum 8-h average O3 at 1-km resolution in Taiwan during 2004–2020. The random forest model was used to impute the missing values of the satellite-based measurements. Additionally, the XGBoost model was leveraged to estimate daily O3 concentrations. Model performance was evaluated by the ten-fold cross-validation (CV), temporal and spatial validation, and the results were reported as the coefficient of determination (R2) and root mean square error (RMSE). Our results showed that the 10-fold CV, temporal validated, and spatial validated R2 (RMSE) of the XGBoost model were 0.82 (7.71 ppb), 0.63 (11.09 ppb), and 0.68 (10.27 ppb), respectively. Our model performance was better in central and southern Taiwan. The top ten important predictors were date (relative importance = 12.15%), temperature (10.77%), meridional wind (10.71%), relative humidity (9.60%), zonal wind (8.14%), UV radiation (8.07%), total precipitation (6.35%), surface pressure (5.34%), surface O3 volume mixing ratio (4.93%), and boundary layer height (4.69%). The spatial distribution of O3 estimates showed that daily maximum 8-h average O3 concentrations were higher in the suburban and mountainous areas near the central and southern Taiwan. This reveals that sensitive populations should still pay attention to the secondary pollutants even when outside the urban areas. The O3 estimates can be further leveraged to evaluate the short-term and long-term effects of O3 on human health. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 27 条
  • [1] A machine learning model for estimating daily maximum 8-hour average ozone concentrations using OMI and MODIS products
    Jung, Chau-Ren
    Chen, Wei
    Chen, Wei-Ting
    Su, Shih-Hao
    Chen, Bo-Ting
    Chang, Ling
    Hwang, Bing-Fang
    ATMOSPHERIC ENVIRONMENT, 2024, 331
  • [2] A machine learning model for estimating daily maximum 8-hour average ozone concentrations using OMI and MODIS products
    Jung, Chau-Ren
    Chen, Wei
    Chen, Wei-Ting
    Su, Shih-Hao
    Chen, Bo-Ting
    Chang, Ling
    Hwang, Bing-Fang
    ATMOSPHERIC ENVIRONMENT, 2024, 331
  • [3] Exploring decomposition of temporal patterns to facilitate learning of neural networks for ground-level daily maximum 8-hour average ozone prediction
    Leufen, Lukas Hubert
    Kleinert, Felix
    Schultz, Martin G.
    ENVIRONMENTAL DATA SCIENCE, 2022, 1
  • [4] Effects of am ambient volatile organic compound and nitrogen oxide levels on same-day maximum 8-hour ozone concentrations in the charlotte metropolitan region for the 1995-2000 summer ozone seasons.
    Lang, LM
    Owens, PM
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2002, 223 : U227 - U227
  • [5] Modeling and forecasting daily maximum hourly ozone concentrations using the RegAR model with skewed and heavy-tailed innovations
    Queiroz Sarnaglia, Alessandro Jose
    Jimenez Monroy, Nataly Adriana
    da Vitoria, Arthur Gomes
    ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2018, 25 (04) : 443 - 469
  • [6] Modeling and forecasting daily maximum hourly ozone concentrations using the RegAR model with skewed and heavy-tailed innovations
    Alessandro José Queiroz Sarnaglia
    Nátaly Adriana Jiménez Monroy
    Arthur Gomes da Vitória
    Environmental and Ecological Statistics, 2018, 25 : 443 - 469
  • [7] A neural network regression model for estimating maximum daily air temperature using Landsat-8 data
    Nascetti, A.
    Monterisi, C.
    Iurilli, F.
    Sonnessa, A.
    XXIV ISPRS CONGRESS: IMAGING TODAY, FORESEEING TOMORROW, COMMISSION III, 2022, 43-B3 : 1273 - 1278
  • [8] Predicting peak daily maximum 8 h ozone and linkages to emissions andmeteorology in Southern California using machine learning methods(SoCAB-8HR V1.0)
    Gao, Ziqi
    Wang, Yifeng
    Vasilakos, Petros
    Ivey, Cesunica E.
    Do, Khanh
    Russell, Armistead G.
    GEOSCIENTIFIC MODEL DEVELOPMENT, 2022, 15 (24) : 9015 - 9029
  • [9] Prediction of daily mean and one-hour maximum PM2.5 concentrations and applications in Central Mexico using satellite-based machine-learning models
    Gutierrez-Avila, Ivan
    Arfer, Kodi B.
    Carrion, Daniel
    Rush, Johnathan
    Kloog, Itai
    Naeger, Aaron R.
    Grutter, Michel
    Paramo-Figueroa, Victor Hugo
    Riojas-Rodriguez, Horacio
    Just, Allan C.
    JOURNAL OF EXPOSURE SCIENCE AND ENVIRONMENTAL EPIDEMIOLOGY, 2022, 32 (06) : 917 - 925
  • [10] Prediction of daily mean and one-hour maximum PM2.5 concentrations and applications in Central Mexico using satellite-based machine-learning models
    Iván Gutiérrez-Avila
    Kodi B. Arfer
    Daniel Carrión
    Johnathan Rush
    Itai Kloog
    Aaron R. Naeger
    Michel Grutter
    Víctor Hugo Páramo-Figueroa
    Horacio Riojas-Rodríguez
    Allan C. Just
    Journal of Exposure Science & Environmental Epidemiology, 2022, 32 : 917 - 925