Evaluation of different machine learning approaches for predicting high concentration episodes of ground-level ozone: A case study in Catalonia, Spain

Cited by: 6
Authors
Vicente, D. J. [1]
Salazar, F. [1,2]
Lopez-Chacon, S. R. [1]
Soriano, C. [1]
Martin-Vide, J. [3]
Affiliations
[1] Int Ctr Numer Methods Engn CIMNE, Barcelona 08034, Spain
[2] Univ Politecn Catalunya UPC, Flumen Res Inst, Barcelona 08034, Spain
[3] Univ Barcelona, Dept Geog, IdRA Climatol Grp, Barcelona, Spain
Keywords
Ozone; Air pollution; Machine learning; High ozone episodes; Random forest; SUPPORT VECTOR MACHINE; SURFACE-OZONE; SPATIOTEMPORAL PREDICTION; CHINA; MODEL; CLASSIFICATION; POLLUTION
DOI
10.1016/j.apr.2023.101999
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Ground-level ozone (O3) is a pollutant with a great impact on human health and the environment. Because it is a secondary air contaminant of photochemical origin, areas with greater exposure to solar radiation, such as Spain and other Mediterranean countries, are considerably affected. With the aggravation of O3 pollution, it is important to provide reliable forecasting tools to help stakeholders implement more effective policies to mitigate the negative impact associated with this problem. In this regard, Machine Learning (ML)-based models have emerged in recent years, since they are able to identify complex relationships between ozone levels and relevant variables. However, their application to capture the most extreme events remains difficult. In this work, different ML approaches for predicting daily maximum 8-h average ozone (O3,MDA8) were compared, investigating their ability to forecast the highest concentration levels recorded. Two variants of the Random Forest algorithm (regression and classification) were applied to a specific area of Catalonia, Spain, of special interest because of its high number of O3 exceedance episodes. The predictive models were built with a 1-day time horizon, using datasets from 2002 to 2020. The variables used as inputs were the concentrations of other air pollutants and meteorological variables, monitored the day before the target day, together with time information. Although results showed reasonable overall performance, low accuracy was achieved when forecasting the highest episodes of O3,MDA8. To improve the capacity of the models to predict high O3,MDA8 concentration levels, a methodology was proposed to fine-tune the original predictions of the ML models according to a classification metric, G-Mean, which allows adjusting the balance between the correct predictions of different classes.
Using the Sensitivity and Specificity metrics, the classical approaches were compared with the new ones proposed in the present study. The results obtained for all the cases analysed showed a mean increase in Sensitivity of 0.28, associated with a greater number of True Positives (correct predictions of high-O3 episodes). On the other hand, the average Specificity decreased because of a greater number of False Positives, although this reduction was only 0.05. The proposed criteria showed promising results, better balancing the classification metrics and increasing the ratio of correct predictions in the higher ranges of O3.
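The abstract does not spell out the fine-tuning procedure, so the following is only a minimal sketch of one plausible reading: a regression forecast is turned into an episode/no-episode class by a cutoff, and the cutoff is chosen to maximise G-Mean, taken here as the geometric mean of Sensitivity and Specificity. The function names, the candidate cutoffs, the episode level of 120 and all data values are hypothetical and are not taken from the paper.

```python
from math import sqrt


def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for two boolean sequences."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t and p)
    tn = sum(1 for t, p in zip(y_true, y_pred) if not t and not p)
    fp = sum(1 for t, p in zip(y_true, y_pred) if not t and p)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t and not p)
    sens = tp / (tp + fn) if (tp + fn) else 0.0
    spec = tn / (tn + fp) if (tn + fp) else 0.0
    return sens, spec


def g_mean(y_true, y_pred):
    """Geometric mean of sensitivity and specificity."""
    sens, spec = sensitivity_specificity(y_true, y_pred)
    return sqrt(sens * spec)


def best_cutoff(observed, predicted, episode_level, candidates):
    """Pick the candidate cutoff on the predictions with the highest G-Mean.

    Observed values at or above `episode_level` count as true episodes.
    Ties keep the first candidate encountered.
    """
    y_true = [o >= episode_level for o in observed]
    best = None
    for cutoff in candidates:
        y_pred = [p >= cutoff for p in predicted]
        score = g_mean(y_true, y_pred)
        if best is None or score > best[1]:
            best = (cutoff, score)
    return best


# Hypothetical daily values: the forecast underestimates the observed peaks.
observed = [80, 95, 130, 110, 125, 70, 140, 100]
predicted = [85, 90, 112, 113, 118, 75, 119, 98]
episode_level = 120  # illustrative episode level, an assumption of this sketch

y_true = [o >= episode_level for o in observed]
naive = sensitivity_specificity(y_true, [p >= episode_level for p in predicted])
cutoff, score = best_cutoff(observed, predicted, episode_level,
                            candidates=[120, 115, 110, 105, 100])
tuned = sensitivity_specificity(y_true, [p >= cutoff for p in predicted])
print("naive (sens, spec):", naive)
print("tuned cutoff:", cutoff, "G-Mean:", round(score, 3), "(sens, spec):", tuned)
```

In this toy data, lowering the cutoff recovers the missed episodes at the cost of one false positive, which mirrors the trade-off the abstract reports: Sensitivity rises while Specificity drops slightly. In practice the cutoff would be selected on validation data rather than on the data used for the final evaluation.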
Pages: 13
Related papers
50 items in total
  • [31] A Study on Statistical Data Mining Algorithms for the Prediction of Ground-Level Ozone Concentration in the El Paso-Juarez Area
    Bhuiyan, Md Al Masum
    Mahmud, Suhail
    Sarmin, Nusrat
    Elahee, Sanjida
    AEROSOL SCIENCE AND ENGINEERING, 2020, 4 (04) : 293 - 305
  • [32] Comprehensive 24-hour ground-level ozone monitoring: Leveraging machine learning for full-coverage estimation in East Asia
    Kim, Yejin
    Park, Seohui
    Choi, Hyunyoung
    Im, Jungho
    JOURNAL OF HAZARDOUS MATERIALS, 2025, 488
  • [33] First estimation of hourly full-coverage ground-level ozone from Fengyun-4A satellite using machine learning
    Gao, Ling
    Zhang, Han
    Yang, Fukun
    Tan, Wangshu
    Wu, Ronghua
    Song, Yi
    ENVIRONMENTAL RESEARCH LETTERS, 2024, 19 (02):
  • [34] Understanding the spatial and seasonal variation of the ground-level ozone in Southeast China with an interpretable machine learning and multi-source remote sensing
    Zhong, Haobin
    Zhen, Ling
    Xiao, Yanping
    Liu, Jinsong
    Chen, Baihua
    Xu, Wei
    SCIENCE OF THE TOTAL ENVIRONMENT, 2024, 917
  • [35] Field Measures Are All You Need: Predicting Need for Surgery in Elderly Ground-Level Fall Patients via Machine Learning
    Shooshani, Tara
    Pooladzandi, Omead
    Nguyen, Andrew
    Shipley, Jonathan H. H.
    Harris, Mark H. H.
    Hovis, Gabrielle E. A.
    Barrios, Cristobal
    AMERICAN SURGEON, 2023, 89 (10) : 4095 - 4100
  • [36] Predicting the ground-level pollutants concentrations and identifying the influencing factors using machine learning, wavelet transformation, and remote sensing techniques
    Ebrahimi-Khusfi, Zohre
    Taghizadeh-Mehrjardi, Ruhollah
    Kazemi, Mohamad
    Nafarzadegan, Ali Reza
    ATMOSPHERIC POLLUTION RESEARCH, 2021, 12 (05)
  • [37] Machine Learning Approaches for Predicting Crystal Systems: A Brief Review and a Case Study
    Settembre, Gaetano
    Corriero, Nicola
    Del Buono, Nicoletta
    Esposito, Flavia
    Rizzi, Rosanna
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2022, PT I, 2023, 13810 : 93 - 107
  • [38] Ground-based remote sensing measurements of aerosol and ozone in an urban area: A case study of mixing height evolution and its effect on ground-level ozone concentrations
    Kim, Sang-Woo
    Yoon, Soon-Chang
    Won, Jae-Gwang
    Choi, Sung-Chul
    ATMOSPHERIC ENVIRONMENT, 2007, 41 (33) : 7069 - 7081
  • [39] Comparative Assessment of Linear Regression and Machine Learning for Analyzing the Spatial Distribution of Ground-level NO2 Concentrations: A Case Study for Seoul, Korea
    Kang, Eunjin
    Yoo, Cheolhee
    Shin, Yeji
    Cho, Dongjin
    Im, Jungho
    KOREAN JOURNAL OF REMOTE SENSING, 2021, 37 (06) : 1739 - 1756
  • [40] Predicting Groundwater Level Based on Machine Learning: A Case Study of the Hebei Plain
    Wu, Zhenjiang
    Lu, Chuiyu
    Sun, Qingyan
    Lu, Wen
    He, Xin
    Qin, Tao
    Yan, Lingjia
    Wu, Chu
    WATER, 2023, 15 (04)