Evaluation of different machine learning approaches for predicting high concentration episodes of ground-level ozone: A case study in Catalonia, Spain

Cited by: 6
Authors
Vicente, D. J. [1]
Salazar, F. [1, 2]
Lopez-Chacon, S. R. [1]
Soriano, C. [1]
Martin-Vide, J. [3]
Affiliations
[1] Int Ctr Numer Methods Engn CIMNE, Barcelona 08034, Spain
[2] Univ Politecn Catalunya UPC, Flumen Res Inst, Barcelona 08034, Spain
[3] Univ Barcelona, Dept Geog, IdRA Climatol Grp, Barcelona, Spain
Keywords
Ozone; Air pollution; Machine learning; High ozone episodes; Random forest; Support vector machine; Surface ozone; Spatiotemporal prediction; China; Model; Classification; Pollution
DOI
10.1016/j.apr.2023.101999
Chinese Library Classification number
X [Environmental science, safety science]
Discipline classification code
08; 0830
Abstract
Ground-level ozone (O3) is a pollutant with a great impact on human health and the environment. Because it is a secondary air contaminant of photochemical origin, areas with greater exposure to solar radiation, such as Spain and other Mediterranean countries, are considerably affected. As O3 pollution worsens, reliable forecasting tools are needed to help stakeholders implement more effective policies to mitigate its negative impact. Machine Learning (ML) based models have emerged in recent years for this purpose, since they can identify complex relationships between ozone levels and relevant variables. However, capturing the most extreme events with them remains difficult. In this work, different ML approaches for predicting the daily maximum 8-h average ozone concentration (O3,MDA8) were compared, investigating their ability to forecast the highest concentration levels recorded. Two variants of the Random Forest algorithm (regression and classification) were applied to a specific area of Catalonia, Spain, of special interest because of its high number of episodes in which O3 concentration levels are exceeded. The predictive models were built with a 1-day time horizon, using datasets from 2002 to 2020. The input variables were concentrations of other air pollutants and meteorological processes, both monitored the day before the target day, together with time information. Although the results showed reasonable overall performance, accuracy was low when forecasting the highest episodes of O3,MDA8. To improve the capacity of the models to predict high O3,MDA8 concentration levels, a methodology was proposed to fine-tune the original predictions of the ML models according to a classification metric, G-Mean, which allows the balance between correct predictions of the different classes to be adjusted.
Using the Sensitivity and Specificity metrics, the classical approaches were compared with the original ones proposed in the present study. For all the cases analysed, the results showed a mean increase in Sensitivity of 0.28, associated with a greater number of True Positives (correct predictions of high-O3 episodes). The average Specificity value, on the other hand, decreased because of a greater number of False Positives, although this reduction was only 0.05. The proposed criteria showed promising results, better balancing the classification metrics and increasing the ratio of correct predictions in the higher ranges of O3.
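The trade-off described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not give the exact tuning procedure, so the code below assumes the usual definitions (Sensitivity = TP / (TP + FN), Specificity = TN / (TN + FP), G-Mean = sqrt(Sensitivity × Specificity)) and simply picks, from a list of candidate cut-offs on the model output, the one that maximises G-Mean. The function names, the toy values, and the 120 cut-off used as the starting point are all hypothetical.

```python
from math import sqrt


def confusion_counts(y_true, y_pred):
    """Return (TP, FN, TN, FP) for binary labels; 1 marks a high-ozone day."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    return tp, fn, tn, fp


def sensitivity_specificity(y_true, y_pred):
    """Sensitivity = TP / (TP + FN); Specificity = TN / (TN + FP)."""
    tp, fn, tn, fp = confusion_counts(y_true, y_pred)
    sens = tp / (tp + fn) if (tp + fn) else 0.0
    spec = tn / (tn + fp) if (tn + fp) else 0.0
    return sens, spec


def g_mean(y_true, y_pred):
    """Geometric mean of Sensitivity and Specificity."""
    sens, spec = sensitivity_specificity(y_true, y_pred)
    return sqrt(sens * spec)


def classify(scores, threshold):
    """Flag a day as a high-ozone episode when the model output reaches the cut-off."""
    return [1 if s >= threshold else 0 for s in scores]


def best_threshold(y_true, scores, candidates):
    """Return (threshold, G-Mean) for the candidate cut-off with the highest G-Mean."""
    best = None
    for thr in candidates:
        gm = g_mean(y_true, classify(scores, thr))
        if best is None or gm > best[1]:
            best = (thr, gm)
    return best


# Toy data (hypothetical): observed episode labels and predicted MDA8 values.
y_true = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1]
scores = [60, 75, 80, 90, 95, 105, 98, 104, 118, 125]

# Illustrative starting cut-off (an assumption, not taken from the abstract).
before = sensitivity_specificity(y_true, classify(scores, 120))
thr, gm = best_threshold(y_true, scores, [90, 100, 110, 120])
after = sensitivity_specificity(y_true, classify(scores, thr))

print("before tuning:", before)
print("chosen threshold:", thr, "G-Mean:", round(gm, 3))
print("after tuning:", after)
```

On this toy data, lowering the cut-off raises Sensitivity at the cost of a smaller drop in Specificity, which is the same direction of change the abstract reports.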
Pages: 13