Mapping of high-resolution daily particulate matter (PM2.5) concentration at the city level through a machine learning-based downscaling approach

被引:0
|
作者
Nguyen, Phuong D. M. [1 ]
Phan, An H. [1 ]
Ngo, Truong X. [1 ]
Ho, Bang Q. [2 ]
Pham, Tran Vu [3 ]
Nguyen, Thanh T. N. [1 ]
机构
[1] Vietnam Natl Univ Hanoi, Univ Engn & Technol, Fac Informat Technol, E3 Bldg,144 Xuan Thuy St,Dich Vong Hau Ward, Hanoi 100000, Vietnam
[2] Vietnam Natl Univ, Dept Acad Affairs, 142 Hien Thanh St,Dist 10, Ho Chi Minh City 700000, Vietnam
[3] Ho Chi Minh City Univ Technol HCMUT, Fac Comp Sci & Engn, VNU HCM, Ho Chi Minh City 700000, Vietnam
关键词
PM2.5; Downscaling; Machine learning; Deep learning; Ho Chi Minh City;
D O I
10.1007/s10661-024-13562-6
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
PM2.5 pollution is a major global concern, especially in Vietnam, due to its harmful effects on health and the environment. Monitoring local PM2.5 levels is crucial for assessing air quality. However, Vietnam's state-of-the-art (SOTA) dataset with a 3 km resolution needs to be revised to depict spatial variation in smaller regions accurately. In this research, we investigated machine learning-based downscaling methods to improve the spatial resolution and quality of Vietnam's existing 3 km PM2.5 products using different approaches: traditional machine learning models (random forest, XGBoost, Catboost, support vector regression (SVR), mixed effect model (MEM)) and deep learning models (long short-term memory (LSTM), convolutional neural network (CNN), convolutional LSTM (ConvLSTM)). Overall, the CatBoost 2-day lag model exhibited superior performance. In terms of modeling, integrating temporal factors into tree-based models can enhance predictive accuracy. Furthermore, when faced with small datasets, traditional machine learning models demonstrate superior performance over complex deep learning approaches. The validation of machine and deep learning models based on their PM2.5 generated maps is requested because these models can obtain very high results for model evaluation but are unrealistic for application. In this study, compared to the state-of-the-art (SOTA) PM2.5 maps in Vietnam and the SOTA global maps, the proposed CatBoost 2-day lag model's maps showed a 57% increase in the correlation coefficient (Pearson R), as well as 42-73%, 28-75%, and 39-75% reductions in root mean squared error (RMSE), mean relative error (MRE), and mean absolute error (MAE), respectively. Additionally, the daily, monthly, and year-average maps generated by the Catboost 2-day lag model effectively capture the spatial distribution and seasonal variations of PM2.5 in Ho Chi Minh City. These findings indicate a substantial enhancement in the accuracy and reliability of downscaled PM2.5 maps.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] High-Resolution Satellite Bathymetry Mapping: Regression and Machine Learning-Based Approaches
    Eugenio, Francisco
    Marcello, Javier
    Mederos-Barrera, Antonio
    Marques, Ferran
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [22] Satellite-based mapping of daily high-resolution ground PM2.5 in China via space-time regression modeling
    He, Qingqing
    Huang, Bo
    REMOTE SENSING OF ENVIRONMENT, 2018, 206 : 72 - 83
  • [23] Machine learning-guided integration of fixed and mobile sensors for high resolution urban PM2.5 mapping
    Li, Tianshuai
    Huang, Xin
    Zhang, Qingzhu
    Wang, Xinfeng
    Wang, Xianfeng
    Zhu, Anbao
    Wei, Zhaolin
    Wang, Xinyan
    Wang, Haolin
    Chen, Jiaqi
    Li, Min
    Wang, Qiao
    Wang, Wenxing
    NPJ CLIMATE AND ATMOSPHERIC SCIENCE, 2025, 8 (01):
  • [24] High Spatiotemporal Resolution PM2.5 Concentration Estimation with Machine Learning Algorithm: A Case Study for Wildfire in California
    Cui, Qian
    Zhang, Feng
    Fu, Shaoyun
    Wei, Xiaoli
    Ma, Yue
    Wu, Kun
    REMOTE SENSING, 2022, 14 (07)
  • [25] Development and Evaluation of Statistical Models Based on Machine Learning Techniques for Estimating Particulate Matter (PM2.5 and PM10) Concentrations
    Hong, Wan Yun
    Koh, David
    Yu, Liya E.
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (13)
  • [26] High-resolution spatiotemporal prediction of PM 2.5 concentration based on mobile monitoring and deep learning
    Wang, Yi-Zhou
    He, Hong-Di
    Huang, Hai-Chao
    Yang, Jin-Ming
    Peng, Zhong-Ren
    ENVIRONMENTAL POLLUTION, 2025, 364
  • [27] Machine learning driven by environmental covariates to estimate high-resolution PM2.5 in data-poor regions
    Jin, Xiaoye
    Ding, Jianli
    Ge, Xiangyu
    Liu, Jie
    Xie, Boqiang
    Zhao, Shuang
    Zhao, Qiaozhen
    PEERJ, 2022, 10
  • [28] A hybrid approach for integrating micro-satellite images and sensors network-based ground measurements using deep learning for high-resolution prediction of fine particulate matter (PM2.5) over an indian city, lucknow
    Jain, Vaishali
    Mukherjee, Avideep
    Banerjee, Soumya
    Madhwal, Sandeep
    Bergin, Michael H.
    Bhave, Prakash
    Carlson, David
    Jiang, Ziyang
    Zheng, Tongshu
    Rai, Piyush
    Tripathi, Sachchida Nand
    ATMOSPHERIC ENVIRONMENT, 2024, 338
  • [29] Machine learning-based spatial downscaling and bias-correction framework for high-resolution temperature forecasting
    Meng, Xiangrui
    Zhao, Huan
    Shu, Ting
    Zhao, Junhua
    Wan, Qilin
    APPLIED INTELLIGENCE, 2024, 54 (17-18) : 8399 - 8414
  • [30] Downscaling GRACE TWSA Data into High-Resolution Groundwater Level Anomaly Using Machine Learning-Based Models in a Glacial Aquifer System
    Seyoum, Wondwosen M.
    Kwon, Dongjae
    Milewski, Adam M.
    REMOTE SENSING, 2019, 11 (07)