Mapping of high-resolution daily particulate matter (PM2.5) concentration at the city level through a machine learning-based downscaling approach

被引:0
|
作者
Nguyen, Phuong D. M. [1 ]
Phan, An H. [1 ]
Ngo, Truong X. [1 ]
Ho, Bang Q. [2 ]
Pham, Tran Vu [3 ]
Nguyen, Thanh T. N. [1 ]
机构
[1] Vietnam Natl Univ Hanoi, Univ Engn & Technol, Fac Informat Technol, E3 Bldg,144 Xuan Thuy St,Dich Vong Hau Ward, Hanoi 100000, Vietnam
[2] Vietnam Natl Univ, Dept Acad Affairs, 142 Hien Thanh St,Dist 10, Ho Chi Minh City 700000, Vietnam
[3] Ho Chi Minh City Univ Technol HCMUT, Fac Comp Sci & Engn, VNU HCM, Ho Chi Minh City 700000, Vietnam
关键词
PM2.5; Downscaling; Machine learning; Deep learning; Ho Chi Minh City;
D O I
10.1007/s10661-024-13562-6
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
PM2.5 pollution is a major global concern, especially in Vietnam, due to its harmful effects on health and the environment. Monitoring local PM2.5 levels is crucial for assessing air quality. However, Vietnam's state-of-the-art (SOTA) dataset with a 3 km resolution needs to be revised to depict spatial variation in smaller regions accurately. In this research, we investigated machine learning-based downscaling methods to improve the spatial resolution and quality of Vietnam's existing 3 km PM2.5 products using different approaches: traditional machine learning models (random forest, XGBoost, Catboost, support vector regression (SVR), mixed effect model (MEM)) and deep learning models (long short-term memory (LSTM), convolutional neural network (CNN), convolutional LSTM (ConvLSTM)). Overall, the CatBoost 2-day lag model exhibited superior performance. In terms of modeling, integrating temporal factors into tree-based models can enhance predictive accuracy. Furthermore, when faced with small datasets, traditional machine learning models demonstrate superior performance over complex deep learning approaches. The validation of machine and deep learning models based on their PM2.5 generated maps is requested because these models can obtain very high results for model evaluation but are unrealistic for application. In this study, compared to the state-of-the-art (SOTA) PM2.5 maps in Vietnam and the SOTA global maps, the proposed CatBoost 2-day lag model's maps showed a 57% increase in the correlation coefficient (Pearson R), as well as 42-73%, 28-75%, and 39-75% reductions in root mean squared error (RMSE), mean relative error (MRE), and mean absolute error (MAE), respectively. Additionally, the daily, monthly, and year-average maps generated by the Catboost 2-day lag model effectively capture the spatial distribution and seasonal variations of PM2.5 in Ho Chi Minh City. These findings indicate a substantial enhancement in the accuracy and reliability of downscaled PM2.5 maps.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] Research on time series change point detection and influencing factors under machine learning: based on PM2.5 concentration data in Hefei city
    Maosen Xia
    Linlin Dong
    Lingling Jiang
    Min Zeng
    Earth Science Informatics, 2024, 17 : 351 - 364
  • [42] Research on time series change point detection and influencing factors under machine learning: based on PM2.5 concentration data in Hefei city
    Xia, Maosen
    Dong, Linlin
    Jiang, Lingling
    Zeng, Min
    EARTH SCIENCE INFORMATICS, 2024, 17 (01) : 351 - 364
  • [43] Spatiotemporal trend analysis for fine particulate matter concentrations in China using high-resolution satellite-derived and ground-measured PM2.5 data
    Bai, Kaixu
    Ma, Mingliang
    Chang, Ni-Bin
    Gao, Wei
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2019, 233 : 530 - 542
  • [44] Estimation of the vertical distribution of particle matter (PM2.5) concentration and its transport flux from lidar measurements based on machine learning algorithms
    Ma, Yingying
    Zhu, Yang
    Liu, Boming
    Li, Hui
    Jin, Shikuan
    Zhang, Yiqun
    Fan, Ruonan
    Gong, Wei
    ATMOSPHERIC CHEMISTRY AND PHYSICS, 2021, 21 (22) : 17003 - 17016
  • [45] Spatiotemporal Weighted for Improving the Satellite-Based High-Resolution Ground PM2.5 Estimation Using the Light Gradient Boosting Machine
    Yu, Xinyu
    Xi, Mengzhu
    Wu, Liyang
    Zheng, Hui
    REMOTE SENSING, 2023, 15 (16)
  • [46] A Machine Learning-Based High-Resolution Soil Moisture Mapping and Spatial-Temporal Analysis: The mlhrsm Package
    Peng, Yuliang
    Yang, Zhengwei
    Zhang, Zhou
    Huang, Jingyi
    AGRONOMY-BASEL, 2024, 14 (03):
  • [47] A Comparison of Machine Learning-Based Approaches in Estimating Surface PM2.5 Concentrations Focusing on Artificial Neural Networks and High Pollution Events
    Wei, Shijin
    Shores, Kyle
    Xu, Yangyang
    ATMOSPHERE, 2025, 16 (01)
  • [48] TC-GEN: Data-Driven Tropical Cyclone Downscaling Using Machine Learning-Based High-Resolution Weather Model
    Jing, Renzhi
    Gao, Jianxiong
    Cai, Yunuo
    Xi, Dazhi
    Zhang, Yinda
    Fu, Yanwei
    Emanuel, Kerry
    Diffenbaugh, Noah S.
    Bendavid, Eran
    JOURNAL OF ADVANCES IN MODELING EARTH SYSTEMS, 2024, 16 (10)
  • [49] Which model to choose? Performance comparison of statistical and machine learning models in predicting PM2.5 from high-resolution satellite aerosol optical depth
    Kulkarni, Padmavati
    Sreekanth, V.
    Upadhya, Adithi R.
    Gautam, Hrishikesh Chandra
    ATMOSPHERIC ENVIRONMENT, 2022, 282
  • [50] Developing high-resolution PM2.5 exposure models by integrating low-cost sensors, automated machine learning, and big human mobility data
    Yu, Manzhu
    Zhang, Shiyan
    Zhang, Kai
    Yin, Junjun
    Varela, Matthew
    Miao, Jiheng
    FRONTIERS IN ENVIRONMENTAL SCIENCE, 2023, 11