An Ensemble Machine Learning Model to Enhance Extrapolation Ability of Predicting Coarse Particulate Matter with High Resolutions in China

被引:0
|
作者
Shi, Su [1 ,2 ]
Chen, Renjie [1 ,2 ]
Wang, Peng [3 ]
Zhang, Hongliang [4 ]
Kan, Haidong [1 ,2 ]
Meng, Xia [1 ,2 ,5 ]
机构
[1] Fudan Univ, Sch Publ Hlth, Key Lab Publ Hlth Safety, Minist Educ, Shanghai 200433, Peoples R China
[2] Fudan Univ, Natl Hlth Commiss NHC Key Lab Hlth Technol Assessm, IRDR ICoE Risk Interconnect & Governance Weather C, Minist Hlth, Shanghai 200433, Peoples R China
[3] Fudan Univ, Inst Atmospher Sci, Dept Atmospher & Ocean Sci, Shanghai 200438, Peoples R China
[4] Fudan Univ, Dept Environm Sci & Engn, Shanghai 200438, Peoples R China
[5] Shanghai Typhoon Inst CMA, Shanghai Key Lab Meteorol & Hlth, Shanghai 200030, Peoples R China
基金
中国国家自然科学基金;
关键词
PM10-2.5; PM2.5; PM10; ensemble method; random forest; AEROSOL OPTICAL DEPTH; LAND-USE REGRESSION; AIR-POLLUTION; PM2.5; CONCENTRATIONS; REANALYSIS; PRODUCTS; PROVINCE; MERRA-2; PM10; PART;
D O I
10.1021/acs.est.4c08610
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate exposure assessment is important for conducting PM10-2.5-related epidemiological studies, which have been limited thus far. In this study, we aimed to develop an ensemble machine learning method to estimate PM10-2.5 concentrations in mainland China during 2013-2020. The study was conducted in two stages. In the first stage, we developed two methods: the indirect method refers to developing models for PM2.5 and PM10 separately and subsequently calculating PM10-2.5 as the difference between them; and the direct method refers to establishing a model between PM10-2.5 measurements and relevant predictors directly. In the second stage, we employed an ensemble method by integrating predictions from both indirect and direct methods. Internal and external cross-validation (CV) were performed to validate the extrapolation capacity of models. The ensemble method demonstrated enhanced extrapolation accuracy in both internal and external CV compared to indirect and direct methods. The predictions produced by the ensemble method captured the spatiotemporal pattern of PM10-2.5, even in the sand and dust storm seasons. Our study introduces an ensemble strategy leveraging the strengths of both indirect and direct methods to estimate PM10-2.5 concentrations, which holds significant potential to support future epidemiological studies to address knowledge gaps in understanding the health effects of PM10-2.5.
引用
收藏
页码:19325 / 19337
页数:13
相关论文
共 50 条
  • [1] Predicting Fine Particulate Matter (PM2.5) in the Greater London Area: An Ensemble Approach using Machine Learning Methods
    Yazdi, Mahdieh Danesh
    Kuang, Zheng
    Dimakopoulou, Konstantina
    Barratt, Benjamin
    Suel, Esra
    Amini, Heresh
    Lyapustin, Alexei
    Katsouyanni, Klea
    Schwartz, Joel
    REMOTE SENSING, 2020, 12 (06)
  • [2] A Machine Learning Model for Predicting Heart Disease using Ensemble Methods
    Samagh J.S.
    Singh D.
    International Journal of Advanced Computer Science and Applications, 2022, 13 (09) : 558 - 565
  • [3] Predicting the hydrogen release ability of LiBH4-based mixtures by ensemble machine learning
    Ding, Zhao
    Chen, Zhiqian
    Ma, Tianyi
    Lu, Chang-Tien
    Ma, Wenhui
    Shaw, Leon
    ENERGY STORAGE MATERIALS, 2020, 27 : 466 - 477
  • [4] Machine Learning Model for Predicting Walking Ability in Lower Limb Amputees
    Knezevic, Aleksandar
    Arsenovic, Jovana
    Garipi, Enis
    Platisa, Nedeljko
    Savic, Aleksandra
    Aleksandric, Tijana
    Popovic, Dunja
    Subic, Larisa
    Milenovic, Natasa
    Panic, Dusica Simic
    Budinski, Slavko
    Pasternak, Janko
    Manojlovic, Vladimir
    Knezevic, Milica Jeremic
    Radovic, Mirna Kapetina
    Jelicic, Zoran
    JOURNAL OF CLINICAL MEDICINE, 2024, 13 (22)
  • [5] A machine-learning ensemble model for predicting energy consumption in smart homes
    Priyadarshini, Ishaani
    Sahu, Sandipan
    Kumar, Raghvendra
    Taniar, David
    INTERNET OF THINGS, 2022, 20
  • [6] A robust and interpretable ensemble machine learning model for predicting healthcare insurance fraud
    Wang, Zeyu
    Chen, Xiaofang
    Wu, Yiwei
    Jiang, Linke
    Lin, Shiming
    Qiu, Gang
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [7] High-Resolution Planetscope Imagery and Machine Learning for Estimating Suspended Particulate Matter in the Ebinur Lake, Xinjiang, China
    Duan, Pan
    Zhang, Fei
    Liu, Changjiang
    Tan, Mou Leong
    Shi, Jingchao
    Wang, Weiwei
    Cai, Yunfei
    Kung, Hsiang-Te
    Yang, Shengtian
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 1019 - 1032
  • [8] Application of Discrete Wavelet Transform and Tree-Based Ensemble Machine Learning for Modeling of Particulate Matter Concentrations
    Stoimenova-Minova, Maya
    Gocheva-Ilieva, Snezhana
    Ivanov, Atanas
    MATHEMATICAL METHODS FOR ENGINEERING APPLICATIONS, ICMASE 2023, 2024, 439 : 171 - 183
  • [9] Predicting unrecognized enhancer-mediated genome topology by an ensemble machine learning model
    Tang, Li
    Hill, Matthew C.
    Wang, Jun
    Wang, Jianxin
    Martin, James F.
    Li, Min
    GENOME RESEARCH, 2020, 30 (12) : 1835 - 1845
  • [10] Extrapolation-enhanced model for travel decision making: An ensemble machine learning approach considering behavioral theory
    Gao, Kun
    Yang, Ying
    Zhang, Tianshu
    Li, Aoyong
    Qu, Xiaobo
    KNOWLEDGE-BASED SYSTEMS, 2021, 218