An Ensemble Machine Learning Model to Enhance Extrapolation Ability of Predicting Coarse Particulate Matter with High Resolutions in China

被引:0
|
作者
Shi, Su [1 ,2 ]
Chen, Renjie [1 ,2 ]
Wang, Peng [3 ]
Zhang, Hongliang [4 ]
Kan, Haidong [1 ,2 ]
Meng, Xia [1 ,2 ,5 ]
机构
[1] Fudan Univ, Sch Publ Hlth, Key Lab Publ Hlth Safety, Minist Educ, Shanghai 200433, Peoples R China
[2] Fudan Univ, Natl Hlth Commiss NHC Key Lab Hlth Technol Assessm, IRDR ICoE Risk Interconnect & Governance Weather C, Minist Hlth, Shanghai 200433, Peoples R China
[3] Fudan Univ, Inst Atmospher Sci, Dept Atmospher & Ocean Sci, Shanghai 200438, Peoples R China
[4] Fudan Univ, Dept Environm Sci & Engn, Shanghai 200438, Peoples R China
[5] Shanghai Typhoon Inst CMA, Shanghai Key Lab Meteorol & Hlth, Shanghai 200030, Peoples R China
基金
中国国家自然科学基金;
关键词
PM10-2.5; PM2.5; PM10; ensemble method; random forest; AEROSOL OPTICAL DEPTH; LAND-USE REGRESSION; AIR-POLLUTION; PM2.5; CONCENTRATIONS; REANALYSIS; PRODUCTS; PROVINCE; MERRA-2; PM10; PART;
D O I
10.1021/acs.est.4c08610
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate exposure assessment is important for conducting PM10-2.5-related epidemiological studies, which have been limited thus far. In this study, we aimed to develop an ensemble machine learning method to estimate PM10-2.5 concentrations in mainland China during 2013-2020. The study was conducted in two stages. In the first stage, we developed two methods: the indirect method refers to developing models for PM2.5 and PM10 separately and subsequently calculating PM10-2.5 as the difference between them; and the direct method refers to establishing a model between PM10-2.5 measurements and relevant predictors directly. In the second stage, we employed an ensemble method by integrating predictions from both indirect and direct methods. Internal and external cross-validation (CV) were performed to validate the extrapolation capacity of models. The ensemble method demonstrated enhanced extrapolation accuracy in both internal and external CV compared to indirect and direct methods. The predictions produced by the ensemble method captured the spatiotemporal pattern of PM10-2.5, even in the sand and dust storm seasons. Our study introduces an ensemble strategy leveraging the strengths of both indirect and direct methods to estimate PM10-2.5 concentrations, which holds significant potential to support future epidemiological studies to address knowledge gaps in understanding the health effects of PM10-2.5.
引用
收藏
页码:19325 / 19337
页数:13
相关论文
共 50 条
  • [21] Hybrid physically based and machine learning model to enhance high streamflow prediction
    Lopez-Chacon, Sergio Ricardo
    Salazar, Fernando
    Blade, Ernest
    HYDROLOGICAL SCIENCES JOURNAL, 2025, 70 (02) : 311 - 333
  • [22] Remote sensing inversion of suspended particulate matter in the estuary of the Pinglu Canal in China based on machine learning algorithms
    Mo, Jinying
    Tian, Yichao
    Wang, Jiale
    Zhang, Qiang
    Zhang, Yali
    Tao, Jin
    Lin, Junliang
    FRONTIERS IN MARINE SCIENCE, 2024, 11
  • [23] Machine learning model ensemble for predicting sugarcane yield through synergy of optical and SAR remote sensing
    Das, Ayan
    Kumar, Mukesh
    Kushwaha, Amit
    Dave, Rucha
    Dakhore, Kailash Kamaji
    Chaudhari, Karshan
    Bhattacharya, Bimal Kumar
    REMOTE SENSING APPLICATIONS-SOCIETY AND ENVIRONMENT, 2023, 30
  • [24] An ensemble tree-based machine learning model for predicting the uniaxial compressive strength of travertine rocks
    Barzegar, Rahim
    Sattarpour, Masoud
    Deo, Ravinesh
    Fijani, Elham
    Adamowski, Jan
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (13): : 9065 - 9080
  • [25] A machine learning ensemble model for predicting pavement conditions using automatic laser crack measurement data
    Bai, Lihui
    Zhang, Jie
    Zhu, Xuwen
    Alam, Md Morshedul
    Sun, Zhihui
    INTERNATIONAL JOURNAL OF PAVEMENT ENGINEERING, 2023, 24 (01)
  • [26] Development and comparative of a new meta-ensemble machine learning model in predicting construction labor productivity
    Karatas, Ibrahim
    Budak, Abdulkadir
    ENGINEERING CONSTRUCTION AND ARCHITECTURAL MANAGEMENT, 2024, 31 (03) : 1123 - 1144
  • [27] Predicting Divorce Prospect Using Ensemble Learning: Support Vector Machine, Linear Model, and Neural Network
    Fareed, Mian Muhammad Sadiq
    Raza, Ali
    Zhao, Na
    Tariq, Aqil
    Younas, Faizan
    Ahmed, Gulnaz
    Ullah, Saleem
    Jillani, Syeda Fizzah
    Abbas, Irfan
    Aslam, Muhammad
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [28] An ensemble tree-based machine learning model for predicting the uniaxial compressive strength of travertine rocks
    Rahim Barzegar
    Masoud Sattarpour
    Ravinesh Deo
    Elham Fijani
    Jan Adamowski
    Neural Computing and Applications, 2020, 32 : 9065 - 9080
  • [29] A Stacking Ensemble Learning Model Combining a Crop Simulation Model with Machine Learning to Improve the Dry Matter Yield Estimation of Greenhouse Pakchoi
    Wang, Chao
    Xu, Xiangying
    Zhang, Yonglong
    Cao, Zhuangzhuang
    Ullah, Ikram
    Zhang, Zhiping
    Miao, Minmin
    AGRONOMY-BASEL, 2024, 14 (08):
  • [30] A novel chlorophyll-a retrieval model based on suspended particulate matter classification and different machine learning
    Fang, Chong
    Song, Changchun
    Wen, Zhidan
    Liu, Ge
    Wang, Xiaodi
    Li, Sijia
    Shang, Yingxin
    Tao, Hui
    Lyu, Lili
    Song, Kaishan
    ENVIRONMENTAL RESEARCH, 2024, 240