Unmasking the sky: high-resolution PM2.5 prediction in Texas using machine learning techniques

被引:0
|
作者
Zhang, Kai [1 ]
Lin, Jeffrey [2 ]
Li, Yuanfei [3 ]
Sun, Yue [4 ]
Tong, Weitian [5 ]
Li, Fangyu [6 ]
Chien, Lung-Chang [7 ]
Yang, Yiping [2 ]
Su, Wei-Chung [6 ]
Tian, Hezhong [8 ,9 ]
Fu, Peng [10 ,11 ]
Qiao, Fengxiang [12 ]
Romeiko, Xiaobo Xue [1 ]
Lin, Shao [1 ]
Luo, Sheng [13 ]
Craft, Elena [14 ]
机构
[1] SUNY Albany, Sch Publ Hlth, Dept Environm Hlth Sci, Rensselaer, NY 12144 USA
[2] Univ Texas Hlth Sci Ctr Houston, Sch Publ Hlth, Dept Biostat & Data Sci, Houston, TX USA
[3] Shanghai Univ, Asian Demog Res Inst, Shanghai, Peoples R China
[4] Clark Univ, Dept Int Dev Community & Environm, Worcester, MA USA
[5] Georgia Southern Univ, Dept Comp Sci, Statesboro, GA USA
[6] Univ Texas Hlth Sci Ctr, Dept Epidemiol Human Genet & Environm Sci, Sch Publ Hlth, Houston, TX USA
[7] Univ Nevada, Sch Publ Hlth, Dept Epidemiol & Biostat, Las Vegas, NV USA
[8] Beijing Normal Univ, Sch Environm, State Key Joint Lab Environm Simulat & Pollut Cont, Beijing, Peoples R China
[9] Beijing Normal Univ, Ctr Atmospher Environm Studies, Beijing, Peoples R China
[10] Univ Illinois, Dept Plant Biol, Urbana, IL USA
[11] Harrisburg Univ, Ctr Econ Environm & Energy, Harrisburg, PA USA
[12] Texas Southern Univ, Innovat Transportat Res Inst, Houston, TX USA
[13] Duke Univ, Dept Biostat & Bioinformat, Durham, NC USA
[14] Hlth Effects Inst, Boston, MA USA
关键词
AOD; Gradient boosting; Machine learning; PM2.5; Random forest; FINE PARTICULATE MATTER; PRIVATELY INSURED POPULATION; BEIJING-TIANJIN-HEBEI; RANDOM FOREST; COMPONENTS; MODEL; AOD;
D O I
10.1038/s41370-024-00659-w
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Background Although PM2.5 (fine particulate matter with an aerodynamic diameter less than 2.5 mu m) is an air pollutant of great concern in Texas, limited regulatory monitors pose a significant challenge for decision-making and environmental studies. Objective This study aimed to predict PM2.5 concentrations at a fine spatial scale on a daily basis by using novel machine learning approaches and incorporating satellite-derived Aerosol Optical Depth (AOD) and a variety of weather and land use variables. MethodsWe compiled a comprehensive dataset in Texas from 2013 to 2017, including ground-level PM2.5 concentrations from regulatory monitors; AOD values at 1-km resolution based on images retrieved from the MODIS satellite; and weather, land-use, population density, among others. We built predictive models for each year separately to estimate PM2.5 concentrations using two machine learning approaches called gradient boosted trees and random forest. We evaluated the model prediction performance using in-sample and out-of-sample validations. Results Our predictive models demonstrate excellent in-sample model performance, as indicated by high R-2 values generated from the gradient boosting models (0.94-0.97) and random forest models (0.81-0.90). However, the out-of-sample R-2 values fall within a range of 0.52-0.75 for gradient boosting models and 0.44-0.69 for random forest models. Model performance varies slightly across years. A generally decreasing trend in predicted PM2.5 concentrations over time is observed in Eastern Texas.
引用
收藏
页码:814 / 820
页数:7
相关论文
共 50 条
  • [21] High-resolution prediction of the spatial distribution of PM2.5 concentrations in China using a long short-term memory model
    Wang, Zhige
    Zhou, Yue
    Zhao, Ruiying
    Wang, Nan
    Biswas, Asim
    Shi, Zhou
    JOURNAL OF CLEANER PRODUCTION, 2021, 297
  • [22] High-resolution estimation of PM 2.5 concentrations across China using multiple machine learning approaches and model fusion
    Meng, Lingtong
    Xu, Xiangqing
    Huang, Xiaona
    Li, Xinju
    Chang, Xiaoyan
    Xu, Dongyun
    ATMOSPHERIC POLLUTION RESEARCH, 2024, 15 (06)
  • [23] Explore spatio-temporal PM2.5 features in northern Taiwan using machine learning techniques
    Chang, Fi-John
    Chang, Li-Chiu
    Kang, Che-Chia
    Wang, Yi-Shin
    Huang, Angela
    SCIENCE OF THE TOTAL ENVIRONMENT, 2020, 736
  • [24] Predicting PM2.5 Concentrations Across USA Using Machine Learning
    Vignesh, P. Preetham
    Jiang, Jonathan H.
    Kishore, P.
    EARTH AND SPACE SCIENCE, 2023, 10 (10)
  • [25] A nested machine learning approach to short-term PM2.5 prediction in metropolitan areas using PM2.5 data from different sensor networks
    Li, Jing
    Crooks, James
    Murdock, Jennifer
    de Souza, Priyanka
    Hohs, Kirk
    Obermann, Bill
    Stockman, Tehya
    SCIENCE OF THE TOTAL ENVIRONMENT, 2023, 873
  • [26] Estimation of PM2.5 concentrations with high spatiotemporal resolution in Beijing using the ERA5 dataset and machine learning models
    Wang, Zhihao
    Chen, Peng
    Wang, Rong
    An, Zhiyuan
    Qiu, Liangcai
    ADVANCES IN SPACE RESEARCH, 2023, 71 (08) : 3150 - 3165
  • [27] A machine learning-based framework for high resolution mapping of PM2.5 in Tehran, Iran, using MAIAC AOD data
    Bagheri, Hossein
    ADVANCES IN SPACE RESEARCH, 2022, 69 (09) : 3333 - 3349
  • [28] Unveiling PM2.5 sources: Double and tracer conjugate PMF approaches for high-resolution organic, BC, and inorganic PM2.5 data
    Faisal, Mohd
    Ali, Umer
    Kumar, Ajit
    Kumar, Mayank
    Singh, Vikram
    Atmospheric Environment, 2025, 343
  • [29] The Prediction of PM2.5 Concentration Using Transfer Learning Based on ADGRU
    Xinbiao Lu
    Chunlin Ye
    Miaoxuan Shan
    Buzhi Qin
    Ying Wang
    Hao Xing
    Xupeng Xie
    Zecheng Liu
    Water, Air, & Soil Pollution, 2023, 234
  • [30] The Prediction of PM2.5 Concentration Using Transfer Learning Based on ADGRU
    Lu, Xinbiao
    Ye, Chunlin
    Shan, Miaoxuan
    Qin, Buzhi
    Wang, Ying
    Xing, Hao
    Xie, Xupeng
    Liu, Zecheng
    WATER AIR AND SOIL POLLUTION, 2023, 234 (04):