Unmasking the sky: high-resolution PM2.5 prediction in Texas using machine learning techniques

Cited by: 0
Authors
Zhang, Kai [1]
Lin, Jeffrey [2]
Li, Yuanfei [3]
Sun, Yue [4]
Tong, Weitian [5]
Li, Fangyu [6]
Chien, Lung-Chang [7]
Yang, Yiping [2]
Su, Wei-Chung [6]
Tian, Hezhong [8, 9]
Fu, Peng [10, 11]
Qiao, Fengxiang [12]
Romeiko, Xiaobo Xue [1]
Lin, Shao [1]
Luo, Sheng [13]
Craft, Elena [14]
Affiliations
[1] SUNY Albany, Sch Publ Hlth, Dept Environm Hlth Sci, Rensselaer, NY 12144 USA
[2] Univ Texas Hlth Sci Ctr Houston, Sch Publ Hlth, Dept Biostat & Data Sci, Houston, TX USA
[3] Shanghai Univ, Asian Demog Res Inst, Shanghai, Peoples R China
[4] Clark Univ, Dept Int Dev Community & Environm, Worcester, MA USA
[5] Georgia Southern Univ, Dept Comp Sci, Statesboro, GA USA
[6] Univ Texas Hlth Sci Ctr, Dept Epidemiol Human Genet & Environm Sci, Sch Publ Hlth, Houston, TX USA
[7] Univ Nevada, Sch Publ Hlth, Dept Epidemiol & Biostat, Las Vegas, NV USA
[8] Beijing Normal Univ, Sch Environm, State Key Joint Lab Environm Simulat & Pollut Cont, Beijing, Peoples R China
[9] Beijing Normal Univ, Ctr Atmospher Environm Studies, Beijing, Peoples R China
[10] Univ Illinois, Dept Plant Biol, Urbana, IL USA
[11] Harrisburg Univ, Ctr Econ Environm & Energy, Harrisburg, PA USA
[12] Texas Southern Univ, Innovat Transportat Res Inst, Houston, TX USA
[13] Duke Univ, Dept Biostat & Bioinformat, Durham, NC USA
[14] Hlth Effects Inst, Boston, MA USA
Keywords
AOD; Gradient boosting; Machine learning; PM2.5; Random forest; FINE PARTICULATE MATTER; PRIVATELY INSURED POPULATION; BEIJING-TIANJIN-HEBEI; COMPONENTS; MODEL
DOI
10.1038/s41370-024-00659-w
Chinese Library Classification
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Background: Although PM2.5 (fine particulate matter with an aerodynamic diameter less than 2.5 μm) is an air pollutant of great concern in Texas, the limited number of regulatory monitors poses a significant challenge for decision-making and environmental studies.
Objective: This study aimed to predict daily PM2.5 concentrations at a fine spatial scale by using novel machine learning approaches and incorporating satellite-derived Aerosol Optical Depth (AOD) and a variety of weather and land-use variables.
Methods: We compiled a comprehensive dataset for Texas from 2013 to 2017, including ground-level PM2.5 concentrations from regulatory monitors; AOD values at 1-km resolution based on images retrieved from the MODIS satellite; and weather, land-use, and population density variables, among others. We built predictive models for each year separately to estimate PM2.5 concentrations using two machine learning approaches, gradient boosted trees and random forest. We evaluated model prediction performance using in-sample and out-of-sample validations.
Results: Our predictive models demonstrate excellent in-sample performance, as indicated by high R² values from the gradient boosting models (0.94-0.97) and random forest models (0.81-0.90). However, the out-of-sample R² values fall within a range of 0.52-0.75 for gradient boosting models and 0.44-0.69 for random forest models. Model performance varies slightly across years. A generally decreasing trend in predicted PM2.5 concentrations over time is observed in Eastern Texas.
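The gap between in-sample and out-of-sample R² reported in the abstract can be illustrated with a minimal sketch. This is not the authors' code. It uses synthetic data and a 1-nearest-neighbour predictor as a deliberately simple stand-in for the gradient boosted trees and random forest models. The single predictor, the noise level, the 75/25 hold-out split, and all function names are assumptions chosen only for illustration; the abstract does not describe how the out-of-sample validation was set up.

```python
import random


def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def nearest_neighbour_predict(train_x, train_y, query_x):
    """Stand-in model: return the target of the closest training point."""
    predictions = []
    for q in query_x:
        idx = min(range(len(train_x)), key=lambda i: abs(train_x[i] - q))
        predictions.append(train_y[idx])
    return predictions


rng = random.Random(0)

# Synthetic data (assumption): one AOD-like predictor and a noisy target.
x = [rng.uniform(0.0, 1.0) for _ in range(400)]
y = [5.0 + 10.0 * xi + rng.gauss(0.0, 2.0) for xi in x]

# Hold out the last 25% of the (already random) samples for validation.
train_x, test_x = x[:300], x[300:]
train_y, test_y = y[:300], y[300:]

# In-sample: score the model on the same data it was fitted to.
r2_in = r_squared(train_y, nearest_neighbour_predict(train_x, train_y, train_x))
# Out-of-sample: score the model on data it has never seen.
r2_out = r_squared(test_y, nearest_neighbour_predict(train_x, train_y, test_x))

print(f"in-sample R^2: {r2_in:.2f}, out-of-sample R^2: {r2_out:.2f}")
```

A model that can memorise its training data scores almost perfectly in-sample, so the out-of-sample value is the more honest estimate of how well predictions transfer to locations and days without monitors.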
Pages: 814-820
Number of pages: 7