Unmasking the sky: high-resolution PM2.5 prediction in Texas using machine learning techniques

Cited by: 0
Authors
Zhang, Kai [1]
Lin, Jeffrey [2]
Li, Yuanfei [3]
Sun, Yue [4]
Tong, Weitian [5]
Li, Fangyu [6]
Chien, Lung-Chang [7]
Yang, Yiping [2]
Su, Wei-Chung [6]
Tian, Hezhong [8, 9]
Fu, Peng [10, 11]
Qiao, Fengxiang [12]
Romeiko, Xiaobo Xue [1]
Lin, Shao [1]
Luo, Sheng [13]
Craft, Elena [14]
Affiliations
[1] SUNY Albany, Sch Publ Hlth, Dept Environm Hlth Sci, Rensselaer, NY 12144 USA
[2] Univ Texas Hlth Sci Ctr Houston, Sch Publ Hlth, Dept Biostat & Data Sci, Houston, TX USA
[3] Shanghai Univ, Asian Demog Res Inst, Shanghai, Peoples R China
[4] Clark Univ, Dept Int Dev Community & Environm, Worcester, MA USA
[5] Georgia Southern Univ, Dept Comp Sci, Statesboro, GA USA
[6] Univ Texas Hlth Sci Ctr, Dept Epidemiol Human Genet & Environm Sci, Sch Publ Hlth, Houston, TX USA
[7] Univ Nevada, Sch Publ Hlth, Dept Epidemiol & Biostat, Las Vegas, NV USA
[8] Beijing Normal Univ, Sch Environm, State Key Joint Lab Environm Simulat & Pollut Cont, Beijing, Peoples R China
[9] Beijing Normal Univ, Ctr Atmospher Environm Studies, Beijing, Peoples R China
[10] Univ Illinois, Dept Plant Biol, Urbana, IL USA
[11] Harrisburg Univ, Ctr Econ Environm & Energy, Harrisburg, PA USA
[12] Texas Southern Univ, Innovat Transportat Res Inst, Houston, TX USA
[13] Duke Univ, Dept Biostat & Bioinformat, Durham, NC USA
[14] Hlth Effects Inst, Boston, MA USA
Keywords
AOD; Gradient boosting; Machine learning; PM2.5; Random forest; Fine particulate matter; Privately insured population; Beijing-Tianjin-Hebei; Components; Model
DOI
10.1038/s41370-024-00659-w
Chinese Library Classification
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Background: Although PM2.5 (fine particulate matter with an aerodynamic diameter of less than 2.5 μm) is an air pollutant of great concern in Texas, the limited number of regulatory monitors poses a significant challenge for decision-making and environmental studies.
Objective: This study aimed to predict daily PM2.5 concentrations at a fine spatial scale by using novel machine learning approaches and incorporating satellite-derived Aerosol Optical Depth (AOD) and a variety of weather and land-use variables.
Methods: We compiled a comprehensive dataset for Texas from 2013 to 2017, including ground-level PM2.5 concentrations from regulatory monitors; AOD values at 1-km resolution derived from MODIS satellite images; and weather, land-use, and population density data, among others. We built predictive models for each year separately to estimate PM2.5 concentrations using two machine learning approaches, gradient boosted trees and random forest. We evaluated model prediction performance using in-sample and out-of-sample validation.
Results: Our predictive models demonstrate excellent in-sample performance, as indicated by high R² values from the gradient boosting models (0.94-0.97) and random forest models (0.81-0.90). However, the out-of-sample R² values range from 0.52 to 0.75 for the gradient boosting models and from 0.44 to 0.69 for the random forest models. Model performance varies slightly across years. A generally decreasing trend in predicted PM2.5 concentrations over time is observed in Eastern Texas.
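The gap between in-sample and out-of-sample R² reported in the abstract can be illustrated with a minimal sketch. This is not the authors' code or data: it fits a small gradient-boosted ensemble of one-split regression trees (stumps) to synthetic data, one model per hypothetical year, and scores each model on its training rows and on a simple hold-out set. The feature names (aod, temp, wind), the data-generating formula, the 80/20 split, and all hyperparameters are assumptions made purely for illustration; the record does not state the validation scheme or model settings used in the study.

```python
import random

def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

def fit_stump(X, residuals):
    """Find the single split (feature, threshold) that minimizes squared error."""
    n, total, best = len(X), sum(residuals), None
    for j in range(len(X[0])):
        order = sorted(range(n), key=lambda i: X[i][j])
        left_sum = 0.0
        for k in range(1, n):
            left_sum += residuals[order[k - 1]]
            thr = X[order[k - 1]][j]
            if X[order[k]][j] == thr:
                continue  # cannot split between identical feature values
            right_sum = total - left_sum
            # Maximizing this quantity is equivalent to minimizing squared error.
            gain = left_sum ** 2 / k + right_sum ** 2 / (n - k)
            if best is None or gain > best[0]:
                best = (gain, j, thr, left_sum / k, right_sum / (n - k))
    return best[1:]

def fit_gbt(X, y, n_rounds=200, learning_rate=0.1):
    """Gradient boosting for squared loss: each stump is fit to current residuals."""
    base = sum(y) / len(y)
    pred, stumps = [base] * len(y), []
    for _ in range(n_rounds):
        residuals = [t - p for t, p in zip(y, pred)]
        j, thr, left, right = fit_stump(X, residuals)
        stumps.append((j, thr, left, right))
        pred = [p + learning_rate * (left if row[j] <= thr else right)
                for p, row in zip(pred, X)]
    return base, stumps, learning_rate

def predict_gbt(model, X):
    base, stumps, lr = model
    return [base + lr * sum(l if row[j] <= thr else r for j, thr, l, r in stumps)
            for row in X]

def make_year(rng, n=300):
    """Synthetic stand-in for one year of monitor-days (NOT the study data)."""
    X, y = [], []
    for _ in range(n):
        aod, temp, wind = rng.uniform(0, 1), rng.uniform(0, 35), rng.uniform(0, 10)
        X.append([aod, temp, wind])
        y.append(5 + 20 * aod + 0.1 * temp - 0.5 * wind + rng.gauss(0, 2))
    return X, y

rng = random.Random(0)
results = {}
for year in (2013, 2014):  # one model per year, as the abstract describes
    X, y = make_year(rng)
    cut = int(0.8 * len(X))  # rows are generated in random order; last 20% held out
    model = fit_gbt(X[:cut], y[:cut])
    results[year] = (
        r2_score(y[:cut], predict_gbt(model, X[:cut])),  # in-sample
        r2_score(y[cut:], predict_gbt(model, X[cut:])),  # out-of-sample
    )
    print(year, "in-sample R2 = %.2f, out-of-sample R2 = %.2f" % results[year])
```

In-sample R² is computed on rows the model has already seen, so it tends to overstate predictive skill; the hold-out score is the more realistic indicator of how well predictions transfer to unmonitored locations or days. In practice a library implementation such as a gradient boosting or random forest regressor with cross-validation would replace the hand-written stumps here.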
Pages: 814-820
Number of pages: 7