Machine learning-based ozone and PM2.5 forecasting: Application to multiple AQS sites in the Pacific Northwest

被引:2
|
作者
Fan, Kai [1 ,2 ,3 ]
Dhammapala, Ranil [4 ]
Harrington, Kyle [5 ]
Lamb, Brian [3 ]
Lee, Yunha [1 ,2 ,3 ]
机构
[1] Ctr Adv Syst Understanding, Gorlitz, Germany
[2] Helmholtz Zentrum Dresden Rossendorf, Dresden, Germany
[3] Washington State Univ, Dept Civil & Environm Engn, Lab Atmospher Res, Pullman, WA 99164 USA
[4] South Coast Air Qual Management Dist, Diamond Bar, CA USA
[5] Max Delbruck Ctr Mol Med, Berlin, Germany
来源
FRONTIERS IN BIG DATA | 2023年 / 6卷
关键词
machine learning; air quality forecasts; ozone; PM2; 5; random forest; multiple linear regression; RANDOM FOREST; PREDICTION; MESOSCALE;
D O I
10.3389/fdata.2023.1124148
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Air quality in the Pacific Northwest (PNW) of the U.S has generally been good in recent years, but unhealthy events were observed due to wildfires in summer or wood burning in winter. The current air quality forecasting system, which uses chemical transport models (CTMs), has had difficulty forecasting these unhealthy air quality events in the PNW. We developed a machine learning (ML) based forecasting system, which consists of two components, ML1 (random forecast classifiers and multiple linear regression models) and ML2 (two-phase random forest regression model). Our previous study showed that the ML system provides reliable forecasts of O-3 at a single monitoring site in Kennewick, WA. In this paper, we expand the ML forecasting system to predict both O-3 in the wildfire season and PM2.5 in wildfire and cold seasons at all available monitoring sites in the PNW during 2017-2020, and evaluate our ML forecasts against the existing operational CTM-based forecasts. For O-3, both ML1 and ML2 are used to achieve the best forecasts, which was the case in our previous study: ML2 performs better overall (R-2 = 0.79), especially for low-O-3 events, while ML1 correctly captures more high-O-3 events. Compared to the CTM-based forecast, our O-3 ML forecasts reduce the normalized mean bias (NMB) from 7.6 to 2.6% and normalized mean error (NME) from 18 to 12% when evaluating against the observation. For PM2.5, ML2 performs the best and thus is used for the final forecasts. Compared to the CTM-based PM2.5, ML2 clearly improves PM2.5 forecasts for both wildfire season (May to September) and cold season (November to February): ML2 reduces NMB (-27 to 7.9% for wildfire season; 3.4 to 2.2% for cold season) and NME (59 to 41% for wildfires season; 67 to 28% for cold season) significantly and captures more high-PM2.5 events correctly. Our ML air quality forecast system requires fewer computing resources and fewer input datasets, yet it provides more reliable forecasts than (if not, comparable to) the CTM-based forecast. It demonstrates that our ML system is a low-cost, reliable air quality forecasting system that can support regional/local air quality management.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] A Machine Learning-Based Ensemble Framework for Forecasting PM2.5 Concentrations in Puli, Taiwan
    Yin, Peng-Yeng
    Yen, Alex Yaning
    Chao, Shou-En
    Day, Rong-Fuh
    Bhanu, Bir
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (05):
  • [2] A Study on Machine Learning-Based Approaches for PM2.5 Prediction
    Lakshmi, V. Santhana
    Vijaya, M. S.
    [J]. SUSTAINABLE COMMUNICATION NETWORKS AND APPLICATION, ICSCN 2021, 2022, 93 : 163 - 175
  • [3] Enhancement and evaluation of the AIRPACT ozone and PM2.5 forecast system for the Pacific Northwest
    Chen, Jack
    Vaughan, Joe
    Avise, Jeremy
    O'Neill, Susan
    Lamb, Brian
    [J]. JOURNAL OF GEOPHYSICAL RESEARCH-ATMOSPHERES, 2008, 113 (D14)
  • [4] Forecasting Ozone and PM2.5 Pollution Potentials Using Machine Learning Algorithms: A Case Study in Chengdu
    Wang, Xinlu
    Huang, Ran
    Zhang, Wenxian
    Lü, Baolei
    Du, Yunsong
    Zhang, Wei
    Li, Bolan
    Hu, Yongtao
    [J]. Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2021, 57 (05): : 938 - 950
  • [5] A Machine Learning Based PM2.5 Forecasting Framework Using Internet of Environmental Things
    Mahajan, Sachit
    Liu, Hao-Min
    Chen, Ling-Jyh
    Tsai, Tzu-Chieh
    [J]. IOT AS A SERVICE, IOTAAS 2017, 2018, 246 : 170 - 176
  • [6] A deep learning-based PM2.5 concentration estimator
    Sun, Kezheng
    Tang, Lijuan
    Qian, JianSheng
    Wang, Guangcheng
    Lou, Cairong
    [J]. DISPLAYS, 2021, 69
  • [7] A new hybrid deep neural network for multiple sites PM2.5 forecasting
    Teng, Mengfan
    Li, Siwei
    Yang, Jie
    Chen, Jiarui
    Fan, Chunying
    Ding, Yu
    [J]. JOURNAL OF CLEANER PRODUCTION, 2024, 473
  • [8] A new hybrid PM2.5 volatility forecasting model based on EMD and machine learning algorithms
    Wang, Ping
    Bi, Xu
    Zhang, Guisheng
    Yu, Mengjiao
    [J]. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2023, 30 (34) : 82878 - 82894
  • [9] A machine learning-based model to estimate PM2.5 concentration levels in Delhi's atmosphere
    Kumar, Saurabh
    Mishra, Shweta
    Singh, Sunil Kumar
    [J]. HELIYON, 2020, 6 (11)
  • [10] Evaluation of Different Machine Learning Approaches to Forecasting PM2.5 Mass Concentrations
    Karimian, Hamed
    Li, Qi
    Wu, Chunlin
    Qi, Yanlin
    Mo, Yuqin
    Chen, Gong
    Zhang, Xianfeng
    Sachdeva, Sonali
    [J]. AEROSOL AND AIR QUALITY RESEARCH, 2019, 19 (06) : 1400 - 1410