Enhancing Pollen Prediction in Beijing, a Chinese Megacity: Leveraging Ensemble Learning Models for Greater Accuracy

被引:0
|
作者
Ruan, Wenxi [1 ,2 ]
Li, Ziming [3 ]
Sun, Zhaobin [1 ,2 ]
An, Xingqin [1 ,2 ]
Zhao, Yuxin [1 ,2 ]
Zhang, Shuwen [4 ]
Liang, Yinglin [5 ]
Bu, Yaqin [6 ]
Xin, Jingyi [7 ]
Hang, Xiaoyi [7 ]
机构
[1] Chinese Acad Meteorol Sci, State Key Lab Severe Weather, Beijing 100081, Peoples R China
[2] Chinese Acad Meteorol Sci, Key Lab Atmospher Chem, CMA, Beijing 100081, Peoples R China
[3] Beijing Weather Forecast Ctr, Beijing 100089, Peoples R China
[4] Nanjing Univ Chinese Med, Coll Tradit Chinese Med, Nanjing 210023, Peoples R China
[5] Chengdu Univ Informat Technol, Sch Atmospher Sci, Chengdu 610225, Peoples R China
[6] Lanzhou Univ, Coll Earth & Environm Sci, Key Lab Western Chinas Environm Syst, Minist Educ, Lanzhou 730000, Peoples R China
[7] Beijing Univ Chinese Med, Sch Tradit Chinese Med, Beijing 100029, Peoples R China
基金
中国国家自然科学基金;
关键词
Machine learning; Forecasting; Pollen concentrations; Lead time; Time series analysis; AIRBORNE POLLEN; CLIMATE-CHANGE; CORYLUS; ALNUS;
D O I
10.4209/aaqr.240123
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In North China, pollen stands as a leading allergen responsible for allergic rhinitis, with climate change exacerbating allergenic pollen sensitization and posing significant health risks to residents. Despite its critical importance, pollen forecasting technology is still not sufficiently optimized. This study leverages multi-year daily pollen concentration observations and ECMWF (European Centre for Medium-Range Weather Forecasts) real-time forecast data, applying twelve machine learning models to learn perturbations separated from characteristic quantities. Specifically, it forecasts pollen concentrations in Beijing, utilizing R 2 and RMSE as evaluation metrics. The findings reveal that the CatBoost, Extra Trees, and XGBoost algorithms perform well for three-day consecutive pollen predictions. Specifically, when considering a one-day prediction period, the R 2 values for these algorithms are 0.72, 0.73, and 0.73, respectively. In contrast, algorithms such as Neural Network, LightGBM, and K-nearest Neighbor demonstrate weaker performance, though all models except Neural NetTorch achieve R 2 values above 0.50. Notably, the prediction accuracy of Neural NetTorch significantly improves with extended prediction time, with its R 2 increasing from 0.34 to 0.67 as the prediction period extends from one day to three days. The Weighted Ensemble model, which adjusts other models based on weighted optimization to mitigate excessive peaks, consistently yields stable results with an R 2 exceeding 0.67. Furthermore, the study assesses the importance of feature groups within the model, indicating that pollen emission intensity and phenological characteristics are crucial for both training and testing phases, whereas meteorological factors predominantly influence pollen dispersion. Given the strong impact of meteorological conditions and nonlinear regulation on pollen, a type of bioaerosol, machine learning demonstrates substantial potential for simulating and predicting its concentrations.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Enhancing Machine Learning based QoE Prediction by Ensemble Models
    Casas, Pedro
    Seufert, Michael
    Wehner, Nikolas
    Schwind, Anika
    Wamser, Florian
    2018 IEEE 38TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2018, : 1642 - 1647
  • [2] An Ensemble Machine Learning Model for Enhancing the Prediction Accuracy of Energy Consumption in Buildings
    Ngoc-Tri Ngo
    Anh-Duc Pham
    Thi Thu Ha Truong
    Ngoc-Son Truong
    Nhat-To Huynh
    Tuan Minh Pham
    Arabian Journal for Science and Engineering, 2022, 47 : 4105 - 4117
  • [3] An Ensemble Machine Learning Model for Enhancing the Prediction Accuracy of Energy Consumption in Buildings
    Ngoc-Tri Ngo
    Anh-Duc Pham
    Thi Thu Ha Truong
    Ngoc-Son Truong
    Nhat-To Huynh
    Tuan Minh Pham
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (04) : 4105 - 4117
  • [4] Enhancing stormwater network overflow prediction: investigation of ensemble learning models
    Boughandjioua, Samira
    Laouacheria, Fares
    Azizi, Nabiha
    ACTA GEOPHYSICA, 2025, 73 (01) : 875 - 899
  • [5] Effects of pollen concentration on allergic rhinitis in children: A retrospective study from Beijing, a Chinese megacity
    Zhao, Yuxin
    Sun, Zhaobin
    Xiang, Li
    An, Xingqin
    Hou, Xiaoling
    Shang, Jing
    Han, Ling
    Ye, Caihua
    ENVIRONMENTAL RESEARCH, 2023, 229
  • [6] Enhancing prediction accuracy of concrete compressive strength using stacking ensemble machine learning
    Zhao, Yunpeng
    Goulias, Dimitrios
    Saremi, Setare
    COMPUTERS AND CONCRETE, 2023, 32 (03): : 233 - 246
  • [7] Credit scoring prediction leveraging interpretable ensemble learning
    Liu, Yang
    Huang, Fei
    Ma, Lili
    Zeng, Qingguo
    Shi, Jiale
    JOURNAL OF FORECASTING, 2024, 43 (02) : 286 - 308
  • [8] Leveraging advanced ensemble models to increase building energy performance prediction accuracy in the residential building sector
    Konhaeuser, Koray
    Wenninger, Simon
    Werner, Tim
    Wiethe, Christian
    ENERGY AND BUILDINGS, 2022, 269
  • [9] Enhancing accuracy of membrane fouling prediction using hybrid machine learning models
    Lim, Seung Ji
    Kim, Young Mi
    Park, Hosik
    Ki, Seojin
    Jeong, Kwanho
    Seo, Jangwon
    Chae, Sung Ho
    Kim, Joon Ha
    DESALINATION AND WATER TREATMENT, 2019, 146 : 22 - 28
  • [10] Enhancing accuracy of membrane fouling prediction using hybrid machine learning models
    Lim, Seung Ji
    Kim, Young Mi
    Park, Hosik
    Ki, Seojin
    Jeong, Kwanho
    Seo, Jangwon
    Chae, Sung Ho
    Kim, Joon Ha
    DESALINATION AND WATER TREATMENT, 2022, 270 : 320 - 320