Feature selection for global tropospheric ozone prediction based on the BO-XGBoost-RFE algorithm

被引:22
|
作者
Zhang, Biao [1 ]
Zhang, Ying [2 ]
Jiang, Xuchu [2 ]
机构
[1] Liaocheng Univ, Sch Comp Sci, Liaocheng 252000, Shandong, Peoples R China
[2] Zhongnan Univ Econ & Law, Sch Stat & Math, Wuhan 430073, Peoples R China
关键词
AIR-QUALITY; CHEMISTRY;
D O I
10.1038/s41598-022-13498-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Ozone is one of the most important air pollutants, with significant impacts on human health, regional air quality and ecosystems. In this study, we use geographic information and environmental information of the monitoring site of 5577 regions in the world from 2010 to 2014 as feature input to predict the long-term average ozone concentration of the site. A Bayesian optimization-based XGBoost-RFE feature selection model BO-XGBoost-RFE is proposed, and a variety of machine learning algorithms are used to predict ozone concentration based on the optimal feature subset. Since the selection of the underlying model hyperparameters is involved in the recursive feature selection process, different hyperparameter combinations will lead to differences in the feature subsets selected by the model, so that the feature subsets obtained by the model may not be optimal solutions. We combine the Bayesian optimization algorithm to adjust the parameters of recursive feature elimination based on XGBoost to obtain the optimal parameter combination and the optimal feature subset under the parameter combination. Experiments on long-term ozone concentration prediction on a global scale show that the prediction accuracy of the model after Bayesian optimized XGBoost-RFE feature selection is higher than that based on all features and on feature selection with Pearson correlation. Among the four prediction models, random forest obtained the highest prediction accuracy. The XGBoost prediction model achieved the greatest improvement in accuracy.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Feature selection for global tropospheric ozone prediction based on the BO-XGBoost-RFE algorithm
    Biao Zhang
    Ying Zhang
    Xuchu Jiang
    Scientific Reports, 12
  • [2] Feature selection algorithm based on XGBoost
    Li Z.
    Liu Z.
    Tongxin Xuebao/Journal on Communications, 2019, 40 (10): : 101 - 108
  • [3] FEATURE SELECTION FOR THE PREDICTION OF TROPOSPHERIC OZONE CONCENTRATION USING A WRAPPER METHOD
    Sakar, C. Okan
    Demir, Goksel
    Kursun, Olcay
    Ozdemir, Huseyin
    Altay, Gokmen
    Yalcin, Senay
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2011, 17 (04): : 403 - 413
  • [4] Prediction of tropospheric ozone using artificial neural network (ANN) and feature selection techniques
    Kapadia, Drashti
    Jariwala, Namrata
    MODELING EARTH SYSTEMS AND ENVIRONMENT, 2022, 8 (02) : 2183 - 2192
  • [5] Prediction of tropospheric ozone using artificial neural network (ANN) and feature selection techniques
    Drashti Kapadia
    Namrata Jariwala
    Modeling Earth Systems and Environment, 2022, 8 : 2183 - 2192
  • [6] PREDICTION OF HYPERTENSION RISKS WITH FEATURE SELECTION AND XGBOOST
    Peng, Yan
    Xu, Jing
    Ma, Ling
    Wang, Jie
    JOURNAL OF MECHANICS IN MEDICINE AND BIOLOGY, 2021, 21 (05)
  • [7] Mapping of Soil pH Based on SVM-RFE Feature Selection Algorithm
    Guo, Jia
    Wang, Ku
    Jin, Shaofei
    AGRONOMY-BASEL, 2022, 12 (11):
  • [8] Based on the RFE-LSTM Ozone Prediction Research
    Ji, Wenquan
    Ren, Ge
    Han, Mengjuan
    Lin, Hong
    2024 9TH INTERNATIONAL CONFERENCE ON ELECTRONIC TECHNOLOGY AND INFORMATION SCIENCE, ICETIS 2024, 2024, : 773 - 776
  • [9] Research on Quality Prediction of Typical Workpieces Based on Feature Recombination and XGBoost Algorithm
    Fan, Yaoyao
    Liu, Yahui
    Wang, Xingfen
    Xu, Zhulu
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 3239 - 3245
  • [10] Modified Genetic Algorithm for Feature Selection and Hyper Parameter Optimization: Case of XGBoost in Spam Prediction
    Ghatasheh, Nazeeh
    Altaharwa, Ismail
    Aldebei, Khaled
    IEEE ACCESS, 2022, 10 : 84365 - 84383