Two-Stage Feature Engineering to Predict Air Pollutants in Urban Areas

被引:0
|
作者
Naz, Fareena [1 ]
Fahim, Muhammad [1 ]
Cheema, Adnan Ahmad [2 ]
Viet, Nguyen Trung [3 ]
Cao, Tuan-Vu [4 ]
Hunter, Ruth [5 ]
Duong, Trung Q. [1 ,6 ]
机构
[1] Queens Univ Belfast, Sch Elect Elect Engn & Comp Sci, Belfast BT7 1NN, North Ireland
[2] Ulster Univ, Sch Engn, Belfast BT15 1AP, North Ireland
[3] Thuyloi Univ, Hanoi, Vietnam
[4] Norwegian Inst Air Res, Oslo, Norway
[5] Queens Univ Belfast, Ctr Publ Hlth, Sch Med Dent & Biomed Sci, Belfast BT12 1NN, North Ireland
[6] Mem Univ Newfoundland, Fac Engn & Appl Sci, St John, NF A1C 5S7, Canada
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Predictive models; Air pollution; Long short term memory; Atmospheric modeling; Forecasting; Pollution; Time series analysis; Machine learning; Air quality; feature engineering; variational mode decomposition; machine learning; predictive model;
D O I
10.1109/ACCESS.2024.3443810
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Air pollution is a global challenge to human health and the ecological environment. Identifying the relationship among pollutants, their fundamental sources and detrimental effects on health and mental well-being is critical in order to implement appropriate countermeasures. The way forward to address this issue and assess air quality is through accurate air pollution prediction. Such prediction can subsequently assist governing bodies in making prompt, evidence-based decisions and prevent further harm to our urban environment, public health, and climate, all of which co-benefit our economy. In this study, the main objective is to explore the strength of features and proposed a two stage feature engineering approach, which fuses the advantage of influential factors along with the decomposition approach and generates an optimum feature combination for five major pollutants including Nitrogen Dioxide (NO2), Ozone (O-3), Sulphur Dioxide (SO2), and Particulate Matter (PM2.5, and PM10). The experiments are conducted using a dataset from 2015 to 2020 which is publicly available and is collected from Belfast-based air quality monitoring stations in Northern Ireland, UK. In stage-1, using the dataset new features such as trigonometric and statistical features are created to capture their dependency on the target pollutant and generated correlation-inspired best feature combinations to improve forecasting model performance. This is further enhanced in stage-2 by an optimum feature combination which is an integration of stage-1 and Variational Mode Decomposition (VMD) based features. This study employed a simplified Long Short Term Memory (LSTM) neural network and proposed a single-step forecasting model to predict multivariate time series data. Three performance indicators are used to evaluate the effectiveness of forecasting model: 1) root mean square error (RMSE), 2) mean absolute error (MAE), and 3) R-squared (R-2). The results demonstrate the effectiveness of proposed approach with 13% improvement in performance (in terms of R-2) and the lowest error scores for both RMSE and MAE.
引用
收藏
页码:114073 / 114085
页数:13
相关论文
共 50 条
  • [1] Two-Stage Feature Selection with Unsupervised Second Stage
    Xu, Ke
    Arai, Hiromasa
    Maung, Crystal
    Schweitzer, Haim
    2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 153 - 159
  • [2] Two-Stage Feature Selection with Unsupervised Second Stage
    Xu, Ke
    Maung, Crystal
    Arai, Hiromasa
    Schweitzer, Haim
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2018, 27 (07)
  • [3] A Robust Two-Stage Model for the Urban Air Mobility Flight Scheduling Problem
    Portoleau, Tom
    D'Ambrosio, Claudia
    COMBINATORIAL OPTIMIZATION, ISCO 2024, 2024, 14594 : 348 - 360
  • [4] Two-Stage Feature Selection for Text Classification
    Ozgur, Levent
    Gungor, Tunga
    INFORMATION SCIENCES AND SYSTEMS 2015, 2016, 363 : 329 - 337
  • [5] Two-Stage Method for Clothing Feature Detection
    Lyu, Xinwei
    Li, Xinjia
    Zhang, Yuexin
    Lu, Wenlian
    BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (04)
  • [6] A two-stage sampling for robust feature matching
    Chou, Chih-Chung
    Seo, Young Woo
    Wang, Chieh-Chih
    JOURNAL OF FIELD ROBOTICS, 2018, 35 (05) : 779 - 801
  • [7] Comparison of dispersion models for vehicular air pollutants in urban areas
    Michail, A.
    Basbas, S.
    Nikolaou, K.
    Proceeding of the 9th International Conference on Environmental Science and Technology Vol B - Poster Presentations, 2005, : B611 - B616
  • [8] Assessment of air pollutants in urban recreation areas of Fortaleza city
    Moura de Oliveira, Mona Lisa
    Porfirio Sampaio Lopes, Mauro Henrique
    Policarpo, Nara Angelica
    Aguiar da Costa Alves, Camila Maria
    Araujo, Rinaldo dos Santos
    Avila Cavalcante, Francisco Sales
    URBE-REVISTA BRASILEIRA DE GESTAO URBANA, 2019, 11
  • [9] A Two-Stage Feature Extraction Approach for ECG Signals
    Houssein, Essam H.
    Kilany, Moataz
    Hassanien, Aboul Ella
    Snasel, Vaclav
    PROCEEDINGS OF THE THIRD INTERNATIONAL AFRO-EUROPEAN CONFERENCE FOR INDUSTRIAL ADVANCEMENT-AECIA 2016, 2018, 565 : 299 - 310
  • [10] A two-stage feature selection method for text categorization
    Meng, Jiana
    Lin, Hongfei
    Yu, Yuhai
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2011, 62 (07) : 2793 - 2800