Two-Stage Feature Engineering to Predict Air Pollutants in Urban Areas

Cited by: 0
Authors
Naz, Fareena [1]
Fahim, Muhammad [1]
Cheema, Adnan Ahmad [2]
Viet, Nguyen Trung [3]
Cao, Tuan-Vu [4]
Hunter, Ruth [5]
Duong, Trung Q. [1, 6]
Affiliations
[1] Queens Univ Belfast, Sch Elect Elect Engn & Comp Sci, Belfast BT7 1NN, North Ireland
[2] Ulster Univ, Sch Engn, Belfast BT15 1AP, North Ireland
[3] Thuyloi Univ, Hanoi, Vietnam
[4] Norwegian Inst Air Res, Oslo, Norway
[5] Queens Univ Belfast, Ctr Publ Hlth, Sch Med Dent & Biomed Sci, Belfast BT12 1NN, North Ireland
[6] Mem Univ Newfoundland, Fac Engn & Appl Sci, St John, NF A1C 5S7, Canada
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Predictive models; Air pollution; Long short term memory; Atmospheric modeling; Forecasting; Pollution; Time series analysis; Machine learning; Air quality; feature engineering; variational mode decomposition; machine learning; predictive model;
DOI
10.1109/ACCESS.2024.3443810
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Air pollution is a global challenge to human health and the ecological environment. Identifying the relationships among pollutants, their fundamental sources, and their detrimental effects on health and mental well-being is critical for implementing appropriate countermeasures. Accurate air pollution prediction is the way forward to address this issue and assess air quality. Such prediction can subsequently assist governing bodies in making prompt, evidence-based decisions and prevent further harm to our urban environment, public health, and climate, all of which co-benefit our economy. The main objective of this study is to explore the strength of features and to propose a two-stage feature engineering approach, which fuses the advantages of influential factors with a decomposition approach and generates an optimum feature combination for five major pollutants: Nitrogen Dioxide (NO2), Ozone (O3), Sulphur Dioxide (SO2), and Particulate Matter (PM2.5 and PM10). The experiments are conducted using a publicly available dataset covering 2015 to 2020, collected from Belfast-based air quality monitoring stations in Northern Ireland, UK. In stage 1, new features such as trigonometric and statistical features are created from the dataset to capture their dependency on the target pollutant, and correlation-inspired best feature combinations are generated to improve forecasting model performance. This is further enhanced in stage 2 by an optimum feature combination that integrates stage-1 features with Variational Mode Decomposition (VMD) based features. This study employs a simplified Long Short Term Memory (LSTM) neural network and proposes a single-step forecasting model to predict multivariate time series data. Three performance indicators are used to evaluate the effectiveness of the forecasting model: 1) root mean square error (RMSE), 2) mean absolute error (MAE), and 3) R-squared (R²). The results demonstrate the effectiveness of the proposed approach, with a 13% improvement in performance (in terms of R²) and the lowest error scores for both RMSE and MAE.
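As a rough illustration of the kinds of stage-1 features and performance indicators named in the abstract, the following minimal Python sketch shows a trigonometric (cyclical) time encoding, trailing-window statistical features, and the three metrics (RMSE, MAE, R²). It is not the authors' implementation: the function names, the choice of hour-of-day encoding, the window length, and the use of a population standard deviation are all assumptions made for illustration, and the VMD and LSTM components are not covered.

```python
import math
from statistics import mean, pstdev


def cyclical_encode(value, period):
    """Map a periodic value (e.g. hour of day, period 24) onto the unit circle.

    Hypothetical helper: the abstract mentions trigonometric features but
    does not say which time fields are encoded.
    """
    angle = 2.0 * math.pi * value / period
    return math.sin(angle), math.cos(angle)


def rolling_stats(series, window):
    """Trailing-window (mean, population std) for each position.

    Positions with fewer than `window` observations so far yield None.
    The window length is an assumption, not taken from the paper.
    """
    out = []
    for i in range(len(series)):
        if i + 1 < window:
            out.append(None)
        else:
            chunk = series[i + 1 - window : i + 1]
            out.append((mean(chunk), pstdev(chunk)))
    return out


def rmse(y_true, y_pred):
    """Root mean square error."""
    return math.sqrt(mean((t - p) ** 2 for t, p in zip(y_true, y_pred)))


def mae(y_true, y_pred):
    """Mean absolute error."""
    return mean(abs(t - p) for t, p in zip(y_true, y_pred))


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_bar = mean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - y_bar) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot
```

For example, `cyclical_encode(6, 24)` places 06:00 a quarter of the way around the daily cycle, so that 23:00 and 00:00 end up close together in feature space, which a raw hour value would not achieve.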
Pages: 114073 - 114085
Page count: 13
Related papers
50 in total
  • [31] Scene Classification Based on Two-Stage Deep Feature Fusion
    Liu, Yishu
    Liu, Yingbin
    Ding, Liwang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2018, 15 (02) : 183 - 186
  • [32] Two-stage classification with automatic feature selection for an industrial application
    Hader, S
    Hamprecht, FA
    Classification - the Ubiquitous Challenge, 2005, : 137 - 144
  • [33] A two-stage feature selection method for hob state recognition
    Jia, Yachao
    Li, Guolong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [34] A two-stage feature extraction for hyperspectral image data classification
    Chen, GS
    Ko, LW
    Kuo, BC
    Shih, SC
    IGARSS 2004: IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM PROCEEDINGS, VOLS 1-7: SCIENCE FOR SOCIETY: EXPLORING AND MANAGING A CHANGING PLANET, 2004, : 1212 - 1215
  • [35] A Two-Stage Feature Selection Method for Gene Expression Data
    Chuang, Li-Yeh
    Ke, Chao-Hsuan
    Chang, Hsueh-Wei
    Yang, Cheng-Hong
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2009, 13 (02) : 127 - 137
  • [36] Lidar Odometry and Mapping Based on Two-stage Feature Extraction
    Zhang, Shuaipeng
    Xiao, Liang
    Nie, Yining
    Dai, Bin
    Hu, Chaofang
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 3966 - 3971
  • [37] A Novel Two-Stage Selection of Feature Subsets in Machine Learning
    Kamala, F. Rosita
    Thangaiah, P. Ranjit Jeba
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2019, 9 (03) : 4169 - 4175
  • [38] A two-stage layout method for functional areas in logistics park
    Luo, Qingyu
    Zhu, Jiaxiang
    Jia, Hongfei
    Xu, Yingjun
    ADVANCES IN MECHANICAL ENGINEERING, 2019, 11 (03)
  • [39] A study of air pollutants and acute asthma exacerbations in urban areas: status report
    Luttinger, D
    Wilson, L
    ENVIRONMENTAL POLLUTION, 2003, 123 (03) : 399 - 402
  • [40] The effect of seasonal temperatures on the levels of air pollutants in rural and urban areas in Iraq
    Alallawi, Ahmed Ibrahim
    Hameed-Ameen, Attalah Maeedi
    Al-Jubouri, Khalf Ibraheem Khalf
    NATIVA, 2023, 11 (02) : 178 - 184