GIS-based air quality modelling: spatial prediction of PM10 for Selangor State, Malaysia using machine learning algorithms

Cited by: 20
Authors
Tella, Abdulwaheed [1]
Balogun, Abdul-Lateef [1]
Affiliations
[1] Univ Teknol PETRONAS, Dept Civil & Environm Engn, Geospatial Anal & Modelling GAM Res Lab, Seri Iskandar 32610, Perak, Malaysia
Keywords
GIS; Air quality modelling; Spatial prediction; PM10; Machine learning algorithms (MLAs); Selangor State; RANDOM-FOREST; LOGISTIC-REGRESSION; PM2.5; PATTERN; XGBOOST; AREA; HAZE;
DOI
10.1007/s11356-021-16150-0
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science];
Discipline classification code
08 ; 0830 ;
Abstract
Rapid urbanization has caused severe deterioration of air quality globally, leading to increased hospitalization and premature deaths. Therefore, accurate prediction of air quality is crucial for mitigation planning to support urban sustainability and resilience. Although some studies have predicted air pollutants such as particulate matter (PM) using machine learning algorithms (MLAs), there is a paucity of studies on spatial hazard assessment with respect to the air quality index (AQI). Incorporating PM in AQI studies is crucial because of its easily inhalable micro-size, which has adverse impacts on ecology, environment, and human health. Accurate and timely prediction of the air quality index can ensure adequate intervention to aid air quality management. Therefore, this study undertakes a spatial hazard assessment of the air quality index using particulate matter with a diameter of 10 μm or less (PM10) in Selangor, Malaysia, by developing four machine learning models: eXtreme Gradient Boosting (XGBoost), random forest (RF), K-nearest neighbour (KNN), and Naive Bayes (NB). Spatially processed data such as NDVI, SAVI, BU, LST, Ws, slope, elevation, and road density were used for the modelling. The models were trained with 70% of the dataset, while 30% was used for cross-validation. Results showed that XGBoost has the highest overall accuracy and precision of 0.989 and 0.995, followed by random forest (0.989, 0.993), K-nearest neighbour (0.987, 0.984), and Naive Bayes (0.917, 0.922), respectively. The spatial air quality maps were generated by integrating the geographical information system (GIS) with the four MLAs, which correlated with Malaysia's air pollution index. The maps indicate that air quality in Selangor is satisfactory and poses no threats to health. Nevertheless, the two algorithms with the best performance (XGBoost and RF) indicate that a high percentage of the air quality is moderate. The study concludes that successful air pollution management policies such as green infrastructure practice, improvement of energy efficiency, and restrictions on heavy-duty vehicles can be adopted in Selangor and other Southeast Asian cities to prevent deterioration of air quality in the future.
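The evaluation protocol described in the abstract (a 70/30 split, then overall accuracy and precision on the held-out part) can be illustrated with a minimal sketch. This is not the authors' code: the data are synthetic stand-ins rather than the Selangor predictors, the two-class labelling is an assumption, only a plain K-nearest-neighbour classifier is shown, and all function names are hypothetical.

```python
import math
import random


def make_synthetic_samples(n, seed=0):
    """Synthetic stand-in data (NOT the study's dataset).

    Each sample is ((feature_1, feature_2), label), with label 1 or 0
    standing in for two hypothetical air quality classes.
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        label = rng.randint(0, 1)
        centre = 0.7 if label == 1 else 0.3
        features = (rng.gauss(centre, 0.1), rng.gauss(centre, 0.1))
        samples.append((features, label))
    return samples


def train_test_split(samples, train_fraction=0.7, seed=0):
    """Shuffle a copy of the samples and split it into training and held-out parts."""
    shuffled = samples[:]
    random.Random(seed).shuffle(shuffled)
    cut = int(round(train_fraction * len(shuffled)))
    return shuffled[:cut], shuffled[cut:]


def knn_predict(train, point, k=5):
    """Majority vote among the k training samples closest to `point` (Euclidean distance)."""
    nearest = sorted(train, key=lambda s: math.dist(s[0], point))[:k]
    votes = sum(label for _, label in nearest)
    return 1 if 2 * votes > k else 0


def accuracy_and_precision(y_true, y_pred):
    """Overall accuracy, and precision for the positive class (label 1)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    correct = sum(1 for t, p in pairs if t == p)
    accuracy = correct / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    return accuracy, precision


samples = make_synthetic_samples(200)
train, test = train_test_split(samples, train_fraction=0.7)
y_true = [label for _, label in test]
y_pred = [knn_predict(train, features) for features, _ in test]
accuracy, precision = accuracy_and_precision(y_true, y_pred)
print(f"accuracy={accuracy:.3f} precision={precision:.3f}")
```

The scores printed here describe only the synthetic data and say nothing about the values reported in the abstract. Note also that scoring on a single held-out 30% is a hold-out validation; a k-fold cross-validation would repeat the split and average the scores.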
Pages: 86109-86125
Number of pages: 17
Related papers
50 in total
  • [1] GIS-based air quality modelling: spatial prediction of PM10 for Selangor State, Malaysia using machine learning algorithms
    Abdulwaheed Tella
    Abdul-Lateef Balogun
    [J]. Environmental Science and Pollution Research, 2022, 29 : 86109 - 86125
  • [2] Spatial prediction of PM10 concentration using machine learning algorithms in Ankara, Turkey
    Bozdag, Asli
    Dokuz, Yesim
    Gokcek, Oznur Begum
    [J]. ENVIRONMENTAL POLLUTION, 2020, 263
  • [3] Application of GIS-based machine learning algorithms for prediction of irrigational groundwater quality indices
    Mohammed, Musaab A. A.
    Kaya, Fuat
    Mohamed, Ahmed
    Alarifi, Saad S.
    Abdelrady, Ahmed
    Keshavarzi, Ali
    Szabo, Norbert P.
    Szucs, Peter
    [J]. FRONTIERS IN EARTH SCIENCE, 2023, 11
  • [4] Prediction of air pollution and analysis of its effects on the pollution dispersion of PM10 in Egypt using machine learning algorithms
    Hanna, Wael K.
    Elstohy, Rasha
    Radwan, Nouran M.
    [J]. INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2022, 14 (04) : 358 - 371
  • [5] Modelling the Correlation of PM10 Concentration and Location of Air Quality Monitoring Stations in Malaysia Using Network Method
    Rasidi, Norsuhaili Mahamed
    Abu Bakar, Sakhinah
    Razak, Fatimah Abdul
    [J]. ADVANCES IN INDUSTRIAL AND APPLIED MATHEMATICS, 2016, 1750
  • [6] GIS-BASED LANDSLIDE SUSCEPTIBILITY ANALYSIS USING MACHINE LEARNING ALGORITHMS
    Sharma, Ankur
    Sandhu, Har Amrit Singh
    [J]. IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023 : 3038 - 3041
  • [7] Classification Prediction of PM10 Concentration Using a Tree-Based Machine Learning Approach
    Shaziayani, Wan Nur
    Ul-Saufie, Ahmad Zia
    Mutalib, Sofianita
    Noor, Norazian Mohamad
    Zainordin, Nazatul Syadia
    [J]. ATMOSPHERE, 2022, 13 (04)
  • [8] The Impact of Air Quality and Meteorology on COVID-19 Cases at Kuala Lumpur and Selangor, Malaysia and Prediction Using Machine Learning
    Jalaludin, Juliana
    Mansor, Wan Nurdiyana Wan
    Abidin, Nur Afizan
    Suhaimi, Nur Faseeha
    Chao, How-Ran
    [J]. ATMOSPHERE, 2023, 14 (06)
  • [9] Prediction of Spatial Likelihood of Shallow Landslide Using GIS-Based Machine Learning in Awgu, Southeast/Nigeria
    Nnanwuba, Uzodigwe Emmanuel
    Qin, Shengwu
    Adeyeye, Oluwafemi Adewole
    Cosmas, Ndichie Chinemelu
    Yao, Jingyu
    Qiao, Shuangshuang
    Sun, Jingbo
    Egwuonwu, Ekene Mathew
    [J]. SUSTAINABILITY, 2022, 14 (19)
  • [10] PM 2.5 Prediction & Air Quality Classification Using Machine Learning
    Soontornpipit, Pichitpong
    Lekawat, Lertsak
    Tritham, Chatchai
    Tritham, Chattabhorn
    Pongpaibool, Pornanong
    Prasertsuk, Narachata
    Jirakitpuwapat, Wachirapong
    [J]. THAI JOURNAL OF MATHEMATICS, 2024, 22 (02) : 441 - 452