Evaluation of machine learning and deep learning models for daily air quality index prediction in Delhi city, India

被引：0

作者：

Pande, Chaitanya Baliram ^{[1
]}

Radhadevi, Latha ^{[1
]}

Satyanarayana, Murthy Bandaru ^{[1
]}

机构：

[1] Indian Institute of Tropical Meteorology, NCL Post, Dr. Homi Bhabha Road, Pune,411008, India

来源：

Environmental Monitoring and Assessment | 2024年 / 196卷 / 12期

关键词：

Air pollution; Extreme gradient boosting; Cross-validation; SHAP method; ANN model;

D O I：

10.1007/s10661-024-13351-1

中图分类号：

学科分类号：

摘要：

The air quality index (AQI), based on criteria for air contaminants, is defined to provide a shared vision of air quality. As air pollution continues to rise in global cities due to urbanization and climate change, air pollution monitoring and forecasting models for effective air quality monitoring that gather and forecast information about air pollution concentration are essential in every city. Air quality predictions have evolved to be more helpful for management. Recently, better performance and ability have developed due to the involvement of machine learning (ML) and artificial intelligence (AI) in forecasting air quality in urban cities in India. This paper focuses on air pollution as a significant ecological problem that directly impacts human health and the distribution of an environmental system in urban areas. Hence, we have developed advanced models for daily AQI forecasting to understand the air effluence level in the upcoming days. In this research, six data-driven models have been developed and implemented for daily AQI forecasting in the study area; it is crucial for understanding the future air pollution levels to plan and control air pollution in the entire city. The developed model is applied to air quality datasets. A comparison of the performance of ML models tested here indicates that the XGBoost algorithm achieves the highest coefficient of determination (R2) and root-mean-square deviation (RMSE) value of 0.99 and lower values value of 4.65 than other models in the testing phase. The results of the artificial neural network (ANN) algorithm are slightly lower than the extreme gradient boosting (XGBoost model); the ANN model results are as R2, mean squared error (MSE), and RMSE values of 0.99, 13.99, and 198.88, respectively. All the models were subjected to a ten-fold cross-validation model. However, the RF cross-validation model outperforms other models; the RF model result shows the R2, RMSE, and MSE values of 0.99, 3.64, and 4.12, respectively. This study also employed two interpretable models, namely feature importance analysis and Shapley additive explanation (SHAP), to evaluate both the global and local methods in a manner that is independent of specific ML models. The feature importance shows that particle matter (PM) 2.5, PM10, carbon monoxide (CO), and nitrogen oxides (NOx) were the most influential variables. The results determined that such novel DL and ML models may improve the accuracy of AQI forecasts and understanding of air pollution, particularly in metropolitan cities.

引用

共 50 条

[41] Prediction of Air Quality Index by Extreme Learning Machines
Baran, Burhan
2019 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP 2019), 2019,
[42] Air Quality Prediction Of Data Log By Machine Learning
Pasupuleti, Venkat Rao
Uhasri
Kalyan, Pavan
Srikanth
Reddy, Hari Kiran
2020 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2020, : 1395 - 1399
[43] Hybrid machine learning models for prediction of daily dissolved oxygen
Azma, Aliasghar
Liu, Yakun
Azma, Masoumeh
Saadat, Mohsen
Zhang, Di
Cho, Jinwoo
Rezania, Shahabaldin
JOURNAL OF WATER PROCESS ENGINEERING, 2023, 54
[44] Machine Learning-Based Prediction of Air Quality
Liang, Yun-Chia
Maimury, Yona
Chen, Angela Hsiang-Ling
Juarez, Josue Rodolfo Cuevas
APPLIED SCIENCES-BASEL, 2020, 10 (24): : 1 - 17
[45] Monitoring the Impact of Air Quality on the COVID-19 Fatalities in Delhi, India: Using Machine Learning Techniques
Sethi, Jasleen Kaur
Mittal, Mamta
DISASTER MEDICINE AND PUBLIC HEALTH PREPAREDNESS, 2022, 16 (02) : 604 - 611
[46] DEEP LEARNING AND MACHINE LEARNING MODELS TO FORECAST BSE AND NIFTY SENSEX IT INDEX
Selvakumar, V.
Satpathi, Dipak Kumar
Chhabra, Abhinav
Nema, Arjita
ADVANCES AND APPLICATIONS IN STATISTICS, 2022, 82 : 9 - 26
[47] A deep multitask learning approach for air quality prediction
Sun, Xiaotong
Xu, Wei
Jiang, Hongxun
Wang, Qili
ANNALS OF OPERATIONS RESEARCH, 2021, 303 (1-2) : 51 - 79
[48] A deep multitask learning approach for air quality prediction
Xiaotong Sun
Wei Xu
Hongxun Jiang
Qili Wang
Annals of Operations Research, 2021, 303 : 51 - 79
[49] A Comparative Assessment of Machine Learning and Deep Learning Models for the Daily River Streamflow Forecasting
Malihe Danesh
Amin Gharehbaghi
Saeid Mehdizadeh
Amirhossein Danesh
Water Resources Management, 2025, 39 (4) : 1911 - 1930
[50] Development of Machine Learning and Deep Learning Prediction Models for PM2.5 in Ho Chi Minh City, Vietnam
Nguyen, Phuc Hieu
Dao, Nguyen Khoi
Nguyen, Ly Sy Phu
Atmosphere, 15 (10):

← 1 2 3 4 5 →