Evaluation of machine learning and deep learning models for daily air quality index prediction in Delhi city, India

被引:0
|
作者
Pande, Chaitanya Baliram [1 ]
Radhadevi, Latha [1 ]
Satyanarayana, Murthy Bandaru [1 ]
机构
[1] Indian Institute of Tropical Meteorology, NCL Post, Dr. Homi Bhabha Road, Pune,411008, India
关键词
Air pollution; Extreme gradient boosting; Cross-validation; SHAP method; ANN model;
D O I
10.1007/s10661-024-13351-1
中图分类号
学科分类号
摘要
The air quality index (AQI), based on criteria for air contaminants, is defined to provide a shared vision of air quality. As air pollution continues to rise in global cities due to urbanization and climate change, air pollution monitoring and forecasting models for effective air quality monitoring that gather and forecast information about air pollution concentration are essential in every city. Air quality predictions have evolved to be more helpful for management. Recently, better performance and ability have developed due to the involvement of machine learning (ML) and artificial intelligence (AI) in forecasting air quality in urban cities in India. This paper focuses on air pollution as a significant ecological problem that directly impacts human health and the distribution of an environmental system in urban areas. Hence, we have developed advanced models for daily AQI forecasting to understand the air effluence level in the upcoming days. In this research, six data-driven models have been developed and implemented for daily AQI forecasting in the study area; it is crucial for understanding the future air pollution levels to plan and control air pollution in the entire city. The developed model is applied to air quality datasets. A comparison of the performance of ML models tested here indicates that the XGBoost algorithm achieves the highest coefficient of determination (R2) and root-mean-square deviation (RMSE) value of 0.99 and lower values value of 4.65 than other models in the testing phase. The results of the artificial neural network (ANN) algorithm are slightly lower than the extreme gradient boosting (XGBoost model); the ANN model results are as R2, mean squared error (MSE), and RMSE values of 0.99, 13.99, and 198.88, respectively. All the models were subjected to a ten-fold cross-validation model. However, the RF cross-validation model outperforms other models; the RF model result shows the R2, RMSE, and MSE values of 0.99, 3.64, and 4.12, respectively. This study also employed two interpretable models, namely feature importance analysis and Shapley additive explanation (SHAP), to evaluate both the global and local methods in a manner that is independent of specific ML models. The feature importance shows that particle matter (PM) 2.5, PM10, carbon monoxide (CO), and nitrogen oxides (NOx) were the most influential variables. The results determined that such novel DL and ML models may improve the accuracy of AQI forecasts and understanding of air pollution, particularly in metropolitan cities.
引用
收藏
相关论文
共 50 条
  • [31] Comparative analysis of Air Quality Index prediction using deep learning algorithms
    Mishra, Ankita
    Gupta, Yogesh
    SPATIAL INFORMATION RESEARCH, 2024, 32 (01) : 63 - 72
  • [32] Air Quality Index prediction using an effective hybrid deep learning model
    Sarkar, Nairita
    Gupta, Rajan
    Keserwani, Pankaj Kumar
    Govil, Mahesh Chandra
    ENVIRONMENTAL POLLUTION, 2022, 315
  • [33] Development of machine learning models for estimation of daily evaporation and mean temperature: a case study in New Delhi, India
    Rajput, Jitendra
    Kushwaha, Nand Lal
    Srivastava, Aman
    Pande, Chaitanya B.
    Suna, Triptimayee
    Sena, D. R.
    Singh, D. K.
    Mishra, A. K.
    Sahoo, P. K.
    Elbeltagi, Ahmed
    WATER PRACTICE AND TECHNOLOGY, 2024, 19 (07) : 2655 - 2672
  • [34] Prediction of Preeclampsia Using Machine Learning and Deep Learning Models: A Review
    Aljameel, Sumayh S.
    Alzahrani, Manar
    Almusharraf, Reem
    Altukhais, Majd
    Alshaia, Sadeem
    Sahlouli, Hanan
    Aslam, Nida
    Khan, Irfan Ullah
    Alabbad, Dina A.
    Alsumayt, Albandari
    BIG DATA AND COGNITIVE COMPUTING, 2023, 7 (01)
  • [35] Wind Power Prediction Based on Machine Learning and Deep Learning Models
    Tarek, Zahraa
    Shams, Mahmoud Y.
    Elshewey, Ahmed M.
    El-kenawy, El-Sayed M.
    Ibrahim, Abdelhameed
    Abdelhamid, Abdelaziz A.
    El-dosuky, Mohamed A.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (01): : 715 - 732
  • [36] Improving soil moisture prediction with deep learning and machine learning models
    Teshome, Fitsum T.
    Bayabil, Haimanote K.
    Schaffer, Bruce
    Ampatzidis, Yiannis
    Hoogenboom, Gerrit
    Computers and Electronics in Agriculture, 2024, 226
  • [37] Machine Learning Approach for Predicting Air Quality Index
    Kekulanadara, K. M. O. V. K.
    Kumara, B. T. G. S.
    Kuhaneswaran, Banujan
    2021 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATION (DASA), 2021,
  • [38] Air Quality Index (AQI) Prediction in Holy Makkah Based on Machine Learning Methods
    Almaliki, Abdulrazak H.
    Derdour, Abdessamed
    Ali, Enas
    SUSTAINABILITY, 2023, 15 (17)
  • [39] PREDICTION OF AIR QUALITY INDEX BASED ON METEOROLOGICAL VARIABLES USING MACHINE LEARNING TECHNIQUES
    Uguz, Sinan
    Oral, Okan
    FRESENIUS ENVIRONMENTAL BULLETIN, 2021, 30 (08): : 10057 - 10077
  • [40] Stock Market Index Prediction Using Machine Learning and Deep Learning Techniques
    Saboor, Abdus
    Hussain, Arif
    Agbley, Bless Lord Y.
    ul Haq, Amin
    Li, Jian Ping
    Kumar, Rajesh
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 37 (02): : 1325 - 1344