Machine learning versus linear regression modelling approach for accurate ozone concentrations prediction

被引:65
|
作者
Jumin, Ellysia [1 ]
Zaini, Nuratiah [1 ]
Ahmed, Ali Najah [2 ]
Abdullah, Samsuri [3 ]
Ismail, Marzuki [4 ,5 ]
Sherif, Mohsen [6 ,7 ]
Sefelnasr, Ahmed [6 ]
EI-Shafie, Ahmed [6 ,8 ]
机构
[1] Univ Tenaga Nas, Coll Engn, Dept Civil Engn, Selangor Darul Ehsan, Malaysia
[2] Univ Tenaga Nas, Inst Energy Infrastruct IEI, Selangor Darul Ehsan, Malaysia
[3] Univ Malaysia Terengganu, Fac Ocean Engn Technol & Informat, Air Qual & Environm Res Grp, Terengganu, Malaysia
[4] Univ Malaysia Terengganu, Fac Sci & Marine Environm, Terengganu, Malaysia
[5] Univ Malaysia Terengganu, Inst Trop Biodivers & Sustainable Dev, Terengganu, Malaysia
[6] United Arab Emirates Univ, Natl Water Ctr NWC, Al Ain, U Arab Emirates
[7] United Arab Emirates Univ, Coll Engn, Civil & Environm Engn Dept, Al Ain, U Arab Emirates
[8] Univ Malaya, Fac Engn, Dept Civil Engn, Kuala Lumpur, Malaysia
关键词
Ozone concentration prediction; machine learning algorithm; ozone precursors; Boosted Decision Tree Regression; neural network; linear regression; Pearson Correlation; ARTIFICIAL NEURAL-NETWORKS; SUPPORT VECTOR REGRESSION; TROPOSPHERIC OZONE; LEVEL OZONE; EXPOSURE;
D O I
10.1080/19942060.2020.1758792
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
High level of tropospheric ozone concentration, exceeding allowable level has been frequently reported in Malaysia. This study proposes accurate model based on Machine Learning algorithms to predict Tropospheric ozone concentration in major cities located in Kuala Lumpur and Selangor, Malaysia. The proposed models were developed using three-year of historical data for different parameters as input to predict 24-hour and 12-hour of tropospheric ozone concentration. Different Machine Learning algorithms have been investigated, viz. Linear Regression, Neural Network and Boosted Decision Tree. The results revealed that wind speed, humidity, Nitrogen Oxide, Carbon Monoxide and Nitrogen Dioxide have significant influence on ozone formation. Boosted Decision Tree outperformed Linear regression and Neural Network algorithms for all stations. The performance of the proposed model improved by using 12-hours dataset instead of the 24-hour where R-2 values were equal to 0.91, 0.88 and 0.87 for the three investigated stations. To assess the uncertainties of the Boosted Decision Tree model, 95% prediction uncertainties (95PPU) d-factors were introduced.95PPU showed about 94.4, 93.4, 96.7% and the d-factors were 0.001015, 0.001016 and 0.001124 which relate to S1, S2 and S3, respectively. The obtained results provide a reliable prediction model to mimic actual ozone concentration in different locations in Malaysia.
引用
收藏
页码:713 / 725
页数:13
相关论文
共 50 条
  • [1] An Accurate Prediction of MPG (Miles Per Gallon) using Linear Regression Model of Machine Learning
    Shirbhayye, Varun
    Kurmi, Deepesh
    Dyavanapalli, Siddharth
    Prasad, Agraharam Sai Hari
    Lal, Nidhi
    2020 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI - 2020), 2020, : 74 - +
  • [2] Prediction of Ozone Hourly Concentrations Based on Machine Learning Technology
    Li, Dong
    Ren, Xiaofei
    SUSTAINABILITY, 2022, 14 (10)
  • [3] Analysis and prediction of atmospheric ozone concentrations using machine learning
    Rass, Stephan
    Leuenberger, Markus C.
    FRONTIERS IN BIG DATA, 2025, 7
  • [4] Transformation of nitrogen dioxide into ozone and prediction of ozone concentrations using multiple linear regression techniques
    Nurul Adyani Ghazali
    Nor Azam Ramli
    Ahmad Shukri Yahaya
    Noor Faizah Fitri MD Yusof
    Nurulilyana Sansuddin
    Wesam Ahmed Al Madhoun
    Environmental Monitoring and Assessment, 2010, 165 : 475 - 489
  • [5] Transformation of nitrogen dioxide into ozone and prediction of ozone concentrations using multiple linear regression techniques
    Ghazali, Nurul Adyani
    Ramli, Nor Azam
    Yahaya, Ahmad Shukri
    Yusof, Noor Faizah Fitri M. D.
    Sansuddin, Nurulilyana
    Al Madhoun, Wesam Ahmed
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2010, 165 (1-4) : 475 - 489
  • [6] Prediction and Elucidation of Triglycerides Levels Using a Machine Learning and Linear Fuzzy Modelling Approach
    Ahmad, Wan Muhamad Amir W.
    Ahmed, Faraz
    Noor, Nor Farid Mohd
    Aleng, Nor Azlida
    Ghazali, Farah Muna Mohamad
    Alam, Mohammad Khursheed
    BIOMED RESEARCH INTERNATIONAL, 2022, 2022
  • [7] Prediction of frictional behaviour through regression equations: A statistical modelling approach validated with machine learning
    Kumar, Vineet
    Gautam, Gaurav
    Singh, Manvandra Kumar
    Ji, Gopal
    Katiyar, Jitendra Kumar
    Mohan, Sunil
    Mohan, Anita
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART J-JOURNAL OF ENGINEERING TRIBOLOGY, 2024,
  • [8] Machine learning versus regression for prediction of sporadic pancreatic cancer
    Chen, Wansu
    Zhou, Botao
    Jeon, Christie Y.
    Xie, Fagen
    Lin, Yu-Chen
    Butler, Rebecca K.
    Zhou, Yichen
    Luong, Tiffany Q.
    Lustigova, Eva
    Pisegna, Joseph R.
    Wu, Bechien U.
    PANCREATOLOGY, 2023, 23 (04) : 396 - 402
  • [10] Optimization with Adaptive Learning: A Better Approach for Reducing SSE to Fit Accurate Linear Regression Model for Prediction
    Verma, Vijay Kumar
    Banodha, Umesh
    Malpani, Kamlesh
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (10) : 168 - 173