Machine learning versus linear regression modelling approach for accurate ozone concentrations prediction

被引:65
|
作者
Jumin, Ellysia [1 ]
Zaini, Nuratiah [1 ]
Ahmed, Ali Najah [2 ]
Abdullah, Samsuri [3 ]
Ismail, Marzuki [4 ,5 ]
Sherif, Mohsen [6 ,7 ]
Sefelnasr, Ahmed [6 ]
EI-Shafie, Ahmed [6 ,8 ]
机构
[1] Univ Tenaga Nas, Coll Engn, Dept Civil Engn, Selangor Darul Ehsan, Malaysia
[2] Univ Tenaga Nas, Inst Energy Infrastruct IEI, Selangor Darul Ehsan, Malaysia
[3] Univ Malaysia Terengganu, Fac Ocean Engn Technol & Informat, Air Qual & Environm Res Grp, Terengganu, Malaysia
[4] Univ Malaysia Terengganu, Fac Sci & Marine Environm, Terengganu, Malaysia
[5] Univ Malaysia Terengganu, Inst Trop Biodivers & Sustainable Dev, Terengganu, Malaysia
[6] United Arab Emirates Univ, Natl Water Ctr NWC, Al Ain, U Arab Emirates
[7] United Arab Emirates Univ, Coll Engn, Civil & Environm Engn Dept, Al Ain, U Arab Emirates
[8] Univ Malaya, Fac Engn, Dept Civil Engn, Kuala Lumpur, Malaysia
关键词
Ozone concentration prediction; machine learning algorithm; ozone precursors; Boosted Decision Tree Regression; neural network; linear regression; Pearson Correlation; ARTIFICIAL NEURAL-NETWORKS; SUPPORT VECTOR REGRESSION; TROPOSPHERIC OZONE; LEVEL OZONE; EXPOSURE;
D O I
10.1080/19942060.2020.1758792
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
High level of tropospheric ozone concentration, exceeding allowable level has been frequently reported in Malaysia. This study proposes accurate model based on Machine Learning algorithms to predict Tropospheric ozone concentration in major cities located in Kuala Lumpur and Selangor, Malaysia. The proposed models were developed using three-year of historical data for different parameters as input to predict 24-hour and 12-hour of tropospheric ozone concentration. Different Machine Learning algorithms have been investigated, viz. Linear Regression, Neural Network and Boosted Decision Tree. The results revealed that wind speed, humidity, Nitrogen Oxide, Carbon Monoxide and Nitrogen Dioxide have significant influence on ozone formation. Boosted Decision Tree outperformed Linear regression and Neural Network algorithms for all stations. The performance of the proposed model improved by using 12-hours dataset instead of the 24-hour where R-2 values were equal to 0.91, 0.88 and 0.87 for the three investigated stations. To assess the uncertainties of the Boosted Decision Tree model, 95% prediction uncertainties (95PPU) d-factors were introduced.95PPU showed about 94.4, 93.4, 96.7% and the d-factors were 0.001015, 0.001016 and 0.001124 which relate to S1, S2 and S3, respectively. The obtained results provide a reliable prediction model to mimic actual ozone concentration in different locations in Malaysia.
引用
收藏
页码:713 / 725
页数:13
相关论文
共 50 条
  • [41] Risk prediction with machine learning and regression methods
    Steyerberg, Ewout W.
    van der Ploeg, Tjeerd
    Van Calster, Ben
    BIOMETRICAL JOURNAL, 2014, 56 (04) : 601 - 606
  • [42] Machine learning in coronary heart disease prediction: Structural equation modelling approach
    Rodrigues, Lewlyn L. R.
    Shetty, Dasharathraj K.
    Naik, Nithesh
    Maddodi, Chethana Balakrishna
    Rao, Anuradha
    Shetty, Ajith Kumar
    Bhat, Rama
    Hameed, Zeeshan
    COGENT ENGINEERING, 2020, 7 (01):
  • [43] Improved linear regression prediction by transfer learning
    Obst, David
    Ghattas, Badih
    Claudel, Sandra
    Cugliari, Jairo
    Goude, Yannig
    Oppenheim, Georges
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2022, 174
  • [44] A comparison of machine learning methods for ozone pollution prediction
    Qilong Pan
    Fouzi Harrou
    Ying Sun
    Journal of Big Data, 10
  • [45] A machine learning approach for optimizing and accurate prediction of performance parameters for stacked nanosheet transistor
    Kumar, Naveen
    Rajakumari, V
    Padhy, Ram Prasad
    Routray, S.
    Pradhan, K. P.
    PHYSICA SCRIPTA, 2024, 99 (04)
  • [46] A comparison of machine learning methods for ozone pollution prediction
    Pan, Qilong
    Harrou, Fouzi
    Sun, Ying
    JOURNAL OF BIG DATA, 2023, 10 (01)
  • [47] Prediction of eating disorder treatment response trajectories via machine learning does not improve performance versus a simpler regression approach
    Espel-Huynh, Hallie
    Zhang, Fengqing
    Thomas, J. Graham
    Boswell, James F.
    Thompson-Brenner, Heather
    Juarascio, Adrienne S.
    Lowe, Michael R.
    INTERNATIONAL JOURNAL OF EATING DISORDERS, 2021, 54 (07) : 1250 - 1259
  • [48] MAGPIE: accurate pathogenic prediction for multiple variant types using machine learning approach
    Liu, Yicheng
    Zhang, Tianyun
    You, Ningyuan
    Wu, Sai
    Shen, Ning
    GENOME MEDICINE, 2024, 16 (01)
  • [49] A machine learning approach to the accurate prediction of multi-leaf collimator positional errors
    Carlson, Joel N. K.
    Park, Jong Min
    Park, So-Yeon
    Park, Jong In
    Choi, Yunseok
    Ye, Sung-Joon
    PHYSICS IN MEDICINE AND BIOLOGY, 2016, 61 (06): : 2514 - 2531
  • [50] MAGPIE: accurate pathogenic prediction for multiple variant types using machine learning approach
    Yicheng Liu
    Tianyun Zhang
    Ningyuan You
    Sai Wu
    Ning Shen
    Genome Medicine, 16