Machine learning versus linear regression modelling approach for accurate ozone concentrations prediction

被引:65
|
作者
Jumin, Ellysia [1 ]
Zaini, Nuratiah [1 ]
Ahmed, Ali Najah [2 ]
Abdullah, Samsuri [3 ]
Ismail, Marzuki [4 ,5 ]
Sherif, Mohsen [6 ,7 ]
Sefelnasr, Ahmed [6 ]
EI-Shafie, Ahmed [6 ,8 ]
机构
[1] Univ Tenaga Nas, Coll Engn, Dept Civil Engn, Selangor Darul Ehsan, Malaysia
[2] Univ Tenaga Nas, Inst Energy Infrastruct IEI, Selangor Darul Ehsan, Malaysia
[3] Univ Malaysia Terengganu, Fac Ocean Engn Technol & Informat, Air Qual & Environm Res Grp, Terengganu, Malaysia
[4] Univ Malaysia Terengganu, Fac Sci & Marine Environm, Terengganu, Malaysia
[5] Univ Malaysia Terengganu, Inst Trop Biodivers & Sustainable Dev, Terengganu, Malaysia
[6] United Arab Emirates Univ, Natl Water Ctr NWC, Al Ain, U Arab Emirates
[7] United Arab Emirates Univ, Coll Engn, Civil & Environm Engn Dept, Al Ain, U Arab Emirates
[8] Univ Malaya, Fac Engn, Dept Civil Engn, Kuala Lumpur, Malaysia
关键词
Ozone concentration prediction; machine learning algorithm; ozone precursors; Boosted Decision Tree Regression; neural network; linear regression; Pearson Correlation; ARTIFICIAL NEURAL-NETWORKS; SUPPORT VECTOR REGRESSION; TROPOSPHERIC OZONE; LEVEL OZONE; EXPOSURE;
D O I
10.1080/19942060.2020.1758792
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
High level of tropospheric ozone concentration, exceeding allowable level has been frequently reported in Malaysia. This study proposes accurate model based on Machine Learning algorithms to predict Tropospheric ozone concentration in major cities located in Kuala Lumpur and Selangor, Malaysia. The proposed models were developed using three-year of historical data for different parameters as input to predict 24-hour and 12-hour of tropospheric ozone concentration. Different Machine Learning algorithms have been investigated, viz. Linear Regression, Neural Network and Boosted Decision Tree. The results revealed that wind speed, humidity, Nitrogen Oxide, Carbon Monoxide and Nitrogen Dioxide have significant influence on ozone formation. Boosted Decision Tree outperformed Linear regression and Neural Network algorithms for all stations. The performance of the proposed model improved by using 12-hours dataset instead of the 24-hour where R-2 values were equal to 0.91, 0.88 and 0.87 for the three investigated stations. To assess the uncertainties of the Boosted Decision Tree model, 95% prediction uncertainties (95PPU) d-factors were introduced.95PPU showed about 94.4, 93.4, 96.7% and the d-factors were 0.001015, 0.001016 and 0.001124 which relate to S1, S2 and S3, respectively. The obtained results provide a reliable prediction model to mimic actual ozone concentration in different locations in Malaysia.
引用
收藏
页码:713 / 725
页数:13
相关论文
共 50 条
  • [21] Hybrid machine learning model for hourly ozone concentrations prediction and exposure risk assessment
    Lingxia, Wu
    Qijie, Zhang
    Jie, Li
    Junlin, An
    ATMOSPHERIC POLLUTION RESEARCH, 2023, 14 (11)
  • [22] Integrating graphology and machine learning for accurate prediction of personality: a novel approach
    Bandhu, Kailash Chandra
    Litoriya, Ratnesh
    Khatri, Mihir
    Kaul, Milind
    Soni, Prakhar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (30) : 46457 - 46481
  • [23] Hybrid machine learning approach for accurate prediction of the drilling rock index
    Shahani, Niaz Muhammad
    Zheng, Xigui
    Wei, Xin
    Jiang, Hongwei
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [24] Accurate solubility prediction with error bars for electrolytes: A machine learning approach
    Schroeter, Timon S.
    Schwaighofer, Anton
    Mika, Sebastian
    ter Laak, Antonius
    Suelzle, Detlev
    Heinrich, Nikolaus
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2006, 232 : 137 - 137
  • [25] Accurate solubility prediction with error bars for electrolytes:: A machine learning approach
    Schwaighofer, Anton
    Schroeter, Timon
    Mika, Sebastian
    Laub, Julian
    ter Laak, Antonius
    Suelzle, Detlev
    Ganzer, Ursula
    Heinrich, Nikolaus
    Mueller, Klaus-Robert
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (02) : 407 - 424
  • [26] Integrating graphology and machine learning for accurate prediction of personality: a novel approach
    Kailash Chandra Bandhu
    Ratnesh Litoriya
    Mihir Khatri
    Milind Kaul
    Prakhar Soni
    Multimedia Tools and Applications, 2023, 82 : 46457 - 46481
  • [27] Accurate prediction of dielectric properties and bandgaps in materials with a machine learning approach
    Hu, Yilin
    Wu, Maokun
    Yuan, Miaojia
    Wen, Yichen
    Ren, Pengpeng
    Ye, Sheng
    Liu, Fayong
    Zhou, Bo
    Fang, Hui
    Wang, Runsheng
    Ji, Zhigang
    Huang, Ru
    APPLIED PHYSICS LETTERS, 2024, 125 (15)
  • [28] An Approach for Potato Yield Prediction Using Machine Learning Regression Algorithms
    Patnaik, Prabhu Prasad
    Padhy, Neelamadhab
    NEXT GENERATION OF INTERNET OF THINGS, 2023, 445 : 327 - 336
  • [29] Importance of secondary decomposition in the accurate prediction of daily-scale ozone pollution by machine learning
    Du, Xinyue
    Yuan, Zibing
    Huang, Daojian
    Ma, Wei
    Yang, Jun
    Mo, Jianbin
    SCIENCE OF THE TOTAL ENVIRONMENT, 2023, 904
  • [30] Machine learning versus logistic regression for the prediction of complication after pancreatoduodenectomy Response
    Ingwersen, Erik W.
    Daams, F.
    SURGERY, 2024, 175 (05) : 1467 - 1467