Effects of tuning decision trees in random forest regression on predicting porosity of a hydrocarbon reservoir. A case study: volve oil field, north sea

被引:0
|
作者
Sandunil, Kushan [1 ]
Bennour, Ziad [1 ]
Ben Mahmud, Hisham [2 ]
Giwelli, Ausama [3 ,4 ]
机构
[1] Curtin Univ Malaysia, Miri 98009, Sarawak, Malaysia
[2] Univ Teknol PETRONAS, Seri Iskandar 32610, Perak Darul Rid, Malaysia
[3] INPEX, 100 St Georges Terrace, Perth, WA 6000, Australia
[4] Curtin Univ, WASM, Kensington, WA 6151, Australia
来源
ENERGY ADVANCES | 2024年 / 3卷 / 09期
关键词
PERMEABILITY PREDICTION; SEISMIC ATTRIBUTES; WATER SATURATION; CLASSIFICATION;
D O I
10.1039/d4ya00313f
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Machine learning (ML) has emerged as a powerful tool in petroleum engineering for automatically interpreting well logs and characterizing reservoir properties such as porosity. As a result, researchers are trying to enhance the performance of ML models further to widen their applicability in the real world. Random forest regression (RFR) is one such widely used ML technique that was developed by combining multiple decision trees. To improve its performance, one of its hyperparameters, the number of trees in the forest (n_estimators), is tuned during model optimization. However, the existing literature lacks in-depth studies on the influence of n_estimators on the RFR model when used for predicting porosity, given that n_estimators is one of the most influential hyperparameters that can be tuned to optimize the RFR algorithm. In this study, the effects of n_estimators on the RFR model in porosity prediction were investigated. Furthermore, n_estimators' interactions with two other key hyperparameters, namely the number of features considered for the best split (max_features) and the minimum number of samples required to be at a leaf node (min_samples_leaf) were explored. The RFR models were developed using 4 input features, namely, resistivity log (RES), neutron porosity log (NPHI), gamma ray log (GR), and the corresponding depths obtained from the Volve oil field in the North Sea, and calculated porosity was used as the target data. The methodology consisted of 4 approaches. In the first approach, only n_estimators were changed; in the second approach, n_estimators were changed along with max_features; in the third approach, n_estimators were changed along with min_samples_leaf; and in the final approach, all three hyperparameters were tuned. Altogether 24 RFR models were developed, and models were evaluated using adjusted R2 (adj. R2), root mean squared error (RMSE), and their computational times. The obtained results showed that the highest performance with an adj. R2 value of 0.8505 was achieved when n_estimators was 81, max_features was 2 and min_samples_leaf was 1. In approach 2, when n_estimators' upper limit was increased from 10 to 100, there was a test model performance growth of more than 1.60%, whereas increasing n_estimators' upper limit from 100 to 1000 showed a performance drop of around 0.4%. Models developed by tuning n_estimators from 1 to 100 in intervals of 10 had healthy test model adj. R2 values and lower computational times, making them the best n_estimators' range and interval when both performances and computational times were taken into consideration to predict the porosity of the Volve oil field in the North Sea. Thus, it was concluded that by tuning only n_estimators and max_features, the performance of RFR models can be increased significantly. This study investigates the effects of tuning n_estimators along with max_features and min_samples_leaf in random forest regression when predicting the porosity of the Volve oil field.
引用
收藏
页码:2335 / 2347
页数:13
相关论文
共 6 条
  • [1] Hybrid and automated machine learning approaches for oil fields development: The case study of Volve field, North Sea
    Nikitin, Nikolay O. O.
    Revin, Ilia
    Hvatov, Alexander
    Vychuzhanin, Pavel
    Kalyuzhnaya, Anna, V
    COMPUTERS & GEOSCIENCES, 2022, 161
  • [2] Characterization of elastic moduli with anisotropic rock physics templates considering mineralogy, fluid, porosity, and pore-structure: A case study in Volve field, North Sea
    Mena-Negrete, Joseline
    Valdiviezo-Mijangos, Oscar C.
    Nicolas-Lopez, Ruben
    Coconi-Morales, Enrique
    JOURNAL OF APPLIED GEOPHYSICS, 2022, 206
  • [3] Sequential Data Approach for Rate of Penetration Prediction Using Machine Learning Models: A Case Study the Offshore Volve Oil Field, North Sea, Norway
    Pakawatthapana, Yanadade
    Khonthapagdee, Subhorn
    PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE ON COMPUTING AND INFORMATION TECHNOLOGY, IC2IT 2024, 2024, 973 : 121 - 130
  • [4] Geochemical Effects on Storage Gases and Reservoir Rock during Underground Hydrogen Storage: A Depleted North Sea Oil Reservoir Case Study
    Saeed, Motaz
    Jadhawar, Prashant
    Bagala, Stefano
    HYDROGEN, 2023, 4 (02): : 323 - 337
  • [5] Predicting age at onset of childhood obesity using regression, Random Forest, Decision Tree, and K-Nearest Neighbour-A case study in Saudi Arabia
    Alanazi, Salem Hamoud
    Abdollahian, Mali
    Tafakori, Laleh
    Almulaihan, kheriah Ahmed
    ALruwili, Salman Mutarid
    Alenazi, Omar Falleh
    PLOS ONE, 2024, 19 (09):
  • [6] Bottom-up formulations for the multi-criteria decision analysis of oil and gas pipeline decommissioning in the North Sea: Brent field case study
    Jalili, Shahin
    Leontidis, Georgios
    Cauvin, Samuel R.
    Gormley, Kate
    Stone, Malcolm
    Neilson, Richard
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2024, 365