Predictive lithology mapping using semisupervised learning: Practical insights using a case study from New South Wales, Australia

被引:0
|
作者
Dunham, Michael W. [1 ,2 ]
Malcolm, Alison [1 ]
Welford, J. Kim [1 ]
机构
[1] Mem Univ Newfoundland, Dept Earth Sci, St John, NF, Canada
[2] ALS Goldspot Discoveries, N Vancouver, BC, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
REMOTE-SENSING DATA; SELF-ORGANIZING MAPS; AIRBORNE GAMMA-RAY; MINERAL EXPLORATION; RANDOM FORESTS; IMAGE CLASSIFICATION; LANDSAT; GEOLOGY; IDENTIFICATION; VALIDATION;
D O I
10.1190/GEO2022-0476.1
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
We develop a comprehensive study involving three different types of machine learning (unsupervised, supervised, and semi -supervised, which we emphasize) for bedrock-lithology classifi-cation using a publicly available data set from New South Wales, Australia. The goal of this work is to demonstrate (1) the value each different type of machine learning can provide and (2) which machine learning type(s) may be preferable under different circumstances. Training data are characteristically limited for geoscience problems, which makes supervised techniques sus-ceptible to overfitting; we explore if semisupervised methods can perform better in these circumstances. Using the geophysical data and geologic map provided for the study area, we compare the performance of two supervised methods (the Light Gradient Boosting Machine and eXtreme Gradient Boosting) with one semisupervised algorithm (label propagation [LP]) in three sce-narios with varied limited a priori lithologic constraints (i.e., the training data). Hyperparameter tuning is an essential component of supervised and semisupervised techniques, and the default procedure is to choose the hyperparameter combination with the largest mean cross-validation score. However, we use a new hyperparameter selection strategy that simultaneously uses the mean and standard deviation scores, and we test this new tactic for supervised and semisupervised methods. The results indicate (1) that the new hyperparameter selection technique can slightly improve the performance for supervised and semisupervised methods by 1%-2% compared with the standard selection ap-proach and (2) that LP can outperform the two supervised meth-ods by up to 10%, but it depends on how the training data are distributed. As for the unsupervised analysis, the clusters indicate heterogeneous regions that correlate well with the high-entropy areas in the supervised and semisupervised results. The clustering provides complementary results to the other two types of machine learning and is a source of supporting evidence for suggesting where more in-depth field mapping may be needed.
引用
收藏
页码:JM1 / JM17
页数:17
相关论文
共 50 条
  • [31] Multi-response roles in emergency response personnel Insights from New South Wales, Australia
    Linsdell, Greg
    Rogers, Colin
    INTERNATIONAL JOURNAL OF EMERGENCY SERVICES, 2014, 3 (02) : 162 - 178
  • [32] Machine learning assisted lithology prediction using geophysical logs: A case study from Cambay basin
    Prajapati, Rahul
    Mukherjee, Bappa
    Singh, Upendra K.
    Sain, Kalachand
    JOURNAL OF EARTH SYSTEM SCIENCE, 2024, 133 (02)
  • [33] Quantifying uncertainty in rainfall–runoff models due to design losses using Monte Carlo simulation: a case study in New South Wales, Australia
    Melanie Loveridge
    Ataur Rahman
    Stochastic Environmental Research and Risk Assessment, 2014, 28 : 2149 - 2159
  • [34] Sun exposure and prostate cancer risk in New South Wales, Australia: A case control study
    Nair-Shalliker, Visalini
    Smith, David P.
    Egger, Sam
    Hughes, Ann Marie
    Clements, Mark
    Kricker, Anne
    Armstrong, Bruce K.
    CANCER RESEARCH, 2012, 72
  • [35] The emotional geographies of a coal mining transition: a case study of Singleton, New South Wales, Australia
    Egan, Myles
    Sherval, Meg
    Wright, Sarah
    AUSTRALIAN GEOGRAPHER, 2024, 55 (01) : 1 - 21
  • [36] Sustainable energy system planning for the management of MGs: a case study in New South Wales, Australia
    Hejeejo, Rashid
    Qiu, Jing
    Brinsmead, Thomas S.
    Reedman, Luke J.
    IET RENEWABLE POWER GENERATION, 2017, 11 (02) : 228 - 238
  • [37] Information needs for environmental-flow allocation: A case study from the Lachlan River, New South Wales, Australia
    Hillman, M
    Brierley, G
    ANNALS OF THE ASSOCIATION OF AMERICAN GEOGRAPHERS, 2002, 92 (04) : 617 - 630
  • [38] Using administrative health data to describe colorectal and lung cancer care in New South Wales, Australia: a validation study
    Goldsbury, David E.
    Armstrong, Katie
    Simonella, Leonardo
    Armstrong, Bruce K.
    O'Connell, Dianne L.
    BMC HEALTH SERVICES RESEARCH, 2012, 12
  • [39] Association of climate drivers with rainfall in New South Wales, Australia, using Bayesian Model Averaging
    Hiep Nguyen Duc
    Kelly Rivett
    Katrina MacSween
    Linh Le-Anh
    Theoretical and Applied Climatology, 2017, 127 : 169 - 185
  • [40] Using administrative health data to describe colorectal and lung cancer care in New South Wales, Australia: a validation study
    David E Goldsbury
    Katie Armstrong
    Leonardo Simonella
    Bruce K Armstrong
    Dianne L O’Connell
    BMC Health Services Research, 12