Predictive lithology mapping using semisupervised learning: Practical insights using a case study from New South Wales, Australia

被引:0
|
作者
Dunham, Michael W. [1 ,2 ]
Malcolm, Alison [1 ]
Welford, J. Kim [1 ]
机构
[1] Mem Univ Newfoundland, Dept Earth Sci, St John, NF, Canada
[2] ALS Goldspot Discoveries, N Vancouver, BC, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
REMOTE-SENSING DATA; SELF-ORGANIZING MAPS; AIRBORNE GAMMA-RAY; MINERAL EXPLORATION; RANDOM FORESTS; IMAGE CLASSIFICATION; LANDSAT; GEOLOGY; IDENTIFICATION; VALIDATION;
D O I
10.1190/GEO2022-0476.1
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
We develop a comprehensive study involving three different types of machine learning (unsupervised, supervised, and semi -supervised, which we emphasize) for bedrock-lithology classifi-cation using a publicly available data set from New South Wales, Australia. The goal of this work is to demonstrate (1) the value each different type of machine learning can provide and (2) which machine learning type(s) may be preferable under different circumstances. Training data are characteristically limited for geoscience problems, which makes supervised techniques sus-ceptible to overfitting; we explore if semisupervised methods can perform better in these circumstances. Using the geophysical data and geologic map provided for the study area, we compare the performance of two supervised methods (the Light Gradient Boosting Machine and eXtreme Gradient Boosting) with one semisupervised algorithm (label propagation [LP]) in three sce-narios with varied limited a priori lithologic constraints (i.e., the training data). Hyperparameter tuning is an essential component of supervised and semisupervised techniques, and the default procedure is to choose the hyperparameter combination with the largest mean cross-validation score. However, we use a new hyperparameter selection strategy that simultaneously uses the mean and standard deviation scores, and we test this new tactic for supervised and semisupervised methods. The results indicate (1) that the new hyperparameter selection technique can slightly improve the performance for supervised and semisupervised methods by 1%-2% compared with the standard selection ap-proach and (2) that LP can outperform the two supervised meth-ods by up to 10%, but it depends on how the training data are distributed. As for the unsupervised analysis, the clusters indicate heterogeneous regions that correlate well with the high-entropy areas in the supervised and semisupervised results. The clustering provides complementary results to the other two types of machine learning and is a source of supporting evidence for suggesting where more in-depth field mapping may be needed.
引用
收藏
页码:JM1 / JM17
页数:17
相关论文
共 50 条
  • [41] Birthplace in New South Wales, Australia: an analysis of perinatal outcomes using routinely collected data
    Caroline SE Homer
    Charlene Thornton
    Vanessa L Scarf
    David A Ellwood
    Jeremy JN Oats
    Maralyn J Foureur
    David Sibbritt
    Helen L McLachlan
    Della A Forster
    Hannah G Dahlen
    BMC Pregnancy and Childbirth, 14
  • [42] Pregnancy, prison and perinatal outcomes in New South Wales, Australia: a retrospective cohort study using linked health data
    Walker, Jane R.
    Hilder, Lisa
    Levy, Michael H.
    Sullivan, Elizabeth A.
    BMC PREGNANCY AND CHILDBIRTH, 2014, 14
  • [43] Pregnancy, prison and perinatal outcomes in New South Wales, Australia: a retrospective cohort study using linked health data
    Jane R Walker
    Lisa Hilder
    Michael H Levy
    Elizabeth A Sullivan
    BMC Pregnancy and Childbirth, 14
  • [44] Genotypic identification of Panicum spp. in New South Wales, Australia using DNA barcoding
    Yuchi Chen
    Xiaocheng Zhu
    Panayiotis Loukopoulos
    Leslie A. Weston
    David E. Albrecht
    Jane C. Quinn
    Scientific Reports, 11
  • [45] Association of climate drivers with rainfall in New South Wales, Australia, using Bayesian Model Averaging
    Hiep Nguyen Duc
    Rivett, Kelly
    MacSween, Katrina
    Linh Le-Anh
    THEORETICAL AND APPLIED CLIMATOLOGY, 2017, 127 (1-2) : 169 - 185
  • [46] LARGE AREA CROP CLASSIFICATION IN NEW-SOUTH-WALES, AUSTRALIA, USING LANDSAT DATA
    DAWBIN, KW
    EVANS, JC
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 1988, 9 (02) : 295 - 301
  • [47] Genotypic identification of Panicum spp. in New South Wales, Australia using DNA barcoding
    Chen, Yuchi
    Zhu, Xiaocheng
    Loukopoulos, Panayiotis
    Weston, Leslie A.
    Albrecht, David E.
    Quinn, Jane C.
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [48] Birthplace in New South Wales, Australia: an analysis of perinatal outcomes using routinely collected data
    Homer, Caroline S. E.
    Thornton, Charlene
    Scarf, Vanessa L.
    Ellwood, David A.
    Oats, Jeremy J. N.
    Foureur, Maralyn J.
    Sibbritt, David
    McLachlan, Helen L.
    Forster, Della A.
    Dahlen, Hannah G.
    BMC PREGNANCY AND CHILDBIRTH, 2014, 14
  • [49] Agricultural drought risk assessment of Northern New South Wales, Australia using geospatial techniques
    Hoque, Muhammad Al-Amin
    Pradhan, Biswajeet
    Ahmed, Naser
    Sohel, Md Shawkat Islam
    SCIENCE OF THE TOTAL ENVIRONMENT, 2021, 756 (756)
  • [50] Evaluating the effectiveness of the maximum permitted dose of midazolam in seizure termination: Insights from New South Wales, Australia
    Fouche, Pieter Francsois
    Nichols, Martin
    Abrahams, Raquel
    Maximous, Kristina
    Bendall, Jason
    EMERGENCY MEDICINE AUSTRALASIA, 2024,