Predictive lithology mapping using semisupervised learning: Practical insights using a case study from New South Wales, Australia

被引:0
|
作者
Dunham, Michael W. [1 ,2 ]
Malcolm, Alison [1 ]
Welford, J. Kim [1 ]
机构
[1] Mem Univ Newfoundland, Dept Earth Sci, St John, NF, Canada
[2] ALS Goldspot Discoveries, N Vancouver, BC, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
REMOTE-SENSING DATA; SELF-ORGANIZING MAPS; AIRBORNE GAMMA-RAY; MINERAL EXPLORATION; RANDOM FORESTS; IMAGE CLASSIFICATION; LANDSAT; GEOLOGY; IDENTIFICATION; VALIDATION;
D O I
10.1190/GEO2022-0476.1
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
We develop a comprehensive study involving three different types of machine learning (unsupervised, supervised, and semi -supervised, which we emphasize) for bedrock-lithology classifi-cation using a publicly available data set from New South Wales, Australia. The goal of this work is to demonstrate (1) the value each different type of machine learning can provide and (2) which machine learning type(s) may be preferable under different circumstances. Training data are characteristically limited for geoscience problems, which makes supervised techniques sus-ceptible to overfitting; we explore if semisupervised methods can perform better in these circumstances. Using the geophysical data and geologic map provided for the study area, we compare the performance of two supervised methods (the Light Gradient Boosting Machine and eXtreme Gradient Boosting) with one semisupervised algorithm (label propagation [LP]) in three sce-narios with varied limited a priori lithologic constraints (i.e., the training data). Hyperparameter tuning is an essential component of supervised and semisupervised techniques, and the default procedure is to choose the hyperparameter combination with the largest mean cross-validation score. However, we use a new hyperparameter selection strategy that simultaneously uses the mean and standard deviation scores, and we test this new tactic for supervised and semisupervised methods. The results indicate (1) that the new hyperparameter selection technique can slightly improve the performance for supervised and semisupervised methods by 1%-2% compared with the standard selection ap-proach and (2) that LP can outperform the two supervised meth-ods by up to 10%, but it depends on how the training data are distributed. As for the unsupervised analysis, the clusters indicate heterogeneous regions that correlate well with the high-entropy areas in the supervised and semisupervised results. The clustering provides complementary results to the other two types of machine learning and is a source of supporting evidence for suggesting where more in-depth field mapping may be needed.
引用
收藏
页码:JM1 / JM17
页数:17
相关论文
共 50 条
  • [22] Mapping accumulated mine subsidence using small stack of SAR differential interferograms in the Southern coalfield of New South Wales, Australia
    Ng, Alex Hay-Man
    Ge, Linlin
    Yan, Yueguan
    Li, Xiaojing
    Chang, Hsing-Chung
    Zhang, Kui
    Rizos, Chris
    ENGINEERING GEOLOGY, 2010, 115 (1-2) : 1 - 15
  • [23] Predicting the age of fish using general and generalized linear models of biometric data: A case study of two estuarine finfish from New South Wales, Australia
    Ochwada, Faith A.
    Scandol, James P.
    Gray, Charles A.
    FISHERIES RESEARCH, 2008, 90 (1-3) : 187 - 197
  • [24] Mapping the distribution of DDT residues as DDE in the soils of the irrigated regions of northern New South Wales, Australia using ELISA and GIS
    Shivaramaiah, HM
    Odeh, IOA
    Kennedy, IR
    Skerritt, JH
    JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY, 2002, 50 (19) : 5360 - 5367
  • [25] Mapping available soil water capacity in New South Wales, Australia using sparse data-An inverse Bayesian approach
    Somarathna, P. D. S. N.
    Searle, Ross
    Gladish, Daniel W.
    GEODERMA REGIONAL, 2021, 25
  • [26] Impact of Forest Fires on Air Quality in Wolgan Valley, New South Wales, Australia-A Mapping and Monitoring Study Using Google Earth Engine
    Singh, Sachchidanand
    Singh, Harikesh
    Sharma, Vishal
    Shrivastava, Vaibhav
    Kumar, Pankaj
    Kanga, Shruti
    Sahu, Netrananda
    Meraj, Gowhar
    Farooq, Majid
    Singh, Suraj Kumar
    FORESTS, 2022, 13 (01):
  • [27] 2001 Regional Disability Estimates for New South Wales, Australia, Using Spatial Microsimulation
    S. Lymer
    L. Brown
    M. Yap
    A. Harding
    Applied Spatial Analysis and Policy, 2008, 1 (2) : 99 - 116
  • [28] Estimation of vegetative fuel loads using LandsatTM imagery in New South Wales, Australia
    Brandis, K
    Jacobson, C
    INTERNATIONAL JOURNAL OF WILDLAND FIRE, 2003, 12 (02) : 185 - 194
  • [29] Simulating Wheat Yield in New South Wales of Australia Using Interpolation and Neural Networks
    Guo, William W.
    Li, Lily D.
    Whymark, Greg
    NEURAL INFORMATION PROCESSING: MODELS AND APPLICATIONS, PT II, 2010, 6444 : 708 - 715
  • [30] Increasing the Adaptation Pathways Capacity of Land Use Planning - Insights from New South Wales, Australia
    McNicol, Ian
    URBAN POLICY AND RESEARCH, 2021, 39 (02) : 143 - 156