Estimation of Spatial Distribution Using the Gaussian Mixture Model with Multivariate Geoscience Data

被引:2
|
作者
Kim, Ho-Rim [1 ]
Yu, Soonyoung [2 ]
Yun, Seong-Taek [2 ]
Kim, Kyoung-Ho [3 ]
Lee, Goon-Taek [4 ]
Lee, Jeong-Ho [1 ]
Heo, Chul-Ho [1 ]
Ryu, Dong-Woo [1 ]
机构
[1] Korea Inst Geosci & Mineral Resources, Daejeon, South Korea
[2] Korea Univ, Seoul, South Korea
[3] Korea Environm Inst, Yeongi Gun, South Korea
[4] Seoul Natl Univ, Natl Instrumentat Ctr Environm Management, Seoul, South Korea
来源
ECONOMIC AND ENVIRONMENTAL GEOLOGY | 2022年 / 55卷 / 04期
关键词
Gaussian Mixture Model (GMM); multivariate; geoscience data (geo-data); machine learning; soil contamination; GROUNDWATER QUALITY; DENSITY-ESTIMATION; SOILS; CLASSIFICATION; SIMULATION; PXRF;
D O I
10.9719/EEG2022.55.4.353
中图分类号
P5 [地质学];
学科分类号
0709 ; 081803 ;
摘要
Spatial estimation of geoscience data (geo-data) is challenging due to spatial heterogeneity, data scarcity, and high dimensionality. Anovel spatial estimation method is needed to consider the characteristics of geo-data. In this study, we proposed the application ofGaussian Mixture Model (GMM) among machine learning algorithms with multivariate data for robust spatial predictions. Theperformance of the proposed approach was tested through soil chemical concentration data from a former smelting area. Theconcentrations of As and Pb determined by ex-situ ICP-AES were the primary variables to be interpolated, while the other metalconcentrations by ICP-AES and all data determined by in-situ portable X-ray fluorescence (PXRF) were used as auxiliary variables inGMM and ordinary cokriging (OCK). Among the multidimensional auxiliary variables, important variables were selected using avariable selection method based on the random forest. The results of GMM with important multivariate auxiliary data decreased theroot mean-squared error (RMSE) down to 0.11 for As and 0.33 for Pb and increased the correlations (r) up to 0.31 for As and 0.46 forPb compared to those from ordinary kriging and OCK using univariate or bivariate data. The use of GMM improved the performanceof spatial interpretation of anthropogenic metals in soil. The multivariate spatial approach can be applied to understand complex andheterogeneous geological and geochemical features.
引用
收藏
页码:353 / 366
页数:14
相关论文
共 50 条
  • [1] Multivariate data clustering for the Gaussian mixture model
    Kavaliauskas, M
    Rudzkis, R
    [J]. INFORMATICA, 2005, 16 (01) : 61 - 74
  • [2] Mixture Gaussian process model with Gaussian mixture distribution for big data
    Guan, Yaonan
    He, Shaoying
    Ren, Shuangshuang
    Liu, Shuren
    Li, Dewei
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2024, 253
  • [3] Multivariate data imputation using Gaussian mixture models
    Silva, Diogo S. F.
    Deutsch, Clayton, V
    [J]. SPATIAL STATISTICS, 2018, 27 : 74 - 90
  • [4] Estimation of Leakage Distribution Utilizing Gaussian Mixture Model
    Kwon, Hyunjeong
    Kim, Young Hwan
    Kang, Seokhyeong
    [J]. 2018 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2018, : 149 - 150
  • [5] Latent Gaussian Processes for Distribution Estimation of Multivariate Categorical Data
    Gal, Yarin
    Chen, Yutian
    Ghahramani, Zoubin
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 645 - 654
  • [6] Estimation of Spatial Distribution Considering Indirect Data Using Gaussian Process Regression
    Tsuda, Yuto
    Tomizawa, Yukihisa
    Yoshida, Ikumasa
    Otake, Yu
    [J]. GEO-RISK 2023: INNOVATION IN DATA AND ANALYSIS METHODS, 2023, 345 : 94 - 101
  • [7] Multivariate Gaussian Copula in Estimation of Distribution Algorithm with Model Migration
    Hyrs, Martin
    Schwarz, Josef
    [J]. 2014 IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTATIONAL INTELLIGENCE (FOCI), 2014, : 114 - 119
  • [8] An efficient mixture sampling model for gaussian estimation of distribution algorithm
    Dang, Qianlong
    Gao, Weifeng
    Gong, Maoguo
    [J]. INFORMATION SCIENCES, 2022, 608 : 1157 - 1182
  • [9] A copula model for non-Gaussian multivariate spatial data
    Krupskii, Pavel
    Genton, Marc G.
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 169 : 264 - 277
  • [10] PARAMETER ESTIMATION OF GAUSSIAN MIXTURE MODEL UTILIZING BOUNDARY DATA
    Omachi, Masako
    Omachi, Shinichiro
    Aso, Hirotomo
    Saito, Tsuneo
    [J]. PROCEEDINGS OF THE 38TH INTERNATIONAL CONFERENCE ON COMPUTERS AND INDUSTRIAL ENGINEERING, VOLS 1-3, 2008, : 291 - 297