Environmental data mining and modeling based on machine learning algorithms and geostatistics

被引:91
|
作者
Kanevski, M
Parkin, R
Pozdnukhov, A
Timonin, V
Maignan, M
Demyanov, V
Canu, S
机构
[1] Russian Acad Sci, Nucl Safety Inst, IBRAE, Environm Modeling & Syst Anal Lab, Moscow 113191, Russia
[2] IDIAP Dalle Molle Inst Perceptual Artificial Inte, CH-1920 Martigny, Switzerland
[3] Univ Lausanne, Lausanne, Switzerland
[4] Moscow MV Lomonosov State Univ, Dept Phys, Div Math, Moscow, Russia
[5] INSA Rouen, Rouen, France
关键词
environmental data mining and assimilation; geostatistics; machine learning; stochastic simulation; radioactive pollution;
D O I
10.1016/j.envsoft.2003.03.004
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The paper presents some contemporary approaches to spatial environmental data analysis. The main topics are concentrated on the decision-oriented problems of environmental spatial data mining and modeling: valorization and representativity of data with the help of exploratory data analysis, spatial predictions, probabilistic and risk mapping, development and application of conditional stochastic simulation models. The innovative part of the paper presents integrated/hybrid model-machine learning (ML) residuals sequential simulations-MLRSS. The models are based on multilayer perceptron and support vector regression ML algorithms used for modeling long-range spatial trends and sequential simulations of the residuals. NIL algorithms deliver non-linear solution for the spatial non-stationary problems, which are difficult for geostatistical approach. Geostatistical tools (variography) are used to characterize performance of ML algorithms, by analyzing quality and quantity of the spatially structured information extracted from data with ML algorithms. Sequential simulations provide efficient assessment of uncertainty and spatial variability. Case study from the Chernobyl fallouts illustrates the performance of the proposed model. It is shown that probability mapping, provided by the combination of ML data driven and geostatistical model based approaches, can be efficiently used in decision-making process. (C) 2003 Elsevier Ltd. All rights reserved.
引用
收藏
页码:845 / 855
页数:11
相关论文
共 50 条
  • [1] Algorithms for Data Mining and Machine Learning
    Schulz, Volker H.
    [J]. SIAM REVIEW, 2020, 62 (03) : 739 - 739
  • [2] The Application of Machine Learning Algorithms in Data Mining
    Zhang, Wei
    [J]. 2016 INTERNATIONAL CONFERENCE ON INFORMATION ENGINEERING AND COMMUNICATIONS TECHNOLOGY (IECT 2016), 2016, : 521 - 527
  • [3] Research on Data Mining of Network Security Hazards Based on Machine Learning Algorithms
    Wu, Liwan
    Yang, Chong
    [J]. Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [4] Comparison of various uncertainty modelling approaches based on geostatistics and machine learning algorithms
    Szatmari, Gabor
    Pasztor, Laszlo
    [J]. GEODERMA, 2019, 337 : 1329 - 1340
  • [5] Application of machine learning algorithms for clinical predictive modeling: a data-mining approach in SCT
    Shouval, R.
    Bondi, O.
    Mishan, H.
    Shimoni, A.
    Unger, R.
    Nagler, A.
    [J]. BONE MARROW TRANSPLANTATION, 2014, 49 (03) : 332 - 337
  • [6] Application of machine learning algorithms for clinical predictive modeling: a data-mining approach in SCT
    R Shouval
    O Bondi
    H Mishan
    A Shimoni
    R Unger
    A Nagler
    [J]. Bone Marrow Transplantation, 2014, 49 : 332 - 337
  • [7] Mining Mixed Data Bases Using Machine Learning Algorithms
    Kuri-Morales, Angel
    [J]. PATTERN RECOGNITION, MCPR 2022, 2022, 13264 : 70 - 80
  • [8] Machine learning algorithms for analysis and modeling of GeoSpatial data
    Pozdnoukhov, Alexei
    Kanevski, Mikhail
    [J]. PROCEEDINGS OF THE IAMG '07: GEOMATHEMATICS AND GIS ANALYSIS OF RESOURCES, ENVIRONMENT AND HAZARDS, 2007, : 216 - +
  • [9] Extreme Precipitation Modelling Using Geostatistics and Machine Learning Algorithms
    Foresti, Loris
    Pozdnoukhov, Alexei
    Tuia, Devis
    Kanevski, Mikhail
    [J]. GEOENV VII - GEOSTATISTICS FOR ENVIRONMENTAL APPLICATIONS, 2010, 16 : 41 - 52
  • [10] Overview of Data Mining Based on Machine Learning
    Zhou, Jia-Sheng
    Cai, Zhi-Yuan
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMMUNICATION ENGINEERING (CSCE 2015), 2015, : 51 - 56