Environmental data mining and modeling based on machine learning algorithms and geostatistics

被引:91
|
作者
Kanevski, M
Parkin, R
Pozdnukhov, A
Timonin, V
Maignan, M
Demyanov, V
Canu, S
机构
[1] Russian Acad Sci, Nucl Safety Inst, IBRAE, Environm Modeling & Syst Anal Lab, Moscow 113191, Russia
[2] IDIAP Dalle Molle Inst Perceptual Artificial Inte, CH-1920 Martigny, Switzerland
[3] Univ Lausanne, Lausanne, Switzerland
[4] Moscow MV Lomonosov State Univ, Dept Phys, Div Math, Moscow, Russia
[5] INSA Rouen, Rouen, France
关键词
environmental data mining and assimilation; geostatistics; machine learning; stochastic simulation; radioactive pollution;
D O I
10.1016/j.envsoft.2003.03.004
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The paper presents some contemporary approaches to spatial environmental data analysis. The main topics are concentrated on the decision-oriented problems of environmental spatial data mining and modeling: valorization and representativity of data with the help of exploratory data analysis, spatial predictions, probabilistic and risk mapping, development and application of conditional stochastic simulation models. The innovative part of the paper presents integrated/hybrid model-machine learning (ML) residuals sequential simulations-MLRSS. The models are based on multilayer perceptron and support vector regression ML algorithms used for modeling long-range spatial trends and sequential simulations of the residuals. NIL algorithms deliver non-linear solution for the spatial non-stationary problems, which are difficult for geostatistical approach. Geostatistical tools (variography) are used to characterize performance of ML algorithms, by analyzing quality and quantity of the spatially structured information extracted from data with ML algorithms. Sequential simulations provide efficient assessment of uncertainty and spatial variability. Case study from the Chernobyl fallouts illustrates the performance of the proposed model. It is shown that probability mapping, provided by the combination of ML data driven and geostatistical model based approaches, can be efficiently used in decision-making process. (C) 2003 Elsevier Ltd. All rights reserved.
引用
收藏
页码:845 / 855
页数:11
相关论文
共 50 条
  • [41] Machine learning and robust data mining
    Croux, Christophe
    Leuven, K. U.
    Gallopoulos, Efstratios
    Van Aelst, Stefan
    Zha, Hongyuan
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) : 151 - 154
  • [42] DATA MINING AND MACHINE LEARNING IN ASTRONOMY
    Ball, Nicholas M.
    Brunner, Robert J.
    [J]. INTERNATIONAL JOURNAL OF MODERN PHYSICS D, 2010, 19 (07): : 1049 - 1106
  • [43] Machine learning for mining imbalanced data
    Arafat, Md. Yasir
    Hoque, Sabera
    Xu, Shuxiang
    Farid, Dewan Md
    [J]. IAENG International Journal of Computer Science, 2019, 46 (02) : 332 - 348
  • [44] Machine learning for data mining in medicine
    Lavrac, N
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 1999, 1620 : 47 - 62
  • [45] Fuzzy machine learning and data mining
    Huellermeier, Eyke
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (04) : 269 - 283
  • [46] Analysis and Modeling of Geodetic Data Based on Machine Learning
    Wu, Tong
    [J]. Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [47] A machine learning based data modeling for medical diagnosis
    Mahoto, Naeem Ahmed
    Shaikh, Asadullah
    Sulaiman, Adel
    Reshan, Mana Saleh Al
    Rajab, Adel
    Rajab, Khairan
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 81
  • [48] Outlier data mining model for sports data analysis based on machine learning
    Yin, Zhimeng
    Cui, Wei
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (02) : 2733 - 2742
  • [49] Urban landscape modeling and algorithms under machine learning and remote sensing data
    Song, Ting
    Lu, Guoying
    [J]. EARTH SCIENCE INFORMATICS, 2024, 17 (03) : 2303 - 2316
  • [50] Bagging-based machine learning algorithms for landslide susceptibility modeling
    Tingyu Zhang
    Quan Fu
    Hao Wang
    Fangfang Liu
    Huanyuan Wang
    Ling Han
    [J]. Natural Hazards, 2022, 110 : 823 - 846