Adaptive RBF Interpolation for Estimating Missing Values in Geographical Data

Cited by: 4
Authors
Gao, Kaifeng [1]
Mei, Gang [1]
Cuomo, Salvatore [2]
Piccialli, Francesco [2]
Xu, Nengxiong [1]
Affiliations
[1] China Univ Geosci Beijing, Beijing, Peoples R China
[2] Univ Naples Federico II, Naples, Italy
Source
NUMERICAL COMPUTATIONS: THEORY AND ALGORITHMS, PT I | 2020 / Vol. 11973
Funding
National Natural Science Foundation of China;
Keywords
Data mining; Data quality; Data imputation; RBF interpolation; kNN;
DOI
10.1007/978-3-030-39081-5_12
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
Dataset quality is a critical issue in big data mining: higher-quality datasets make it possible to reach more interesting findings. Missing values in geographical data degrade the quality of large datasets. To improve data quality, missing values generally need to be estimated using machine learning algorithms or mathematical methods such as approximation and interpolation. In this paper, we propose an adaptive Radial Basis Function (RBF) interpolation algorithm for estimating missing values in geographical data. In the proposed method, samples with known values are treated as data points, and samples with missing values are treated as interpolated points. For each interpolated point, a local set of data points is first determined adaptively. The missing value at the interpolated point is then imputed by RBF interpolation over this local set. The shape factors of the RBF are also determined adaptively from the distribution of the local set of data points. To evaluate the proposed method, we compare it with the commonly used k-Nearest Neighbor (kNN) interpolation and Adaptive Inverse Distance Weighted (AIDW) interpolation in three groups of benchmark experiments. The experimental results indicate that the proposed method is more accurate than both kNN and AIDW interpolation but less efficient than both.
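The workflow described in the abstract (pick a local set of data points, derive a shape factor from their distribution, then interpolate) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state how the local set size or the shape factor is chosen, so the function name `local_rbf_impute`, the fixed neighbor count `k`, the multiquadric kernel, and the rule "shape factor = mean nearest-neighbor spacing of the local set" are all assumptions made for illustration. The sketch also assumes at least two known samples with distinct coordinates.

```python
import numpy as np

def local_rbf_impute(known_xy, known_values, query_xy, k=8):
    """Estimate values at query_xy from the k nearest known samples.

    Assumptions (not taken from the paper): a fixed k, a multiquadric
    kernel phi(r) = sqrt(r^2 + c^2), and a shape factor c equal to the
    mean nearest-neighbor spacing inside each local set.
    """
    known_xy = np.asarray(known_xy, dtype=float)
    known_values = np.asarray(known_values, dtype=float)
    query_xy = np.atleast_2d(np.asarray(query_xy, dtype=float))
    k = min(k, len(known_xy))
    estimates = np.empty(len(query_xy))

    for i, q in enumerate(query_xy):
        # 1. Local set: the k known samples closest to the query point.
        dist_to_q = np.linalg.norm(known_xy - q, axis=1)
        idx = np.argsort(dist_to_q)[:k]
        pts, vals = known_xy[idx], known_values[idx]

        # 2. Shape factor from the local point distribution
        #    (hypothetical rule: mean nearest-neighbor distance).
        pair = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=2)
        np.fill_diagonal(pair, np.inf)   # ignore self-distances
        c = pair.min(axis=1).mean()
        np.fill_diagonal(pair, 0.0)      # restore self-distances

        # 3. Solve the local interpolation system A w = vals.
        A = np.sqrt(pair**2 + c**2)
        w = np.linalg.solve(A, vals)

        # 4. Evaluate the interpolant at the query point.
        estimates[i] = np.sqrt(dist_to_q[idx]**2 + c**2) @ w
    return estimates
```

Calling `local_rbf_impute(xy, values, missing_xy)` with an `(n, 2)` coordinate array, an `(n,)` value array, and an `(m, 2)` array of locations with missing values returns `m` estimates. Solving one small dense system per interpolated point is also a plausible reason why a local RBF scheme costs more than kNN or IDW-style averaging, which is in line with the efficiency trade-off the abstract reports.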
Pages: 122-130
Page count: 9