Comparison of missing data imputation methods using weather data

被引:1
|
作者
Nida, Hafiza [1 ]
Kashif, Muhammad [1 ]
Khan, Muhammad Imran [1 ]
Ghamkhar, Madiha [1 ]
机构
[1] Univ Agr Faisalabad, Fac Sci, Dept Math & Stat, Faisalabad, Pakistan
来源
关键词
Rainfall; temperature; missing data; imputation methods; root mean square error; TEMPERATURE; PAKISTAN; CLIMATE; CROP;
D O I
10.21162/PAKJAS/23.228
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Researchers and data analysts commonly experience challenges while dealing with missing data for analyzing large data sets in their respective field of studies. It is necessary to handle missing data properly to obtain better and more reliable outcomes about any research. The objective of this research is to evaluate different imputation techniques for handling missing observations occurred in the weather data. For this purpose, weather data of the variables: daily rainfall, maximum temperature (Tmax) and minimum temperature (Tmin) of 23 stations of Pakistan have been taken from Pakistan Metrological department for the years 1981 to 2020. There are about 14610 total observations of each variable while each variable has different number of missing observations, called as size of missingness, at different stations. The techniques: mean imputation, k nearest neighbors (KNN) imputation, predictive mean matching (PMM) imputation and sample imputation have been considered for the estimation of missing observations found while analyzing data of each station. The minimal value of root mean square error (RMSE) is considered to decide about station-wise imputation technique because the size of missingness varied from station to station. The KNN technique is the most appropriate to estimate the missing observations of the rainfall variables for all the stations while mean imputation technique is recommended for Tmax and Tmin data; as compared to other imputation methods.
引用
收藏
页码:327 / 336
页数:10
相关论文
共 50 条
  • [31] A comparison of imputation techniques for handling missing data
    Musil, CM
    Warner, CB
    Yobas, PK
    Jones, SL
    [J]. WESTERN JOURNAL OF NURSING RESEARCH, 2002, 24 (07) : 815 - 829
  • [32] Estimation of Missing Rainfall Data Using Spatial Interpolation and Imputation Methods
    Radia, Noor Fadhilah Ahmad
    Zakaria, Roslinazairimah
    Azman, Muhammad Az-Zuhri
    [J]. 2ND ISM INTERNATIONAL STATISTICAL CONFERENCE 2014 (ISM-II): EMPOWERING THE APPLICATIONS OF STATISTICAL AND MATHEMATICAL SCIENCES, 2015, 1643 : 42 - 48
  • [33] Imputation of missing values for compositional data using classical and robust methods
    Hron, K.
    Templ, M.
    Filzmoser, P.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (12) : 3095 - 3107
  • [34] Improving Accuracy Rate of Imputation of Missing Data using Classifier Methods
    Thirukumaran, S.
    Sumathi, A.
    [J]. PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO'16), 2016,
  • [35] Comparison of Imputation Methods Based on Missing Value Detection for Multidimensional Feature Data
    Qiao, Fei
    Zhai, Xiaodong
    Wang, Qiaoling
    [J]. Tongji Daxue Xuebao/Journal of Tongji University, 2023, 51 (12): : 1972 - 1982
  • [36] IMPUTATION OF MISSING DATA
    Lunt, M.
    [J]. ANNALS OF THE RHEUMATIC DISEASES, 2014, 73 : 49 - 49
  • [37] Improved methods for the imputation of missing data by nearest neighbor methods
    Tutz, Gerhard
    Ramzan, Shahla
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2015, 90 : 84 - 99
  • [38] Some imputation methods for missing data in sample surveys
    Singh, G. N.
    Maurya, S.
    Khetan, M.
    Kadilar, Cem
    [J]. Hacettepe Journal of Mathematics and Statistics, 2016, 45 (06): : 1865 - 1880
  • [39] Ensemble imputation methods for missing software engineering data
    Twala, B
    Cartwright, M
    [J]. 2005 11TH INTERNATIONAL SYMPOSIUM ON SOFTWARE METRICS (METRICS), 2005, : 268 - 277
  • [40] Imputation methods for missing data in educational diagnostic evaluation
    Fernandez-Alonso, Ruben
    Suarez-Alvarez, Javier
    Muniz, Jose
    [J]. PSICOTHEMA, 2012, 24 (01) : 167 - 175