Optimization of missing value imputation for neural networks

被引:3
|
作者
Han, Jongmin [1 ]
Kang, Seokho [1 ]
机构
[1] Sungkyunkwan Univ, Dept Ind Engn, 2066 Seobu Ro, Suwon 16419, South Korea
基金
新加坡国家研究基金会;
关键词
Machine learning; Neural network; Data incompleteness; Missing value imputation; FUZZY C-MEANS; MULTIPLE IMPUTATION; CLASSIFICATION; REGRESSION;
D O I
10.1016/j.ins.2023.119668
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To train a neural network with an incomplete dataset, missing values can be replaced with plausible substitutions using missing value imputation. Various missing value imputers are available for use, each with its own competencies. Using multiple different imputers can improve the predictive performance of neural networks. Existing methods selected the best imputer or combined multiple imputers, irrespective of the training of the neural network. In this study, we propose an Optimization of Missing Value Imputation (OptMVI) method for improved training of a neural network in the presence of missing values in a training dataset. For each instance in the training dataset, multiple imputations are obtained from different imputers. A convex combination of the imputations is then used as the input for the neural network, with the combination weights indicating the relative contribution of each imputer. We simultaneously train the combination weights and neural network. This allows the combination weights to be optimized toward improving the predictive performance of the neural network. Through experimental evaluation on benchmark datasets with varying missing rates, we demonstrate that the proposed method outperforms the existing methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Missing value imputation in multivariate time series with end-to-end generative adversarial networks
    Zhang, Ying
    Zhou, Baohang
    Cai, Xiangrui
    Guo, Wenya
    Ding, Xiaoke
    Yuan, Xiaojie
    Information Sciences, 2021, 551 : 67 - 82
  • [42] Simultaneous Missing Value Imputation and Structure Learning with Groups
    Morales-Alvarez, Pablo
    Gong, Wenbo
    Lamb, Angus
    Woodhead, Simon
    Jones, Simon Peyton
    Pawlowski, Nick
    Allamanis, Miltiadis
    Zhang, Cheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [43] Missing value imputation in longitudinal measures of alcohol consumption
    Grittner, Ulrike
    Gmel, Gerhard
    Ripatti, Samuli
    Bloomfield, Kim
    Wicki, Matthias
    INTERNATIONAL JOURNAL OF METHODS IN PSYCHIATRIC RESEARCH, 2011, 20 (01) : 50 - 61
  • [44] Combining instance selection for better missing value imputation
    Tsai, Chih-Fong
    Chang, Fu-Yu
    JOURNAL OF SYSTEMS AND SOFTWARE, 2016, 122 : 63 - 71
  • [45] Fuzzy min-max neural networks for categorical data: application to missing data imputation
    Rey-del-Castillo, Pilar
    Cardenosa, Jesus
    NEURAL COMPUTING & APPLICATIONS, 2012, 21 (06): : 1349 - 1362
  • [46] Missing Value Imputation via Clusterwise Linear Regression
    Karmitsa, Napsu
    Taheri, Sona
    Bagirov, Adil
    Makinen, Pauliina
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (04) : 1889 - 1901
  • [47] On the use of adaptive nearest neighbors for missing value imputation
    Jhun, Myoungshic
    Jeong, Hyeong Chul
    Koo, Ja-Yong
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2007, 36 (06) : 1275 - 1286
  • [48] imputeTS: Time Series Missing Value Imputation in R
    Moritz, Steffen
    Bartz-Beielstein, Thomas
    R JOURNAL, 2017, 9 (01): : 207 - 218
  • [49] A Review On Missing Value Estimation Using Imputation Algorithm
    Armina, Roslan
    Zain, Azlan Mohd
    Ali, Nor Azizah
    Sallehuddin, Roselina
    6TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL MATHEMATICS (ICCSCM 2017), 2017, 892
  • [50] A hybrid imputation approach for microarray missing value estimation
    Huihui Li
    Changbo Zhao
    Fengfeng Shao
    Guo-Zheng Li
    Xiao Wang
    BMC Genomics, 16