Optimization of missing value imputation for neural networks

被引:3
|
作者
Han, Jongmin [1 ]
Kang, Seokho [1 ]
机构
[1] Sungkyunkwan Univ, Dept Ind Engn, 2066 Seobu Ro, Suwon 16419, South Korea
基金
新加坡国家研究基金会;
关键词
Machine learning; Neural network; Data incompleteness; Missing value imputation; FUZZY C-MEANS; MULTIPLE IMPUTATION; CLASSIFICATION; REGRESSION;
D O I
10.1016/j.ins.2023.119668
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To train a neural network with an incomplete dataset, missing values can be replaced with plausible substitutions using missing value imputation. Various missing value imputers are available for use, each with its own competencies. Using multiple different imputers can improve the predictive performance of neural networks. Existing methods selected the best imputer or combined multiple imputers, irrespective of the training of the neural network. In this study, we propose an Optimization of Missing Value Imputation (OptMVI) method for improved training of a neural network in the presence of missing values in a training dataset. For each instance in the training dataset, multiple imputations are obtained from different imputers. A convex combination of the imputations is then used as the input for the neural network, with the combination weights indicating the relative contribution of each imputer. We simultaneously train the combination weights and neural network. This allows the combination weights to be optimized toward improving the predictive performance of the neural network. Through experimental evaluation on benchmark datasets with varying missing rates, we demonstrate that the proposed method outperforms the existing methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Missing Value Imputation: With Application to Handwriting Data
    Xu, Zhen
    Srihari, Sargur N.
    DOCUMENT RECOGNITION AND RETRIEVAL XXII, 2015, 9402
  • [32] Soft Clustering Based Missing Value Imputation
    Raja, P. S.
    Thangavel, K.
    DIGITAL CONNECTIVITY - SOCIAL IMPACT, 2016, 679 : 119 - 133
  • [33] The importance of batch sensitization in missing value imputation
    Harvard Wai Hann Hui
    Weijia Kong
    Hui Peng
    Wilson Wen Bin Goh
    Scientific Reports, 13
  • [34] IFGAN: Missing Value Imputation using Feature-specific Generative Adversarial Networks
    Qiu, Wei
    Huang, Yangsibo
    Li, Quanzheng
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 4715 - 4723
  • [35] Missing Data Imputation in Cardiometabolic Risk Assessment: A Solution Based on Artificial Neural Networks
    Vrbaski, Dunja
    Kupusinac, Aleksandar
    Doroslovacki, Rade
    Stokic, Edita
    Ivetic, Dragan
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2020, 17 (02) : 379 - 401
  • [36] Fuzzy min–max neural networks for categorical data: application to missing data imputation
    Pilar Rey-del-Castillo
    Jesús Cardeñosa
    Neural Computing and Applications, 2012, 21 : 1349 - 1362
  • [37] Medical examination data prediction with missing information imputation based on recurrent neural networks
    Kim, Han-Gyu
    Jang, Gil-Jin
    Choi, Ho-Jin
    Lim, Myungeun
    Choi, Jaehun
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2018, 19 (03) : 202 - 220
  • [38] Imputation off Missing Values by Integrating Neural Networks and Case-based Reasoning
    Ennett, Colleen M.
    Frize, Monique
    Walker, C. Robin
    2008 30TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-8, 2008, : 4337 - +
  • [39] Missing value imputation in multivariate time series with end-to-end generative adversarial networks
    Zhang, Ying
    Zhou, Baohang
    Cai, Xiangrui
    Guo, Wenya
    Ding, Xiaoke
    Yuan, Xiaojie
    INFORMATION SCIENCES, 2021, 551 : 67 - 82
  • [40] Collateral missing value imputation: a new robust missing value estimation algorithm for microarray data
    Sehgal, MSB
    Gondal, I
    Dooley, LS
    BIOINFORMATICS, 2005, 21 (10) : 2417 - 2423