Optimization of missing value imputation for neural networks

被引:1
|
作者
Han, Jongmin [1 ]
Kang, Seokho [1 ]
机构
[1] Sungkyunkwan Univ, Dept Ind Engn, 2066 Seobu Ro, Suwon 16419, South Korea
基金
新加坡国家研究基金会;
关键词
Machine learning; Neural network; Data incompleteness; Missing value imputation; FUZZY C-MEANS; MULTIPLE IMPUTATION; CLASSIFICATION; REGRESSION;
D O I
10.1016/j.ins.2023.119668
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To train a neural network with an incomplete dataset, missing values can be replaced with plausible substitutions using missing value imputation. Various missing value imputers are available for use, each with its own competencies. Using multiple different imputers can improve the predictive performance of neural networks. Existing methods selected the best imputer or combined multiple imputers, irrespective of the training of the neural network. In this study, we propose an Optimization of Missing Value Imputation (OptMVI) method for improved training of a neural network in the presence of missing values in a training dataset. For each instance in the training dataset, multiple imputations are obtained from different imputers. A convex combination of the imputations is then used as the input for the neural network, with the combination weights indicating the relative contribution of each imputer. We simultaneously train the combination weights and neural network. This allows the combination weights to be optimized toward improving the predictive performance of the neural network. Through experimental evaluation on benchmark datasets with varying missing rates, we demonstrate that the proposed method outperforms the existing methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] A Two-Stage Missing Value Imputation Method Based on Autoencoder Neural Network
    Yu, Jiayin
    He, Yulin
    Huang, Joshua Zhexue
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 6064 - 6066
  • [22] Missing Value Imputation Techniques Depth Survey And an Imputation Algorithm To Improve The Efficiency Of Imputation
    Thirukumaran, S.
    Sumathi, A.
    [J]. 2012 FOURTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2012,
  • [23] The importance of batch sensitization in missing value imputation
    Hui, Harvard Wai Hann
    Kong, Weijia
    Peng, Hui
    Goh, Wilson Wen Bin
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [24] Missing value imputation strategies for metabolomics data
    Grace Armitage, Emily
    Godzien, Joanna
    Alonso-Herranz, Vanesa
    Lopez-Gonzalvez, Angeles
    Barbas, Coral
    [J]. ELECTROPHORESIS, 2015, 36 (24) : 3050 - 3060
  • [25] Missing Value Imputation Using Correlation Coefficient
    Manna, Sweta
    Pati, Soumen Kumar
    [J]. COMPUTATIONAL INTELLIGENCE IN PATTERN RECOGNITION, CIPR 2020, 2020, 1120 : 551 - 558
  • [26] A Comprehensive Bibliometric Analysis of Missing Value Imputation
    Nugroho, Heru
    Surendro, Kridanto
    [J]. IEEE ACCESS, 2024, 12 : 14819 - 14846
  • [27] Missing Value Imputation on Multidimensional Time Series
    Bansal, Parikshit
    Deshpande, Prathamesh
    Sarawagi, Sunita
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (11): : 2533 - 2545
  • [28] Triple Imputation for Microarray Missing Value Estimation
    He, Chong
    Li, Hui-Hui
    Zhao, Changbo
    Li, Guo-Zheng
    Zhang, Wei
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 208 - 213
  • [29] The importance of batch sensitization in missing value imputation
    Harvard Wai Hann Hui
    Weijia Kong
    Hui Peng
    Wilson Wen Bin Goh
    [J]. Scientific Reports, 13
  • [30] Missing Value Imputation: With Application to Handwriting Data
    Xu, Zhen
    Srihari, Sargur N.
    [J]. DOCUMENT RECOGNITION AND RETRIEVAL XXII, 2015, 9402