Optimization of missing value imputation for neural networks

被引:1
|
作者
Han, Jongmin [1 ]
Kang, Seokho [1 ]
机构
[1] Sungkyunkwan Univ, Dept Ind Engn, 2066 Seobu Ro, Suwon 16419, South Korea
基金
新加坡国家研究基金会;
关键词
Machine learning; Neural network; Data incompleteness; Missing value imputation; FUZZY C-MEANS; MULTIPLE IMPUTATION; CLASSIFICATION; REGRESSION;
D O I
10.1016/j.ins.2023.119668
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To train a neural network with an incomplete dataset, missing values can be replaced with plausible substitutions using missing value imputation. Various missing value imputers are available for use, each with its own competencies. Using multiple different imputers can improve the predictive performance of neural networks. Existing methods selected the best imputer or combined multiple imputers, irrespective of the training of the neural network. In this study, we propose an Optimization of Missing Value Imputation (OptMVI) method for improved training of a neural network in the presence of missing values in a training dataset. For each instance in the training dataset, multiple imputations are obtained from different imputers. A convex combination of the imputations is then used as the input for the neural network, with the combination weights indicating the relative contribution of each imputer. We simultaneously train the combination weights and neural network. This allows the combination weights to be optimized toward improving the predictive performance of the neural network. Through experimental evaluation on benchmark datasets with varying missing rates, we demonstrate that the proposed method outperforms the existing methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Imputation of missing data with neural networks for classification
    Choudhury, Suyra Jyoti
    Pal, Nikhil R.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 182
  • [2] Optimization of Missing Value Imputation using Reinforcement Programming
    Rachmawan, Irene Erlyn Wina
    Barakbah, Ali Ridho
    [J]. 2015 International Electronics Symposium (IES), 2015, : 128 - 133
  • [3] SCALABLE MISSING DATA IMPUTATION WITH GRAPH NEURAL NETWORKS
    Lachaud, Guillaume
    Conde-Cespedes, Patricia
    Trocan, Maria
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW, 2023,
  • [4] Missing values imputation techniques for Neural Networks patterns
    Lopez-Molina, Thomas
    Perez-Mendez, Anna
    Rivas-Echeverria, Francklin
    [J]. NEW ASPECTS OF SYSTEMS, PTS I AND II, 2008, : 290 - +
  • [5] Distributed Neural Networks for Missing Big Data Imputation
    Petrozziello, Alessio
    Jordanov, Ivan
    Sommeregger, Christian
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018, : 131 - 138
  • [6] Long-term missing value imputation for time series data using deep neural networks
    Park, Jangho
    Muller, Juliane
    Arora, Bhavna
    Faybishenko, Boris
    Pastorello, Gilberto
    Varadharajan, Charuleka
    Sahu, Reetik
    Agarwal, Deborah
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (12): : 9071 - 9091
  • [7] Long-term missing value imputation for time series data using deep neural networks
    Jangho Park
    Juliane Müller
    Bhavna Arora
    Boris Faybishenko
    Gilberto Pastorello
    Charuleka Varadharajan
    Reetik Sahu
    Deborah Agarwal
    [J]. Neural Computing and Applications, 2023, 35 : 9071 - 9091
  • [8] An Optimization Algorithm for Missing Value Imputation in Microarray Based on Integrated Information
    Liu, Feng
    Zhang, Yiding
    Liu, Zeming
    Gao, Meng
    [J]. FUZZY SYSTEMS AND DATA MINING V (FSDM 2019), 2019, 320 : 55 - 64
  • [9] Missing Value Imputation of Time-Series Air-Quality Data via Deep Neural Networks
    Kim, Taesung
    Kim, Jinhee
    Yang, Wonho
    Lee, Hunjoo
    Choo, Jaegul
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (22)
  • [10] Missing Pavement Performance Data Imputation Using Graph Neural Networks
    Gao, Lu
    Yu, Ke
    Lu, Pan
    [J]. TRANSPORTATION RESEARCH RECORD, 2022, 2676 (12) : 409 - 419