Missing data imputation of questionnaires by means of genetic algorithms with different fitness functions

被引:40
|
作者
Ordonez Galan, Celestino [1 ]
Sanchez Lasheras, Fernando [2 ]
Javier de Cos Juez, Francisco [1 ]
Bernardo Sanchez, Antonio [2 ]
机构
[1] Univ Oviedo, Dept Min Exploitat & Prospecting, C Independencia 13, Oviedo 33004, Spain
[2] Univ Oviedo, Dept Construct & Mfg Engn, Gijon 33204, Spain
关键词
Imputation method; Item response theory; Genetic algorithms; Multivariate imputation by chained equations (MICE); Missing data; CYANOTOXINS PRESENCE; MODEL; REGRESSION;
D O I
10.1016/j.cam.2016.08.012
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
This article proposes a new missing data imputation method based on genetic algorithms. The algorithm presented in this paper is a useful tool for the completion of missing data in knowledge and skills tests. This algorithm uses both Bayesian and Akaike's information criterions as fitness functions and applies them to the classical item response theory models of one, two and three parameters. The results obtained by this new algorithm have been compared with those achieved by means of the Multivariate Imputation by Chained Equations (MICE) algorithm. For all the missing data ratios checked, the average incorrect imputation percentages obtained with the GA algorithm were, statistically, significantly lower than the results obtained with the MICE method. The most favorable frameworks for the use of the algorithm developed in the present research are those questionnaires in which missing answers would be considered as missing completely at random (MCAR). In other words, those questionnaires in which the same questions are present for all the examinees, but not necessarily in the same order. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:704 / 717
页数:14
相关论文
共 50 条
  • [21] Traffic Missing Data Imputation: A Selective Overview of Temporal Theories and Algorithms
    Sun, Tuo
    Zhu, Shihao
    Hao, Ruochen
    Sun, Bo
    Xie, Jiemin
    [J]. MATHEMATICS, 2022, 10 (14)
  • [22] Note about Bias in Bayesian Genetic Algorithms for Discrete Missing Values Imputation
    Migdady, Hazem
    Alrabaiah, Hussam
    Al-Talib, Mohammad
    [J]. 2019 INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2019, : 87 - 90
  • [23] On imputation for planned missing data in context questionnaires using plausible values: a comparison of three designs
    Kaplan, David
    Su, Dan
    [J]. LARGE-SCALE ASSESSMENTS IN EDUCATION, 2018, 6
  • [24] Imputation of genetic composition for missing pedigree data in Serrasalmidae using morphometric data
    Costa, Adriano Carvalho
    Balestre, Marcio
    Botelho, Hortencia Aparecida
    Fonseca de Freitas, Rilke Tadeu
    da Silva Gomes, Richardson Cesar
    de Sousa Campos, Sergio Augusto
    Foresti, Fabio Porto
    Hashimoto, Diogo Teruo
    Martins, Diego Galetti
    do Prado, Fernanda Dotti
    Correa Mendonca, Maria Andreia
    [J]. SCIENTIA AGRICOLA, 2017, 74 (06): : 443 - 449
  • [25] The impact of heterogeneous distance functions on missing data imputation and classification performance
    Santos, Miriam Seoane
    Abreu, Pedro Henriques
    Fernandez, Alberto
    Luengo, Julian
    Santos, Joao
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 111
  • [26] Evaluation of different approaches for missing data imputation on features associated to genomic data
    Petrazzini, Ben Omega
    Naya, Hugo
    Lopez-Bello, Fernando
    Vazquez, Gustavo
    Spangenberg, Lucia
    [J]. BIODATA MINING, 2021, 14 (01)
  • [27] Missing Data Imputation of Solar Radiation Data under Different Atmospheric Conditions
    Crespo Turrado, Concepcion
    Meizoso Lopez, Maria del Carmen
    Sanchez Lasheras, Fernando
    Rodriguez Gomez, Benigno Antonio
    Calvo Rolle, Jose Luis
    de Cos Juez, Francisco Javier
    [J]. SENSORS, 2014, 14 (11) : 20382 - 20399
  • [28] Evaluation of different approaches for missing data imputation on features associated to genomic data
    Ben Omega Petrazzini
    Hugo Naya
    Fernando Lopez-Bello
    Gustavo Vazquez
    Lucía Spangenberg
    [J]. BioData Mining, 14
  • [29] A Genetic Programming-Based Imputation Method for Classification with Missing Data
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    [J]. GENETIC PROGRAMMING, EUROGP 2016, 2016, 9594 : 149 - 163
  • [30] Missing Values Imputation Using Genetic Algorithm for the Analysis of Traffic Data
    Midde, Ranjit Reddy
    Srinivasa, K. G.
    Reddy, Eswara B.
    [J]. ARTIFICIAL INTELLIGENCE AND EVOLUTIONARY COMPUTATIONS IN ENGINEERING SYSTEMS, ICAIECES 2017, 2018, 668 : 251 - 261