Missing data imputation of questionnaires by means of genetic algorithms with different fitness functions

Cited by: 40
Authors
Ordonez Galan, Celestino [1]
Sanchez Lasheras, Fernando [2]
Javier de Cos Juez, Francisco [1]
Bernardo Sanchez, Antonio [2]
Affiliations
[1] Univ Oviedo, Dept Min Exploitat & Prospecting, C Independencia 13, Oviedo 33004, Spain
[2] Univ Oviedo, Dept Construct & Mfg Engn, Gijon 33204, Spain
Keywords
Imputation method; Item response theory; Genetic algorithms; Multivariate imputation by chained equations (MICE); Missing data; CYANOTOXINS PRESENCE; MODEL; REGRESSION
DOI
10.1016/j.cam.2016.08.012
Chinese Library Classification (CLC) number
O29 [Applied mathematics]
Discipline classification code
070104
Abstract
This article proposes a new missing data imputation method based on genetic algorithms (GA). The algorithm is intended for completing missing data in knowledge and skills tests. It uses both the Bayesian and the Akaike information criteria as fitness functions and applies them to the classical item response theory models of one, two and three parameters. The results obtained with the new algorithm were compared with those achieved by the Multivariate Imputation by Chained Equations (MICE) algorithm. For all the missing data ratios checked, the average incorrect imputation percentages obtained with the GA were statistically significantly lower than those obtained with the MICE method. The most favorable settings for the algorithm developed in this research are questionnaires in which missing answers can be considered missing completely at random (MCAR), that is, questionnaires in which all examinees receive the same questions, though not necessarily in the same order. © 2016 Elsevier B.V. All rights reserved.
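The idea in the abstract, a genetic algorithm whose fitness is an information criterion of an item response model fitted to the completed answer matrix, can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names (`aic`, `bic`, `rasch_loglik`, `fitness`, `ga_impute`), the GA settings, and the crude logit-based parameter estimates for a one-parameter (Rasch) model are all assumptions made for illustration. Only the AIC and BIC formulas themselves are standard.

```python
import math
import random


def aic(loglik, k):
    """Akaike information criterion: 2k - 2 ln L (lower is better)."""
    return 2 * k - 2 * loglik


def bic(loglik, k, n):
    """Bayesian information criterion: k ln n - 2 ln L (lower is better)."""
    return k * math.log(n) - 2 * loglik


def _logit(p, eps=0.05):
    # Clamp proportions away from 0 and 1 so the logit stays finite.
    p = min(max(p, eps), 1 - eps)
    return math.log(p / (1 - p))


def rasch_loglik(data):
    """Log-likelihood of a 0/1 matrix under a one-parameter (Rasch) model.

    Assumption: abilities and difficulties are approximated by logits of
    row and column proportions instead of a proper maximum-likelihood fit.
    """
    n_rows, n_cols = len(data), len(data[0])
    theta = [_logit(sum(row) / n_cols) for row in data]
    b = [-_logit(sum(row[j] for row in data) / n_rows) for j in range(n_cols)]
    ll = 0.0
    for i, row in enumerate(data):
        for j, x in enumerate(row):
            p = 1 / (1 + math.exp(-(theta[i] - b[j])))
            ll += math.log(p) if x == 1 else math.log(1 - p)
    return ll


def fitness(data, missing, genes, criterion="bic"):
    """Score one candidate imputation; genes[k] fills the cell missing[k]."""
    filled = [row[:] for row in data]
    for (i, j), g in zip(missing, genes):
        filled[i][j] = g
    ll = rasch_loglik(filled)
    k = len(filled) + len(filled[0])   # simplified parameter count
    n = len(filled) * len(filled[0])   # number of observed responses
    return bic(ll, k, n) if criterion == "bic" else aic(ll, k)


def ga_impute(data, criterion="bic", pop_size=20, generations=30,
              p_mut=0.1, seed=0):
    """Impute None cells of a 0/1 matrix with a small elitist GA."""
    rng = random.Random(seed)
    missing = [(i, j) for i, row in enumerate(data)
               for j, x in enumerate(row) if x is None]
    if not missing:
        return [row[:] for row in data]

    def score(genes):
        return fitness(data, missing, genes, criterion)

    pop = [[rng.randint(0, 1) for _ in missing] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=score)
        parents = pop[: pop_size // 2]          # keep the better half
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randint(0, len(missing))  # one-point crossover
            child = a[:cut] + b[cut:]
            child = [1 - g if rng.random() < p_mut else g for g in child]
            children.append(child)
        pop = parents + children
    best = min(pop, key=score)
    result = [row[:] for row in data]
    for (i, j), g in zip(missing, best):
        result[i][j] = g
    return result
```

For example, `ga_impute([[1, 1, None, 1], [0, 0, 0, None], [1, None, 1, 1], [0, 1, 0, 0]])` returns a completed copy of the matrix with the observed answers left unchanged. In this simplified sketch the parameter count and sample size are the same for every candidate, so AIC and BIC rank candidates identically. The two criteria would only diverge when comparing models with different numbers of parameters, such as the one-, two- and three-parameter models named in the abstract.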
Pages: 704-717
Page count: 14