WIMP: Web server tool for missing data imputation

被引:4
|
作者
Urda, D. [1 ]
Subirats, J. L. [1 ]
Garcia-Laencina, P. J.
Franco, L. [1 ]
Sancho-Gomez, J. L. [2 ]
Jerez, J. M. [1 ]
机构
[1] Univ Malaga, Dept Lenguajes & Ciencias Comp, ETSI Informat, E-29071 Malaga, Spain
[2] Univ Politecn Cartagena, Dept Tecnol Informac & Comunicac, Cartagena, Spain
关键词
Imputation; Missing data; Machine learning; Web application; EMPIRICAL LIKELIHOOD; MICROARRAY DATA; LINEAR-MODELS; REGRESSION; CLASSIFICATION; ALGORITHM; VALUES;
D O I
10.1016/j.cmpb.2012.08.006
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The imputation of unknown or missing data is a crucial task on the analysis of biomedical datasets. There are several situations where it is necessary to classify or identify instances given incomplete vectors, and the existence of missing values can much degrade the performance of the algorithms used for the classification/recognition. The task of learning accurately from incomplete data raises a number of issues some of which have not been completely solved in machine learning applications. In this sense, effective missing value estimation methods are required. Different methods for missing data imputations exist but most of the times the selection of the appropriate technique involves testing several methods, comparing them and choosing the right one. Furthermore, applying these methods, in most cases, is not straightforward, as they involve several technical details, and in particular in cases such as when dealing with microarray datasets, the application of the methods requires huge computational resources. As far as we know, there is not a public software application that can provide the computing capabilities required for carrying the task of data imputation. This paper presents a new public tool for missing data imputation that is attached to a computer cluster in order to execute high computational tasks. The software WIMP (Web IMPutation) is a public available web site where registered users can create, execute, analyze and store their simulations related to missing data imputation. (C) 2012 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:1247 / 1254
页数:8
相关论文
共 50 条
  • [41] MISSING DATA, IMPUTATION AND REGRESSION TREES
    Loh, Wei-Yin
    Zhang, Qiong
    Zhang, Wenwen
    Zhou, Peigen
    STATISTICA SINICA, 2020, 30 (04) : 1697 - 1722
  • [42] Imputation of missing data in industrial databases
    Lakshminarayan, K
    Harp, SA
    Samad, T
    APPLIED INTELLIGENCE, 1999, 11 (03) : 259 - 275
  • [43] Multiple imputation for nonignorable missing data
    Jongho Im
    Soeun Kim
    Journal of the Korean Statistical Society, 2017, 46 : 583 - 592
  • [44] RELIABILITY OF AN AUTOMATED ELECTRONIC IMPUTATION TOOL FOR IMPUTING MISSING ACTIVITIES OF DAILY LIVING DATA
    Fan, Lei
    Biehl, Michelle
    Singh, Balwinder
    Wilson, Gregory
    Li, Man
    CRITICAL CARE MEDICINE, 2012, 40 (12) : U38 - U39
  • [45] The MORPH-R web server and software tool for predicting missing genes in biological pathways
    Amar, David
    Frades, Itziar
    Diels, Tim
    Zaltzman, David
    Ghatan, Netanel
    Hedley, Pete E.
    Alexandersson, Erik
    Tzfadia, Oren
    Shamir, Ron
    PHYSIOLOGIA PLANTARUM, 2015, 155 (01) : 12 - 20
  • [46] Missing phenotype data imputation in pedigree data analysis
    Fridley, B
    de Andrade, M
    GENETIC EPIDEMIOLOGY, 2005, 29 (03) : 249 - 249
  • [47] Missing phenotype data imputation in pedigree data analysis
    Fridley, Brooke L.
    de Andrade, Mariza
    GENETIC EPIDEMIOLOGY, 2008, 32 (01) : 52 - 60
  • [48] Missing Data Imputation with High-Dimensional Data
    Brini, Alberto
    van den Heuvel, Edwin R.
    AMERICAN STATISTICIAN, 2024, 78 (02): : 240 - 252
  • [49] Missing data imputation in multivariate data by evolutionary algorithms
    Figueroa Garcia, Juan C.
    Kalenatic, Dusko
    Lopez Bello, Cesar Amilcar
    COMPUTERS IN HUMAN BEHAVIOR, 2011, 27 (05) : 1468 - 1474
  • [50] Exploring the Effects of Data Distribution in Missing Data Imputation
    Soares, Jastin Pompeu
    Santos, Miriam Seoane
    Abreu, Pedro Henriques
    Araujo, Helder
    Santos, Joao
    ADVANCES IN INTELLIGENT DATA ANALYSIS XVII, IDA 2018, 2018, 11191 : 251 - 263