WIMP: Web server tool for missing data imputation

Cited by: 4
Authors
Urda, D. [1]
Subirats, J. L. [1]
Garcia-Laencina, P. J.
Franco, L. [1]
Sancho-Gomez, J. L. [2]
Jerez, J. M. [1]
Affiliations
[1] Univ Malaga, Dept Lenguajes & Ciencias Comp, ETSI Informat, E-29071 Malaga, Spain
[2] Univ Politecn Cartagena, Dept Tecnol Informac & Comunicac, Cartagena, Spain
Keywords
Imputation; Missing data; Machine learning; Web application; EMPIRICAL LIKELIHOOD; MICROARRAY DATA; LINEAR-MODELS; REGRESSION; CLASSIFICATION; ALGORITHM; VALUES;
DOI
10.1016/j.cmpb.2012.08.006
Chinese Library Classification
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
The imputation of unknown or missing data is a crucial task in the analysis of biomedical datasets. In several situations it is necessary to classify or identify instances given incomplete vectors, and the presence of missing values can substantially degrade the performance of the algorithms used for classification or recognition. The task of learning accurately from incomplete data raises a number of issues, some of which have not been completely solved in machine learning applications. In this sense, effective missing value estimation methods are required. Different methods for missing data imputation exist, but most of the time selecting the appropriate technique involves testing several methods, comparing them and choosing the right one. Furthermore, applying these methods is in most cases not straightforward, as they involve several technical details, and in particular cases, such as when dealing with microarray datasets, applying the methods requires huge computational resources. As far as we know, there is no public software application that provides the computing capabilities required for carrying out the task of data imputation. This paper presents a new public tool for missing data imputation that is attached to a computer cluster in order to execute computationally demanding tasks. The software WIMP (Web IMPutation) is a publicly available web site where registered users can create, execute, analyze and store their simulations related to missing data imputation. (C) 2012 Elsevier Ireland Ltd. All rights reserved.
Pages: 1247-1254
Page count: 8
Related papers
50 in total
  • [1] imputomics: web server and R package for missing values imputation in metabolomics data
    Chilimoniuk, Jaroslaw
    Grzesiak, Krystyna
    Kala, Jakub
    Nowakowski, Dominik
    Kretowski, Adam
    Kolenda, Rafal
    Ciborowski, Michal
    Burdukiewicz, Michal
    BIOINFORMATICS, 2024, 40 (03)
  • [2] Missing value imputation for microarray data: a comprehensive comparison study and a web tool
    Chiu, Chia-Chun
    Chan, Shih-Yao
    Wang, Chung-Ching
    Wu, Wei-Sheng
    BMC SYSTEMS BIOLOGY, 2013, 7
  • [3] MIDA: a Web Tool for MIssing DAta Imputation based on a Boosted and Incremental Learning Algorithm
    Acampora, Giovanni
    Vitiello, Autilia
    Siciliano, Roberta
    2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [4] Multiple Imputation: A Flexible Tool for Handling Missing Data
    Li, Peng
    Stuart, Elizabeth A.
    Allison, David B.
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2015, 314 (18): 1966 - 1967
  • [5] IMPUTATION OF MISSING DATA
    Lunt, M.
    ANNALS OF THE RHEUMATIC DISEASES, 2014, 73 : 49 - 49
  • [6] Missing data imputation: focusing on single imputation
    Zhang, Zhongheng
    ANNALS OF TRANSLATIONAL MEDICINE, 2016, 4 (01)
  • [7] Multiple imputation as a flexible tool for missing data handling in clinical research
    Enders, Craig K.
    BEHAVIOUR RESEARCH AND THERAPY, 2017, 98 : 4 - 18
  • [8] Missing Data: data replacement and imputation
    Hutcheson, Graeme
    Pampaka, Maria
    JOURNAL OF MODELLING IN MANAGEMENT, 2012, 7 (02)
  • [9] MVIAeval: a web tool for comprehensively evaluating the performance of a new missing value imputation algorithm
    Wu, Wei-Sheng
    Jhou, Meng-Jhun
    BMC BIOINFORMATICS, 2017, 18