MIDA: a Web Tool for MIssing DAta Imputation based on a Boosted and Incremental Learning Algorithm

Cited by: 0
Authors
Acampora, Giovanni [1]
Vitiello, Autilia [1]
Siciliano, Roberta [2]
Affiliations
[1] Univ Naples Federico II, Dept Phys Ettore Pancini, Naples, Italy
[2] Univ Naples Federico II, Dept Ind Engn, Naples, Italy
Funding
EU Horizon 2020
Keywords
SOFTWARE TOOL; KEEL
DOI
10.1109/fuzz48607.2020.9177644
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
One of the main issues in machine learning concerns the quality of the data used to efficiently train statistical models for classification and regression tasks. Among data-quality problems, missing values in data sets are particularly likely to degrade the accuracy of learning methods. As a consequence, there is a strong need for software tools that help machine learning users "fill in" their data sets before passing them to training algorithms. This paper addresses this need by introducing a web-based tool for MIssing DAta imputation (MIDA) based on a novel supervised learning method, the Generalized Boosted Incremental Non Parametric Imputation algorithm (G-BINPI), which addresses missing values in scenarios where the "missing at random" assumption holds. The proposed approach enables machine learning users to impute their data sets remotely through an intuitive graphical user interface. As highlighted in the experimental section, the proposed approach outperforms conventional missing data imputation approaches on different benchmark data sets.
Pages: 6