Modeling naive bayes imputation classification for missing data

Cited by: 1
Authors
Khotimah, B. K. [1, 3]
Miswanto [1, 2]
Suprajitno, H. [1, 2]
Affiliations
[1] Univ Airlangga, Fac Sci & Technol, Surabaya, Indonesia
[2] Univ Airlangga, Dept Math, Surabaya, Indonesia
[3] Univ Trunojoyo Madura, Dept Informat Engn, Bangkalan, Indonesia
Keywords
VALUES;
DOI
10.1088/1755-1315/243/1/012111
Chinese Library Classification (CLC) number
G40 [Education];
Discipline classification code
040101 ; 120403 ;
Abstract
Naive Bayes Imputation (NBI) fills in missing values by replacing each missing attribute value with the value estimated from the probability model. The NBI process divides the data into two subsets: the complete data and the data containing missing values. The complete data are used to impute the missing values. The process is repeated for each attribute with missing values to generate a complete data set for classification. This research applies NBI for imputation as a preprocessing step in preparation for classification. The experiments compare imputation with NBI against imputation with the mean and the mode. The model used for imputation is trained on the full set of complete data, so that the predicted values represent the entire data set. The results show that NBI imputes values with higher accuracy than the other imputation methods. NBI with single imputation and with multiple imputation yields better performance because the appropriate features are used. This study aims to measure the effect of missing values on the Naive Bayes Imputation algorithm, which is based on a probabilistic model, using mixed data. The empirical results show that the interaction between imputation methods and supervised classifiers leads to differences in classification performance for the same imputation method.
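The procedure described in the abstract (split the data into complete and incomplete records, train on the complete records, and repeat the prediction for each attribute with missing values) can be sketched as follows. This is a minimal illustration and not the authors' implementation: it assumes purely categorical attributes (the paper works with mixed data), marks missing values with `None`, and applies Laplace smoothing through a hypothetical `alpha` parameter. The function name `nb_impute` and the toy data are also assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def nb_impute(rows, alpha=1.0):
    """Return a copy of `rows` with None entries filled by naive Bayes.

    Assumes categorical values only; `alpha` is a Laplace smoothing constant.
    """
    # Split: complete rows train the model, incomplete rows get imputed.
    complete = [r for r in rows if None not in r]
    if not complete:
        raise ValueError("at least one complete row is required")
    n_cols = len(rows[0])
    # Number of distinct observed values per attribute (for smoothing).
    levels = [len({r[j] for r in complete}) for j in range(n_cols)]
    imputed = [list(r) for r in rows]

    # Repeat for every attribute: treat it as the class to predict.
    for target in range(n_cols):
        features = [j for j in range(n_cols) if j != target]
        prior = Counter(r[target] for r in complete)
        cond = defaultdict(Counter)  # (feature index, class) -> value counts
        for r in complete:
            for j in features:
                cond[(j, r[target])][r[j]] += 1

        for orig, out in zip(rows, imputed):
            if orig[target] is not None:
                continue

            def log_posterior(c):
                score = math.log(prior[c] / len(complete))
                for j in features:
                    if orig[j] is None:  # ignore other missing attributes
                        continue
                    num = cond[(j, c)][orig[j]] + alpha
                    den = prior[c] + alpha * levels[j]
                    score += math.log(num / den)
                return score

            # Pick the most probable value; sorting makes ties deterministic.
            out[target] = max(sorted(prior), key=log_posterior)
    return imputed


# Toy data (hypothetical): columns are outlook, temperature, play.
rows = [
    ["sunny", "hot", "no"],
    ["sunny", "hot", "no"],
    ["rainy", "cool", "yes"],
    ["rainy", "cool", "yes"],
    ["sunny", "hot", None],
    [None, "cool", "yes"],
]
filled = nb_impute(rows)
for row in filled:
    print(row)
```

Each imputed value here is predicted from the originally observed attributes only, so this corresponds to single imputation; mean or mode imputation, the baselines named in the abstract, would instead fill every gap in a column with the same value.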
Pages: 10
Related papers
50 in total
  • [1] A HYBRID SELF ORGANIZING MAP IMPUTATION (SOMI) WITH NAIVE BAYES FOR IMPUTATION MISSING DATA CLASSIFICATION
    Khotimah, Bain Khusnul
    Miswanto
    Suprajitno, Herry
    INTERNATIONAL JOURNAL OF GEOMATE, 2019, 17 (62): : 195 - 202
  • [2] Naive Bayes as an imputation tool for classification problems
    Garcia, AJT
    Hruschka, ER
    HIS 2005: 5th International Conference on Hybrid Intelligent Systems, Proceedings, 2005, : 497 - 499
  • [3] Naive Bayes Classification of Uncertain Data
    Ren, Jiangtao
    Lee, Sau Dan
    Chen, Xianlu
    Kao, Ben
    Cheng, Reynold
    Cheung, David
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 944 - +
  • [4] Naive Bayes Classification Ensembles to Support Modeling Decisions in Data Stream Mining
    Lutu, Patricia E. N.
    2015 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2015, : 335 - 340
  • [5] Imputation of missing data with neural networks for classification
    Choudhury, Suyra Jyoti
    Pal, Nikhil R.
    KNOWLEDGE-BASED SYSTEMS, 2019, 182
  • [7] Missing data imputation using classification and regression trees
    Chen, Cheng-Yang
    Chang, Yu-Wei
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [8] Missing Data Imputation and Its Effect on the Accuracy of Classification
    Hunt, Lynette A.
    DATA SCIENCE: INNOVATIVE DEVELOPMENTS IN DATA ANALYSIS AND CLUSTERING, 2017, : 3 - 14
  • [9] Data Classification Using Rough Sets and Naive Bayes
    Al-Aidaroos, Khadija
    Abu Bakar, Azuraliza
    Othman, Zalinda
    ROUGH SET AND KNOWLEDGE TECHNOLOGY (RSKT), 2010, 6401 : 134 - 142
  • [10] Constrained Naive Bayes with application to unbalanced data classification
    Blanquero, Rafael
    Carrizosa, Emilio
    Ramirez-Cobo, Pepa
    Sillero-Denamiel, M. Remedios
    CENTRAL EUROPEAN JOURNAL OF OPERATIONS RESEARCH, 2022, 30 (04) : 1403 - 1425