A Probabilistic Approach for Missing Data Imputation

被引:0
|
作者
Arefin, Muhammed Nazmul [1 ]
Masum, Abdul Kadar Muhammad [2 ]
机构
[1] Int Islamic Univ Chittagong, Dept Comp Sci & Engn, Chattogram 4318, Bangladesh
[2] Daffodil Int Univ, Dept Software Engn, Dhaka 1216, Savar, Bangladesh
关键词
D O I
10.1155/2024/4737963
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In the context of data analysis, missing data imputation is a vital issue due to the typically large scale and complexity of the datasets. It often results in a higher incidence of missing data. So, addressing missing data through the imputation technique is essential to ensure the integrity and completeness of the data. It will ultimately improve the accuracy and validity of the data analysis. The prime objective of this study is to propose an imputation model. This paper presents a method for imputing missing employee data through a combination of features and probability calculations. The study utilized employee datasets that were collected from the Kaggle along with primary data collected from RMG factories located in Chittagong. The suggested algorithm demonstrated a notable level of accuracy on the datasets, and the average accuracy for each identified technique was also quite satisfactory. This study contributes to the existing body of research on missing data imputation in big data analysis and offers practical implications for handling missing data in different datasets. Usage of this technique will enhance the accuracy of data analysis and decision-making in organizations.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Probabilistic Missing Value Imputation for Mixed Categorical and Ordered Data
    Zhao, Yuxuan
    Townsend, Alex
    Udell, Madeleine
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [2] A New Approach for Missing Data Imputation in Big Data Interface
    Wang, Chunzhi
    Shakhovska, Nataliya
    Sachenko, Anatoliy
    Komar, Myroslav
    [J]. INFORMATION TECHNOLOGY AND CONTROL, 2020, 49 (04): : 541 - 555
  • [3] IMPUTATION OF MISSING DATA
    Lunt, M.
    [J]. ANNALS OF THE RHEUMATIC DISEASES, 2014, 73 : 49 - 49
  • [4] Multiple imputation: a mature approach to dealing with missing data
    Chevret, S.
    Seaman, S.
    Resche-Rigon, M.
    [J]. INTENSIVE CARE MEDICINE, 2015, 41 (02) : 348 - 350
  • [5] A nonparametric multiple imputation approach for missing categorical data
    Zhou, Muhan
    He, Yulei
    Yu, Mandi
    Hsu, Chiu-Hsieh
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2017, 17
  • [6] Tree-based Approach to Missing Data Imputation
    Vateekul, Peerapon
    Sarinnapakorn, Kanoksri
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 70 - +
  • [7] A nonparametric multiple imputation approach for missing categorical data
    Muhan Zhou
    Yulei He
    Mandi Yu
    Chiu-Hsieh Hsu
    [J]. BMC Medical Research Methodology, 17
  • [8] Multiple imputation: a mature approach to dealing with missing data
    S. Chevret
    S. Seaman
    M. Resche-Rigon
    [J]. Intensive Care Medicine, 2015, 41 : 348 - 350
  • [9] A First Approach on Big Data Missing Values Imputation
    Montesdeoca, Besay
    Luengo, Julian
    Maillo, Jesus
    Garcia-Gil, Diego
    Garcia, Salvador
    Herrera, Francisco
    [J]. PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY (IOTBDS 2019), 2019, : 315 - 323
  • [10] Missing Categorical Data Imputation Approach Based on Similarity
    Wu, Sen
    Feng, Xiaodong
    Han, Yushan
    Wang, Qiang
    [J]. PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 2827 - 2832