CN-GAIN: Classification and NormalizationDenormalization-Based Generative Adversarial Imputation Network for Missing SMES Data Imputation

被引:0
|
作者
Sudrajat, Antonius Wahyu [1 ,3 ]
Ermatita [2 ]
Samsuryadi [2 ]
机构
[1] Univ Sriwijaya, Doctoral Program Engn Sci, Palembang, Indonesia
[2] Univ Sriwijaya, Fac Comp Sci, Palembang, Indonesia
[3] Univ Multi Data Palembang, Fac Comp Sci & Engn, Palembang, Indonesia
关键词
Missing values; GAIN method; normalization-; denormalization; imputation; UMKM data; FEATURE-SELECTION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Quality data is crucial for supporting the management and development of SMES carried out by the government. However, the inability of SMES actors to provide complete data often results in incomplete dataset. Missing values present a significant challenge to producing quality data. To address this, missing data imputation methods are essential for improving the accuracy of data analysis. The Generative Adversarial Imputation Network (GAIN) is a machine learning method used for imputing missing data, where data preprocessing plays an important role. This study proposes a new model for missing data imputation called the Classification and Normalization-Denormalization-based Generative Adversarial Imputation Network (CN-GAIN). The study simulates different patterns of missing values, specifically MAR (Missing at Random), MCAR (Missing Completely at Random), and MNAR (Missing Not at Random). For comparison, each missing value pattern is processed using both the CN-GAIN and the base GAIN methods. The results demonstrate that the CN-GAIN model outperforms GAIN in predicting missing values. The CN-GAIN model achieves an accuracy of 0.0801% for the MCAR category and shows a lower error rate (RMSE) of 48.78% for the MNAR category. The mean error (MSE) for the MAR category is 99.60%, while the deviation (MAE) for the MNAR category is 70%.
引用
收藏
页码:314 / 322
页数:9
相关论文
共 50 条
  • [21] Missing Features Reconstruction Using a Wasserstein Generative Adversarial Imputation Network
    Friedjungova, Magda
    Vasata, Daniel
    Balatsko, Maksym
    Jirina, Marcel
    COMPUTATIONAL SCIENCE - ICCS 2020, PT IV, 2020, 12140 : 225 - 239
  • [22] Multiple Imputation by Generative Adversarial Networks for Classification with Incomplete Data
    Bao Ngoc Vi
    Dinh Tan Nguyen
    Cao Truong Tran
    Huu Phuc Ngo
    Chi Cong Nguyen
    Hai-Hong Phan
    2021 RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF 2021), 2021, : 162 - 167
  • [23] Joint Representation Learning with Generative Adversarial Imputation Network for Improved Classification of Longitudinal Data
    Pingi, Sharon Torao
    Zhang, Duoyi
    Bashar, Md Abul
    Nayak, Richi
    DATA SCIENCE AND ENGINEERING, 2024, 9 (01) : 5 - 25
  • [24] Joint Representation Learning with Generative Adversarial Imputation Network for Improved Classification of Longitudinal Data
    Sharon Torao Pingi
    Duoyi Zhang
    Md Abul Bashar
    Richi Nayak
    Data Science and Engineering, 2024, 9 : 5 - 25
  • [25] MISSING DATA IMPUTATION FOR HEALTH CARE BIG DATA USING DENOISING AUTOENCODER WITH GENERATIVE ADVERSARIAL NETWORK
    Zhang, Yinbing
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (05): : 3850 - 3857
  • [26] A data imputation method for multivariate time series based on generative adversarial network
    Guo, Zijian
    Wan, Yiming
    Ye, Hao
    NEUROCOMPUTING, 2019, 360 : 185 - 197
  • [27] Detracking Autoencoding Conditional Generative Adversarial Network: Improved Generative Adversarial Network Method for Tabular Missing Value Imputation
    Liu, Jingrui
    Duan, Zixin
    Hu, Xinkai
    Zhong, Jingxuan
    Yin, Yunfei
    ENTROPY, 2024, 26 (05)
  • [28] Generative Adversarial Networks Assist Missing Data Imputation: A Comprehensive Survey and Evaluation
    Shahbazian, Reza
    Greco, Sergio
    IEEE ACCESS, 2023, 11 : 88908 - 88928
  • [29] Traffic volume imputation using the attention-based spatiotemporal generative adversarial imputation network
    Duan, Yixin
    Wang, Chengcheng
    Wang, Chao
    Tang, Jinjun
    Chen, Qun
    TRANSPORTATION SAFETY AND ENVIRONMENT, 2024, 6 (04):
  • [30] Imputation of missing data with class imbalance using conditional generative adversarial networks
    Awan, Saqib Ejaz
    Bennamoun, Mohammed
    Sohel, Ferdous
    Sanfilippo, Frank
    Dwivedi, Girish
    NEUROCOMPUTING, 2021, 453 : 164 - 171