A Novel Index Measure Imputation Algorithm for Missing Data Values: A Machine Learning Approach

被引:0
|
作者
Madhu, G. [1 ]
Rajinikanth, T. V. [2 ]
机构
[1] VNR VJIET, Dept Informat Technol, Hyderabad 500090, Andhra Pradesh, India
[2] GRIET, Dept Informat Technol, Hyderabad 500085, Andhra Pradesh, India
关键词
classification; decision tree; index measure; missing values;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of missing data in the real world datasets has very significant role in the real time data mining process and becomes more complex in large databases. The presence of missing values influences data set features and the class attributes, thus affecting the predictive accuracies of the classifiers. For the last one decade, many researchers have come out with different techniques for dealing with missing attribute values in databases with homogeneous and/or numeric attributes. In this research work, we proposed a new indexing measure to the imputation algorithm for missing data values of the attributes to compute the similarity measure between any two typical elements in the dataset. It can also be applied on any dataset be it a nominal and/or real. The proposed algorithm is evaluated by extensive experiments and comparison with KNNI, SVMI, WKNNI, KMI and FKMI algorithms. The results showed that the proposed algorithm has better performance than the existing imputation algorithms in terms of classification accuracy and also our decision tree algorithm employs highly accurate decision rules.
引用
收藏
页码:81 / 87
页数:7
相关论文
共 50 条
  • [1] Deep Learning Approach for Imputation of Missing Values in Actigraphy Data: Algorithm Development Study
    Jang, Jong-Hwan
    Choi, Junggu
    Roh, Hyun Woong
    Son, Sang Joon
    Hong, Chang Hyung
    Kim, Eun Young
    Kim, Tae Young
    Yoon, Dukyong
    [J]. JMIR MHEALTH AND UHEALTH, 2020, 8 (07):
  • [2] Missing Data Imputation using Machine Learning Algorithm for Supervised Learning
    Cenitta, D.
    Arjunan, R. Vijaya
    Prema, K., V
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2021,
  • [3] Water-Quality Data Imputation with a High Percentage of Missing Values: A Machine Learning Approach
    Rodriguez, Rafael
    Pastorini, Marcos
    Etcheverry, Lorena
    Chreties, Christian
    Fossati, Monica
    Castro, Alberto
    Gorgoglione, Angela
    [J]. SUSTAINABILITY, 2021, 13 (11)
  • [4] ExtraImpute: A Novel Machine Learning Method for Missing Data Imputation
    Alabadla, Mustafa
    Sidi, Fatimah
    Ishak, Iskandar
    Ibrahim, Hamidah
    Affendey, Lilly Suriani
    Hamdan, Hazlina
    [J]. JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2022, 13 (05) : 470 - 476
  • [5] Missing Values and Imputation in Healthcare Data: Can Interpretable Machine Learning Help?
    Chen, Zhi
    Tan, Sarah
    Chajewska, Urszula
    Rudin, Cynthia
    Caruana, Rich
    [J]. CONFERENCE ON HEALTH, INFERENCE, AND LEARNING, VOL 209, 2023, 209 : 86 - 99
  • [6] Mathura (MBI)-A novel imputation measure for imputation of missing values in medical datasets
    Mathura Bai, B.
    Mangathayaru, N.
    Padmaja Rani, B.
    Aljawarneh, Shadi
    [J]. Recent Advances in Computer Science and Communications, 2021, 14 (05) : 1358 - 1369
  • [7] A Novel Algorithm for the Integration of the Imputation of Missing Values and Clustering
    Ben Ihay, Roni
    Herman, Maya
    [J]. MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, MLDM 2015, 2015, 9166 : 115 - 129
  • [8] A First Approach on Big Data Missing Values Imputation
    Montesdeoca, Besay
    Luengo, Julian
    Maillo, Jesus
    Garcia-Gil, Diego
    Garcia, Salvador
    Herrera, Francisco
    [J]. PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY (IOTBDS 2019), 2019, : 315 - 323
  • [9] A novel approach for imputation of missing continuous attribute values in databases using genetic algorithm
    Priya, R. Devi
    Kuppuswami, S.
    [J]. International Journal of Information Technology and Management, 2015, 14 (2-3) : 185 - 200
  • [10] A Novel Approach for Dealing with Missing Values in Machine Learning Datasets with Discrete Values
    Abu-Soud, Saleh M.
    [J]. 2019 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCIS), 2019, : 118 - 122