Missing Data Imputation using Machine Learning Algorithm for Supervised Learning

被引:4
|
作者
Cenitta, D. [1 ]
Arjunan, R. Vijaya [1 ]
Prema, K., V [1 ]
机构
[1] Manipal Inst Technol MAHE, Dept CSE, Manipal, India
关键词
Heart Disease; Data mining; UCI; Decision Tree; Missing Data; !text type='PYTHON']PYTHON[!/text;
D O I
10.1109/ICCC150826.2021.9402558
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With a transience rate of over 18 million per year, Heart Disease (HD) has emerged out to be the lethal disease of the world. Data mining-based heart disease diagnosis systems can surely aid cardiac professionals in a timely diagnosis of the patient's condition. In this proposed work, a Python-based data mining system capable of diagnosing the HD using a Decision Tree has been developed. In the methodology, the UCI data repository was taken into consideration with 14 Attributes. In the dataset, there are few missing values (yet found to be hyperparameter), and pre-processing with such missing values is a common yet challenging problem. A mere substitution will give biased results from the data to be observed for HD diagnosis and will certainly affect the value of the learning process in Machine Learning. Therefore, in the proposed work, a missing value imputation is done, which gave better accuracy, and it is trustable.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Sharpening the BLADE: Missing Data Imputation Using Supervised Machine Learning
    Suresh, Marcus
    Taib, Ronnie
    Zhao, Yanchang
    Jin, Warren
    [J]. AI 2019: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11919 : 215 - 227
  • [2] Missing Data Imputation for Supervised Learning
    Poulos, Jason
    Valle, Rafael
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2018, 32 (02) : 186 - 196
  • [3] Analysis of Machine Learning Based Imputation of Missing Data
    Rizvi, Syed Tahir Hussain
    Latif, Muhammad Yasir
    Amin, Muhammad Saad
    Telmoudi, Achraf Jabeur
    Shah, Nasir Ali
    [J]. CYBERNETICS AND SYSTEMS, 2023,
  • [4] A Novel Index Measure Imputation Algorithm for Missing Data Values: A Machine Learning Approach
    Madhu, G.
    Rajinikanth, T. V.
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2012, : 81 - 87
  • [5] Imputation of missing gas permeability data for polymer membranes using machine learning
    Yuan, Qi
    Longo, Mariagiulia
    Thornton, Aaron W.
    McKeown, Neil B.
    Comesana-Gandara, Bibiana
    Jansen, Johannes C.
    Jelfs, Kim E.
    [J]. JOURNAL OF MEMBRANE SCIENCE, 2021, 627
  • [6] Machine Learning Based Missing Data Imputation in Categorical Datasets
    Ishaq, Muhammad
    Zahir, Sana
    Iftikhar, Laila
    Bulbul, Mohammad Farhad
    Rho, Seungmin
    Lee, Mi Young
    [J]. IEEE ACCESS, 2024, 12 : 88332 - 88344
  • [7] ExtraImpute: A Novel Machine Learning Method for Missing Data Imputation
    Alabadla, Mustafa
    Sidi, Fatimah
    Ishak, Iskandar
    Ibrahim, Hamidah
    Affendey, Lilly Suriani
    Hamdan, Hazlina
    [J]. JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2022, 13 (05) : 470 - 476
  • [8] Missing value imputation using unsupervised machine learning techniques
    Raja, P. S.
    Thangavel, K.
    [J]. SOFT COMPUTING, 2020, 24 (06) : 4361 - 4392
  • [9] Missing value imputation using unsupervised machine learning techniques
    P. S. Raja
    K. Thangavel
    [J]. Soft Computing, 2020, 24 : 4361 - 4392
  • [10] Semi-supervised learning with missing values imputation
    Huang, Buliao
    Zhu, Yunhui
    Usman, Muhammad
    Chen, Huanhuan
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 284