Comparative Analysis of K-Nearest Neighbor and Modified K-Nearest Neighbor Algorithm for Data Classification

被引:0
|
作者
Okfalisa [1 ]
Mustakim [2 ]
Gazalba, Ikbal [3 ]
Reza, Nurul Gayatri Indah [3 ]
机构
[1] UIN Sultan Syarif Kasim Riau, Dept Informat Engn, Pekanbaru, Indonesia
[2] UIN Sultan Syarif Kasim Riau, Dept Informat Syst, Data Min Lab, Pekanbaru, Indonesia
[3] UIN Sultan Syarif Kasim Riau, Dept Informat Syst, Pekanbaru, Indonesia
关键词
Classification; Data Mining; K-Nearest Neighbor; Modified K-Nearest Neighbor;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data mining is the process of handling information from a database which is invisible directly. Data mining is predicted to become a highly revolutionary branch of science over the next decade. One of data mining techniques is classification. The most popular classification technique is K-Nearest Neighbor (KNN). But there is also the Modified K-Nearest Neighbor (MKNN) classification algorithm which is the derived algorithm of KNN. In this paper we will analyze the comparison of KNN and MKNN algorithms to classify the data of Conditional Cash Transfer Implementation Unit (Unit Pelaksana Program Keluarga Harapan) which consist of 7395 records. Comparative analysis is based on the accuracy of both algorithms. Before classification, K-Fold Cross Validation was done to search for the optimal data modeling resulted in data modeling on cross 2 with accuracy of 93.945%. The results of K-Fold Cross Validation modeling will be the model for training data samples and testing data to test KNN and MKNN for classification. Classification result produced accuracy based on the rules of confusion matrix. The test resulted in the highest accuracy of KKN by 94.95% with average accuracy during the test was 93.94% and the highest accuracy of MKNN was 99.51% with the average accuracy during the test was 99.20%, almost all testing from the first test up to the tenth, MKNN algorithm is superior and has better accuracy value than KNN so it can be analyzed that the ability of MKNN algorithm in accuracy is better than KNN. It can be concluded that MKNN algorithm is capable of handling accuracy better for classification than KNN algorithm, by ignoring other aspects such as computerization, time efficiency, and algorithm effectiveness.
引用
收藏
页码:294 / 298
页数:5
相关论文
共 50 条
  • [1] Analysis of the k-nearest neighbor classification
    Li, Jing
    Cheng, Ming
    [J]. INFORMATION SCIENCE AND MANAGEMENT ENGINEERING, VOLS 1-3, 2014, 46 : 1911 - 1917
  • [3] Improved k-nearest neighbor classification
    Wu, YQ
    Ianakiev, K
    Govindaraju, V
    [J]. PATTERN RECOGNITION, 2002, 35 (10) : 2311 - 2318
  • [4] MKNN: Modified K-Nearest Neighbor
    Parvin, Hamid
    Alizadeh, Hoscin
    Minael-Bidgoli, Behrouz
    [J]. WCECS 2008: WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, 2008, : 831 - 834
  • [5] A Modified K-Nearest Neighbor Algorithm to Handle Uncertain Data
    Agrawal, Rashmi
    Ram, Babu
    [J]. 2015 5TH INTERNATIONAL CONFERENCE ON IT CONVERGENCE AND SECURITY (ICITCS), 2015,
  • [6] A FUZZY K-NEAREST NEIGHBOR ALGORITHM
    KELLER, JM
    GRAY, MR
    GIVENS, JA
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1985, 15 (04): : 580 - 585
  • [7] Modified K-nearest Neighbor Algorithm with Variant K Values
    Waghmare, Kalyani C.
    Sonkamble, Balwant A.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (10) : 220 - 224
  • [8] An Improved K-Nearest Neighbor Algorithm for Pattern Classification
    Sultana, Zinnia
    Ferdousi, Ashifatul
    Tasnim, Farzana
    Nahar, Lutfun
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (08) : 760 - 767
  • [9] Fuzzy Monotonic K-Nearest Neighbor Versus Monotonic Fuzzy K-Nearest Neighbor
    Zhu, Hong
    Wang, Xizhao
    Wang, Ran
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (09) : 3501 - 3513
  • [10] Joint Evidential K-Nearest Neighbor Classification
    Gong, Chaoyu
    Li, Yongbin
    Liu, Yong
    Wang, Pei-hong
    You, Yang
    [J]. 2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2113 - 2126