Analysis of Variants of KNN for Disease Risk Prediction

被引:0
|
作者
Negi, Archita [1 ]
Hajati, Farshid [1 ]
机构
[1] Victoria Univ Sydney, Coll Engn & Sci, Sydney, NSW, Australia
来源
ADVANCED INFORMATION NETWORKING AND APPLICATIONS, AINA-2022, VOL 3 | 2022年 / 451卷
关键词
D O I
10.1007/978-3-030-99619-2_50
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As we all know, Supervised Machine learning algorithms are gaining a lot of attention these days and are used in various fields. These algorithms play a great role in classifying, handling and predicting the data with the help of machine learning and makes use of a labelled dataset to predict and classify any given unlabeled data. KNN is one of the most popular supervised learning algorithms that delivers excellent results and is widely used these days for predicting the risk of certain diseases. Different variants of KNN are being introduced by different researchers due to distinct inadequacies of the standard KNN. The goal of this research is to evaluate the performance of such KNN variants in disease risk prediction by taking into account various factors such as performance, accuracy and so on. These variants which are already being proposed by different researchers will be reviewed and analyzed in this study. All these variants of K Nearest neighbor are implemented in Python Programming Language on different medical datasets taken from Kaggle. Extensive efforts are being made to analyze the different variants of KNN with respect to disease risk prediction and after implementation, it was noticed that some variants worked really well for some datasets. Out of all, Generalized KNN turned out to be one of the most efficient variants of KNN as it gave a good accuracy of 90.15% for mole cancer dataset. Also, it gave a precision of 90.13%, recall of 94.67% and AUC of 88.61% which overall is a very good performance. It was noticed that the Genetic KNN with GA Algorithm gave an accuracy of 99% for both Chronic Kidney Disease datasets. This study will be really beneficial for other researchers as well in the health industry and will also help in improvising their studies.
引用
收藏
页码:531 / 545
页数:15
相关论文
共 50 条
  • [1] Comparative performance analysis of K-nearest neighbour (KNN) algorithm and its different variants for disease prediction
    Shahadat Uddin
    Ibtisham Haque
    Haohui Lu
    Mohammad Ali Moni
    Ergun Gide
    Scientific Reports, 12
  • [2] Comparative performance analysis of K-nearest neighbour (KNN) algorithm and its different variants for disease prediction
    Uddin, Shahadat
    Haque, Ibtisham
    Lu, Haohui
    Moni, Mohammad Ali
    Gide, Ergun
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [3] Disease risk prediction with rare and common variants
    Chengqing Wu
    Kyle M Walsh
    Andrew T DeWan
    Josephine Hoh
    Zuoheng Wang
    BMC Proceedings, 5 (Suppl 9)
  • [4] Prediction and Analysis of Rheumatic Heart Disease using kNN Classification with ACO
    Rajathi, S.
    Radhamani, G.
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON DATA MINING AND ADVANCED COMPUTING (SAPIENCE), 2016, : 68 - 73
  • [5] Improving coeliac disease risk prediction by testing non-HLA variants additional to HLA variants
    Romanos, Jihane
    Rosen, Anna
    Kumar, Vinod
    Trynka, Gosia
    Franke, Lude
    Szperl, Agata
    Gutierrez-Achury, Javier
    van Diemen, Cleo C.
    Kanninga, Roan
    Jankipersadsing, Soesma A.
    Steck, Andrea
    Eisenbarth, Georges
    van Heel, David A.
    Cukrowska, Bozena
    Bruno, Valentina
    Mazzilli, Maria Cristina
    Nunez, Concepcion
    Ramon Bilbao, Jose
    Mearin, M. Luisa
    Barisani, Donatella
    Rewers, Marian
    Norris, Jill M.
    Ivarsson, Anneli
    Boezen, H. Marieke
    Liu, Edwin
    Wijmenga, Cisca
    GUT, 2014, 63 (03) : 415 - 422
  • [6] Analysis And Comparison Of Diabetic Prediction Using Medium KNN Classifier And Cosine KNN Classifier
    Nirupama, S.
    Rani, D. Jenila
    JOURNAL OF PHARMACEUTICAL NEGATIVE RESULTS, 2022, 13 : 386 - 394
  • [7] Genetic variants and their interactions in disease risk prediction – machine learning and network perspectives
    Sebastian Okser
    Tapio Pahikkala
    Tero Aittokallio
    BioData Mining, 6
  • [8] Genetic variants and their interactions in disease risk prediction - machine learning and network perspectives
    Okser, Sebastian
    Pahikkala, Tapio
    Aittokallio, Tero
    BIODATA MINING, 2013, 6
  • [9] GENETIC VARIANTS AND RISK OF DISEASE
    Alliey, Ney
    EUROPEAN NEUROPSYCHOPHARMACOLOGY, 2019, 29 : S716 - S716
  • [10] ON THE PREDICTION OF RISK FOR AUTISM FROM COMMON VARIANTS
    Zhu, Lingxue
    Chaste, Pauline
    Song, Youeun
    Devlin, Bernie
    Roeder, Kathryn
    EUROPEAN NEUROPSYCHOPHARMACOLOGY, 2017, 27 : S131 - S131