A novel K-nearest neighbor classifier for lung cancer disease diagnosis

Cited by: 0
Authors
Sachdeva, Ravi Kumar [1]
Bathla, Priyanka [2]
Rani, Pooja [3]
Lamba, Rohit [4]
Ghantasala, G. S. Pradeep [5]
Nassar, Ibrahim F. [6]
Affiliations
[1] Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, Rajpura, India
[2] Chandigarh University, Punjab, Gharuan, Mohali, India
[3] MMICTBM, Maharishi Markandeshwar (Deemed to be University), Haryana, Mullana, Ambala, India
[4] Department of Electronics and Communication Engineering, MMEC, Maharishi Markandeshwar (Deemed to be University), Haryana, Mullana, Ambala, India
[5] Department of Computer Science and Engineering, Alliance College of Engineering and Design, Alliance University, Bengaluru, India
[6] Faculty of Specific Education, Ain Shams University, 365 Ramsis Street, Abassia, Cairo, Egypt
Keywords
K-nearest neighbor; Logistic regression; Lung cancer; Machine learning; Naive Bayes; Pearson correlation; Pearson correlation weighted KNN; Random forest; Support vector machine
DOI
10.1007/s00521-024-10235-w
Abstract
Lung cancer is one of the world's deadliest diseases. Machine learning techniques can support its diagnosis from a small number of features. To build a systematic approach to diagnosing lung cancer from readily observable signs and historical medical data, without requiring CT scan images, the authors evaluated several classifiers on a dataset available on Kaggle: support vector machine (SVM), logistic regression (LR), Naïve Bayes (NB), random forest (RF), and K-nearest neighbor (KNN). They also propose a novel classification approach, Pearson correlation weighted KNN (PCWKNN), a modified KNN that uses Pearson correlation coefficient values to determine the weights in a weighted KNN. Classifier performance was evaluated using hold-out validation. SVM, LR, and RF each reached 96.77% accuracy, NB reached 95.16%, and KNN reached 91.93%. PCWKNN outperformed all of them with an accuracy of 98.39%. To assess generalization, the researchers applied PCWKNN to a second, larger lung cancer dataset and then to other diseases, including a brain stroke dataset. The encouraging results point to PCWKNN's robustness and adaptability and suggest it may be viable for real-world use. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
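The abstract does not spell out how the Pearson coefficients enter the classifier, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes each feature's weight is the absolute Pearson correlation between that feature and the class label, that the weights scale a squared Euclidean distance, and that the prediction is a plain majority vote among the k nearest training points. The function names `pearson_feature_weights` and `pcw_knn_predict` are hypothetical.

```python
import numpy as np


def pearson_feature_weights(X, y):
    """Absolute Pearson correlation of each feature column with the label.

    Assumption: this is one plausible way to derive weights; the paper's
    exact weighting scheme is not given in the abstract.
    """
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    denom = np.sqrt((Xc ** 2).sum(axis=0) * (yc ** 2).sum())
    # Constant features (zero variance) get weight 0 instead of NaN.
    safe = np.where(denom == 0, 1.0, denom)
    r = np.where(denom == 0, 0.0, (Xc * yc[:, None]).sum(axis=0) / safe)
    return np.abs(r)


def pcw_knn_predict(X_train, y_train, X_test, k=3):
    """Majority-vote KNN using a correlation-weighted Euclidean distance."""
    w = pearson_feature_weights(X_train, y_train)
    preds = []
    for x in X_test:
        # Features weakly correlated with the label contribute less distance.
        d = np.sqrt((w * (X_train - x) ** 2).sum(axis=1))
        nearest = np.argsort(d, kind="stable")[:k]
        labels, counts = np.unique(y_train[nearest], return_counts=True)
        preds.append(labels[np.argmax(counts)])
    return np.array(preds)
```

Under hold-out validation, as described in the abstract, the weights would be computed from the training split only and the accuracy measured on the held-out split, for example with `(pcw_knn_predict(X_train, y_train, X_test) == y_test).mean()`.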
Pages: 22403 - 22416
Number of pages: 13
Related papers
50 in total
  • [1] Hybrid k-Nearest Neighbor Classifier
    Yu, Zhiwen
    Chen, Hantao
    Liu, Jiming
    You, Jane
    Leung, Hareton
    Han, Guoqiang
    IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (06) : 1263 - 1275
  • [2] Fault Diagnosis Based on LTSA and K-Nearest Neighbor Classifier
    Jiang, Jingsheng
    Wang, Huaqing
    Ke, Yanliang
    Xiang, Wei
    Zhendong yu Chongji/Journal of Vibration and Shock, 2017, 36 (11) : 134 - 139
  • [3] Evidential Editing K-Nearest Neighbor Classifier
    Jiao, Lianmeng
    Denoeux, Thierry
    Pan, Quan
    SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, ECSQARU 2015, 2015, 9161 : 461 - 471
  • [4] Optimization Strategies for the k-Nearest Neighbor Classifier
    Yepdjio Nkouanga H.
    Vajda S.
    SN Computer Science, 4 (1)
  • [5] K-Nearest Neighbor Classifier for Signature Verification System
    Abdelrahaman, Ahmed A. A.
    Abdallah, Ahmed M. E.
    2013 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRICAL AND ELECTRONICS ENGINEERING (ICCEEE), 2013, : 58 - 62
  • [6] Feature-weighted k-nearest neighbor classifier
    Vivencio, Diego P.
    Hruschka, Estevam R., Jr.
    Nicoletti, M. do Carmo
    dos Santos, Edimilson B.
    Galvao, Sebastian D. C. O.
    2007 IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTATIONAL INTELLIGENCE, VOLS 1 AND 2, 2007, : 481 - +
  • [7] An Improvement To The k-Nearest Neighbor Classifier For ECG Database
    Jaafar, Haryati
    Ramli, Nur Hidayah
    Nasir, Aimi Salihah Abdul
    MALAYSIAN TECHNICAL UNIVERSITIES CONFERENCE ON ENGINEERING AND TECHNOLOGY 2017 (MUCET 2017), 2018, 318
  • [8] A K-Nearest Neighbor Classifier for Ship Route Prediction
    Lo Duca, Angelica
    Bacciu, Clara
    Marchetti, Andrea
    OCEANS 2017 - ABERDEEN, 2017,
  • [9] Use of K-Nearest Neighbor classifier for intrusion detection
    Liao, YH
    Vemuri, VR
    COMPUTERS & SECURITY, 2002, 21 (05) : 439 - 448
  • [10] Privacy-preserving k-Nearest Neighbor Classifier
    Xu J.
    Wang A.-D.
    Bi M.
    Zhou F.-C.
    Ruan Jian Xue Bao/Journal of Software, 2019, 30 (11) : 3503 - 3517