A novel K-nearest neighbor classifier for lung cancer disease diagnosis

被引:0
|
作者
Sachdeva, Ravi Kumar [1 ]
Bathla, Priyanka [2 ]
Rani, Pooja [3 ]
Lamba, Rohit [4 ]
Ghantasala, G. S. Pradeep [5 ]
Nassar, Ibrahim F. [6 ]
机构
[1] Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, Rajpura, India
[2] Chandigarh University, Punjab, Gharuan, Mohali, India
[3] MMICTBM, Maharishi Markandeshwar (Deemed to be University), Haryana, Mullana, Ambala, India
[4] Department of Electronics and Communication Engineering, MMEC, Maharishi Markandeshwar (Deemed to be University), Haryana, Mullana, Ambala, India
[5] Department of Computer Science and Engineering, Alliance College of Engineering and Design, Alliance University, Bengaluru, India
[6] Faculty of Specific Education, Ain Shams University, 365 Ramsis Street, Abassia, Cairo, Egypt
关键词
K-near neighbor - Logistics regressions - Lung Cancer - Machine-learning - Naive bayes - Nearest-neighbour - Pearson correlation - Pearson correlation weighted KNN - Random forests - Support vectors machine;
D O I
10.1007/s00521-024-10235-w
中图分类号
学科分类号
摘要
One of the world's deadliest diseases is lung cancer. Based on a few features, machine learning techniques can help in the diagnosis of lung cancer. The performance of several classifiers: support vector machine (SVM), logistic regression (LR), Naïve Bayes (NB), random forest (RF), and K-nearest neighbor (KNN), was evaluated by the authors using the dataset available on Kaggle to create a systematic approach for the diagnosis of lung cancer disease based on readily observable signs and historical medical data without the requirement of CT scan images. The authors have proposed a novel approach for classification called Pearson correlation weighted KNN (PCWKNN), which is a modified version of KNN and uses Pearson correlation coefficient values to determine weights in a weighted KNN. The performance of the classifiers was evaluated using the hold-out validation method. SVM, LR, and RF were 96.77% accurate. NB obtained 95.16% accuracy. KNN achieved 91.93% accuracy. PCWKNN outperformed the employed classifiers and obtained an accuracy of 98.39%. Addressing the imperative for improved model generalization, the researchers utilized PCWKNN on an alternative, more extensive lung cancer dataset and subsequently broadened its application to diverse diseases, including the brain stroke dataset. The encouraging outcomes underscore PCWKNN's resilience and adaptability, suggesting its viability for real-world implementation. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
引用
收藏
页码:22403 / 22416
页数:13
相关论文
共 50 条
  • [21] A MODIFIED K-NEAREST NEIGHBOR CLASSIFIER TO DEAL WITH UNBALANCED CLASSES
    AlSukker, Akram
    Al-Ani, Ahmed
    Atiya, Amir
    IJCCI 2009: PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2009, : 408 - +
  • [22] Boosting the distance estimation -: Application to the K-Nearest Neighbor Classifier
    Amores, J
    Sebe, N
    Radeva, P
    PATTERN RECOGNITION LETTERS, 2006, 27 (03) : 201 - 209
  • [23] Adaptation of the fuzzy k-nearest neighbor classifier for manufacturing automation
    Tobin, KW
    Gleason, SS
    Karnowski, TP
    MACHINE VISION APPLICATIONS IN INDUSTRIAL INSPECTION VI, 1998, 3306 : 122 - 130
  • [24] A fall detection system using k-nearest neighbor classifier
    Liu, Chien-Liang
    Lee, Chia-Hoang
    Lin, Ping-Min
    EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (10) : 7174 - 7181
  • [25] A fuzzy K-nearest neighbor classifier to deal with imperfect data
    Cadenas, Jose M.
    Carmen Garrido, M.
    Martinez, Raquel
    Munoz, Enrique
    Bonissone, Piero P.
    SOFT COMPUTING, 2018, 22 (10) : 3313 - 3330
  • [26] Consistency of the k-Nearest Neighbor Classifier for Spatially Dependent Data
    Ahmad Younso
    Ziad Kanaya
    Nour Azhari
    Communications in Mathematics and Statistics, 2023, 11 : 503 - 518
  • [27] Classification of facial expressions using K-Nearest Neighbor Classifier
    Sohail, Abu Sayeed Md.
    Bhattacharya, Prabir
    COMPUTER VISION/COMPUTER GRAPHICS COLLABORATION TECHNIQUES, 2007, 4418 : 555 - +
  • [28] A novel ensemble method for k-nearest neighbor
    Zhang, Youqiang
    Cao, Guo
    Wang, Bisheng
    Li, Xuesong
    PATTERN RECOGNITION, 2019, 85 : 13 - 25
  • [29] Heart Disease Prediction Using k-Nearest Neighbor Classifier Based on Handwritten Text
    Kedar, Seema
    Bormane, D. S.
    Nair, Vaishnavi
    COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 1, CIDM 2015, 2016, 410 : 49 - 56
  • [30] Optimal gene selection for cancer classification with partial correlation and k-nearest neighbor classifier
    Yoo, SH
    Cho, SB
    PRICAI 2004: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3157 : 713 - 722