A novel K-nearest neighbor classifier for lung cancer disease diagnosis

Cited by: 0
Authors
Sachdeva, Ravi Kumar [1 ]
Bathla, Priyanka [2 ]
Rani, Pooja [3 ]
Lamba, Rohit [4 ]
Ghantasala, G. S. Pradeep [5 ]
Nassar, Ibrahim F. [6 ]
Affiliations
[1] Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, Rajpura, India
[2] Chandigarh University, Punjab, Gharuan, Mohali, India
[3] MMICTBM, Maharishi Markandeshwar (Deemed to be University), Haryana, Mullana, Ambala, India
[4] Department of Electronics and Communication Engineering, MMEC, Maharishi Markandeshwar (Deemed to be University), Haryana, Mullana, Ambala, India
[5] Department of Computer Science and Engineering, Alliance College of Engineering and Design, Alliance University, Bengaluru, India
[6] Faculty of Specific Education, Ain Shams University, 365 Ramsis Street, Abassia, Cairo, Egypt
Keywords
K-nearest neighbor; Logistic regression; Lung cancer; Machine learning; Naive Bayes; Nearest neighbour; Pearson correlation; Pearson correlation weighted KNN; Random forest; Support vector machine
DOI
10.1007/s00521-024-10235-w
Abstract
Lung cancer is one of the world's deadliest diseases. Machine learning techniques can help diagnose it from a small set of features. To create a systematic approach for diagnosing lung cancer from readily observable signs and historical medical data, without requiring CT scan images, the authors evaluated several classifiers on a dataset available on Kaggle: support vector machine (SVM), logistic regression (LR), Naïve Bayes (NB), random forest (RF), and K-nearest neighbor (KNN). They also propose a novel classification approach, Pearson correlation weighted KNN (PCWKNN), a modified version of KNN that uses Pearson correlation coefficient values to determine the weights in a weighted KNN. The classifiers were evaluated using the hold-out validation method. SVM, LR, and RF each reached 96.77% accuracy, NB obtained 95.16%, and KNN achieved 91.93%. PCWKNN outperformed the other classifiers with an accuracy of 98.39%. To assess model generalization, the researchers applied PCWKNN to an alternative, more extensive lung cancer dataset and then extended it to other diseases, including a brain stroke dataset. The encouraging outcomes underscore PCWKNN's resilience and adaptability, suggesting its viability for real-world implementation. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
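The abstract does not spell out how the Pearson coefficients enter the weighted KNN, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes that each feature is weighted by the absolute Pearson correlation between that feature and the class label, that the weights scale a squared Euclidean distance, and that the prediction is a majority vote over the k nearest training points. The function names `pearson_feature_weights` and `pcw_knn_predict` are hypothetical.

```python
import numpy as np

def pearson_feature_weights(X, y):
    """Absolute Pearson correlation of each feature column with the label.

    Assumption: this is one plausible way to derive per-feature weights;
    the abstract does not give the exact formula.
    """
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    denom = np.sqrt((Xc ** 2).sum(axis=0) * (yc ** 2).sum())
    # Constant features have zero variance, so they receive zero weight.
    safe = np.where(denom > 0, denom, 1.0)
    r = np.where(denom > 0, (Xc * yc[:, None]).sum(axis=0) / safe, 0.0)
    return np.abs(r)

def pcw_knn_predict(X_train, y_train, X_test, k=3):
    """Majority-vote KNN with a correlation-weighted Euclidean distance."""
    w = pearson_feature_weights(X_train, y_train)
    preds = []
    for x in X_test:
        # Weighted distance: strongly correlated features count for more.
        d = np.sqrt((w * (X_train - x) ** 2).sum(axis=1))
        nearest = np.argsort(d, kind="stable")[:k]
        labels, counts = np.unique(y_train[nearest], return_counts=True)
        preds.append(labels[np.argmax(counts)])
    return np.array(preds)
```

Under these assumptions, a feature that is uncorrelated with the label contributes nothing to the distance, which is how such a weighting could improve on plain KNN when some attributes are uninformative.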
Pages: 22403-22416
Number of pages: 13