A novel K-nearest neighbor classifier for lung cancer disease diagnosis

被引：0

作者：

Sachdeva, Ravi Kumar ^{[1
]}

Bathla, Priyanka ^{[2
]}

Rani, Pooja ^{[3
]}

Lamba, Rohit ^{[4
]}

Ghantasala, G. S. Pradeep ^{[5
]}

Nassar, Ibrahim F. ^{[6
]}

机构：

[1] Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, Rajpura, India

[2] Chandigarh University, Punjab, Gharuan, Mohali, India

[3] MMICTBM, Maharishi Markandeshwar (Deemed to be University), Haryana, Mullana, Ambala, India

[4] Department of Electronics and Communication Engineering, MMEC, Maharishi Markandeshwar (Deemed to be University), Haryana, Mullana, Ambala, India

[5] Department of Computer Science and Engineering, Alliance College of Engineering and Design, Alliance University, Bengaluru, India

[6] Faculty of Specific Education, Ain Shams University, 365 Ramsis Street, Abassia, Cairo, Egypt

来源：

Neural Computing and Applications | 2024年 / 36卷 / 35期

关键词：

K-near neighbor - Logistics regressions - Lung Cancer - Machine-learning - Naive bayes - Nearest-neighbour - Pearson correlation - Pearson correlation weighted KNN - Random forests - Support vectors machine;

D O I：

10.1007/s00521-024-10235-w

中图分类号：

学科分类号：

摘要：

One of the world's deadliest diseases is lung cancer. Based on a few features, machine learning techniques can help in the diagnosis of lung cancer. The performance of several classifiers: support vector machine (SVM), logistic regression (LR), Naïve Bayes (NB), random forest (RF), and K-nearest neighbor (KNN), was evaluated by the authors using the dataset available on Kaggle to create a systematic approach for the diagnosis of lung cancer disease based on readily observable signs and historical medical data without the requirement of CT scan images. The authors have proposed a novel approach for classification called Pearson correlation weighted KNN (PCWKNN), which is a modified version of KNN and uses Pearson correlation coefficient values to determine weights in a weighted KNN. The performance of the classifiers was evaluated using the hold-out validation method. SVM, LR, and RF were 96.77% accurate. NB obtained 95.16% accuracy. KNN achieved 91.93% accuracy. PCWKNN outperformed the employed classifiers and obtained an accuracy of 98.39%. Addressing the imperative for improved model generalization, the researchers utilized PCWKNN on an alternative, more extensive lung cancer dataset and subsequently broadened its application to diverse diseases, including the brain stroke dataset. The encouraging outcomes underscore PCWKNN's resilience and adaptability, suggesting its viability for real-world implementation. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.

引用

下载

页码：22403 / 22416

页数：13

共 50 条

[41] A representation coefficient-based k-nearest centroid neighbor classifier
Gou, Jianping
Sun, Liyuan
Du, Lan
Ma, Hongxing
Xiong, Taisong
Ou, Weihua
Zhan, Yongzhao
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 194
[42] Detection and Localization of Myocardial Infarction using K-nearest Neighbor Classifier
Arif, Muhammad
Malagore, Ijaz A.
Afsar, Fayyaz A.
JOURNAL OF MEDICAL SYSTEMS, 2012, 36 (01) : 279 - 289
[43] A Local Mean-Based k-Nearest Centroid Neighbor Classifier
Gou, Jianping
Yi, Zhang
Du, Lan
Xiong, Taisong
COMPUTER JOURNAL, 2012, 55 (09): : 1058 - 1071
[44] Arrhythmia Detection from Heartbeat Using k-Nearest Neighbor Classifier
Park, Juyoung
Lee, Kuyeon
Kang, Kyungtae
2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,
[45] Improving performance of the k-nearest neighbor classifier by tolerant rough sets
Bao, YG
Du, XY
Ishii, N
PROCEEDINGS OF THE THIRD INTERNATIONAL SYMPOSIUM ON COOPERATIVE DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2000, : 167 - 171
[46] A new fuzzy k-nearest neighbor classifier based on the Bonferroni mean
Kumbure, Mahinda Mailagaha
Luukka, Pasi
Collan, Mikael
PATTERN RECOGNITION LETTERS, 2020, 140 : 172 - 178
[47] A generalized mean distance-based k-nearest neighbor classifier
Gou, Jianping
Ma, Hongxing
Ou, Weihua
Zeng, Shaoning
Rao, Yunbo
Yang, Hebiao
EXPERT SYSTEMS WITH APPLICATIONS, 2019, 115 : 356 - 372
[48] On Convergence of the Class Membership Estimator in Fuzzy k-Nearest Neighbor Classifier
Banerjee, Imon
Mullick, Sankha Subhra
Das, Swagatam
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2019, 27 (06) : 1226 - 1236
[49] Frog sound identification using extended k-nearest neighbor classifier
Mukahar, Nordiana
Rosdi, Bakhtiar Affendi
Ramli, Dzati Athiar
Jaafar, Haryati
1ST INTERNATIONAL CONFERENCE ON APPLIED & INDUSTRIAL MATHEMATICS AND STATISTICS 2017 (ICOAIMS 2017), 2017, 890
[50] A Local Mean Representation-based K-Nearest Neighbor Classifier
Gou, Jianping
Qiu, Wenmo
Yi, Zhang
Xu, Yong
Mao, Qirong
Zhan, Yongzhao
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2019, 10 (03)

← 1 2 3 4 5 →