A hybrid approach for lung cancer diagnosis using optimized random forest classification and K-means visualization algorithm

被引:8
|
作者
Bhattacharjee, Ananya [1 ]
Murugan, R. [1 ]
Goel, Tripti [1 ]
机构
[1] Natl Inst Technol Silchar, Dept Elect & Commun Engn, Biomed Imaging Lab BIOMIL, Silchar 788010, Assam, India
关键词
Lung cancer; Visualization; Hyperparameter optimization; Feature extraction; Segmentation and optimized random forest; FEATURE-SELECTION; FRAMEWORK; NETWORK;
D O I
10.1007/s12553-022-00679-2
中图分类号
R-058 [];
学科分类号
摘要
Lung cancer detection has become one of the most challenging oncology problems. It is an arduous task for radiologists to detect nodules based on the naked eye vision. The main goal of this paper is to present a well-defined approach for malignant nodule detection from computed tomography scans and a visualization tool to show how the extracted features are responsible for the malignant cluster. Inspired by hyperparameter optimization and visualization technique, we uniquely deployed a hybrid approach based on an optimized random forest classifier and a K-means visualization tool that tried to best tune the model's hyperparameters to provide the optimal results and visualize the malignant and non-malignant clusters, respectively. Out of the four experiments performed for the hyperparameter optimization, the best model classified malignant and non-malignant cases effectively and achieved a 10-Fold cross-validation accuracy of 92.14% on the LIDC-IDRI dataset. Moreover, the least inertia score and the highest silhouette score obtained by the best visualization configuration were 16.21 and 0.815, respectively. The proposed hybrid approach appeared to be apt for lung cancer diagnosis. The integration of the visualization approach provided the ability to localize the malignant cluster and hence drew inference out of it.
引用
收藏
页码:787 / 800
页数:14
相关论文
共 50 条
  • [31] ARRHYTHMIA DISEASE DIAGNOSIS USING NEURAL NETWORK, SVM, AND GENETIC ALGORITHM-OPTIMIZED k-MEANS CLUSTERING
    Martis, Roshan Joy
    Chakraborty, Chandan
    JOURNAL OF MECHANICS IN MEDICINE AND BIOLOGY, 2011, 11 (04) : 897 - 915
  • [32] Clustering of Lung Cancer Data Using Foggy K-Means
    Yadav, Akhilesh Kumar
    Tomar, Divya
    Agarwal, Sonali
    2013 INTERNATIONAL CONFERENCE ON RECENT TRENDS IN INFORMATION TECHNOLOGY (ICRTIT), 2013, : 13 - 18
  • [33] Hybrid Fragmentation of XML Data Warehouse Using K-Means Algorithm
    Kechar, Mohamed
    Bahloul, Safia Nait
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS (ADBIS 2014), 2014, 8716 : 70 - 82
  • [35] Automatic Lung Segmentation By Using Histogram Based K-means Algorithm
    Dincer, Esra
    Duru, Nevcihan
    2016 ELECTRIC ELECTRONICS, COMPUTER SCIENCE, BIOMEDICAL ENGINEERINGS' MEETING (EBBT), 2016,
  • [36] Using Random Forest Algorithm for Breast Cancer Diagnosis
    Dai, Bin
    Chen, Rung-Ching
    Zhu, Shun-Zhi
    Zhang, Wei-Wei
    2018 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2018), 2018, : 449 - 452
  • [37] Hybrid Intrusion Detection System using K-means and Random Tree Algorithms
    Aung, Yi Yi
    Min, Myat Myat
    2018 19TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2018, : 218 - 223
  • [38] A Hybrid Approach for Detecting the Brain Tumor in Hyperspectral Images Using Moth Flame Optimized K-Means Clustering
    Thiyagupriyadharsan, M. R.
    Suja, S.
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2019, 9 (07) : 1354 - 1360
  • [39] Hybrid Approach of EEG Stress Level Classification Using K-Means Clustering and Support Vector Machine
    Wen, Tee Yi
    Mohd Aris, Siti Armiza
    IEEE Access, 2022, 10 : 18370 - 18379
  • [40] Hybrid Approach of EEG Stress Level Classification Using K-Means Clustering and Support Vector Machine
    Wen, Tee Yi
    Aris, Siti Armiza Mohd
    IEEE ACCESS, 2022, 10 : 18370 - 18379