A hybrid approach for lung cancer diagnosis using optimized random forest classification and K-means visualization algorithm

Cited by: 8
Authors
Bhattacharjee, Ananya [1]
Murugan, R. [1]
Goel, Tripti [1]
Affiliations
[1] Natl Inst Technol Silchar, Dept Elect & Commun Engn, Biomed Imaging Lab BIOMIL, Silchar 788010, Assam, India
Keywords
Lung cancer; Visualization; Hyperparameter optimization; Feature extraction; Segmentation and optimized random forest; FEATURE-SELECTION; FRAMEWORK; NETWORK
DOI
10.1007/s12553-022-00679-2
Chinese Library Classification
R-058
Abstract
Lung cancer detection has become one of the most challenging problems in oncology. Detecting nodules with the naked eye is an arduous task for radiologists. The main goal of this paper is to present a well-defined approach for malignant nodule detection from computed tomography scans, together with a visualization tool that shows how the extracted features contribute to the malignant cluster. Inspired by hyperparameter optimization and visualization techniques, we deployed a hybrid approach based on an optimized random forest classifier and a K-means visualization tool: the former tunes the model's hyperparameters for the best results, and the latter visualizes the malignant and non-malignant clusters. Of the four hyperparameter optimization experiments performed, the best model classified malignant and non-malignant cases effectively and achieved a 10-fold cross-validation accuracy of 92.14% on the LIDC-IDRI dataset. Moreover, the best visualization configuration obtained the lowest inertia score (16.21) and the highest silhouette score (0.815). The proposed hybrid approach appears apt for lung cancer diagnosis. Integrating the visualization approach made it possible to localize the malignant cluster and draw inferences from it.
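The abstract reports two cluster-quality figures for the K-means step: inertia (lower is tighter) and the silhouette score (closer to 1 is better separated). The sketch below shows how these two metrics are commonly computed. It is not the authors' pipeline: the data are synthetic 2-D points rather than LIDC-IDRI features, the function names (`kmeans`, `inertia`, `silhouette`) are illustrative, and it uses only the Python standard library, so it does not reproduce the reported values of 16.21 and 0.815.

```python
import math
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def assign(points, centroids):
    """Index of the nearest centroid for every point."""
    return [min(range(len(centroids)),
                key=lambda c: squared_distance(p, centroids[c]))
            for p in points]


def kmeans(points, k, iterations=50, seed=0):
    """Plain Lloyd's algorithm; returns (centroids, labels)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # initial centroids drawn from the data
    for _ in range(iterations):
        labels = assign(points, centroids)
        updated = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                updated.append(tuple(sum(col) / len(members)
                                     for col in zip(*members)))
            else:
                updated.append(centroids[c])  # keep an empty cluster's centroid
        if updated == centroids:
            break
        centroids = updated
    return centroids, assign(points, centroids)


def inertia(points, centroids, labels):
    """Sum of squared distances from each point to its assigned centroid."""
    return sum(squared_distance(p, centroids[lab])
               for p, lab in zip(points, labels))


def silhouette(points, labels):
    """Mean silhouette coefficient over all points (needs >= 2 clusters)."""
    clusters = sorted(set(labels))
    scores = []
    for i, p in enumerate(points):
        mean_dist = {}
        for c in clusters:
            dists = [math.dist(p, q) for j, q in enumerate(points)
                     if labels[j] == c and j != i]
            mean_dist[c] = sum(dists) / len(dists) if dists else None
        a = mean_dist[labels[i]]
        if a is None:  # singleton cluster: silhouette is 0 by convention
            scores.append(0.0)
            continue
        b = min(d for c, d in mean_dist.items()
                if c != labels[i] and d is not None)
        scores.append((b - a) / max(a, b))
    return sum(scores) / len(scores)


# Synthetic stand-in for two feature clusters (assumption, not real data).
rng = random.Random(42)
points = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(30)]
points += [(rng.gauss(10, 1), rng.gauss(10, 1)) for _ in range(30)]

centroids, labels = kmeans(points, k=2)
inertia_value = inertia(points, centroids, labels)
silhouette_value = silhouette(points, labels)
print(f"inertia={inertia_value:.2f} silhouette={silhouette_value:.3f}")
```

Comparing these two numbers across several values of `k` or several feature subsets is one common way to pick a "best visualization configuration"; whether the paper selected its configuration this way is not stated in the abstract.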
Pages: 787-800
Page count: 14
Source: Health and Technology, 2022, vol. 12