Volatile Organic Compounds for the Prediction of Lung Cancer by Using Ensembled Machine Learning Model and Feature Selection

被引:0
|
作者
Khanna, Divya [1 ]
Kumar, Arun [2 ]
Ahmad Bhat, Shahid [3 ]
机构
[1] Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, Rajpura,140401, India
[2] Madhav Institute of Technology and Science, Centre for Artificial Intelligence, Madhya Pradesh, Gwalior,474005, India
[3] LUT University, LUT Business School, Lappeenranta,53851, Finland
关键词
Blood - Prediction models;
D O I
10.1109/ACCESS.2025.3527027
中图分类号
学科分类号
摘要
The advancement of biomarkers is critically important at present, as lung cancer is a leading cause of death. In the present study, volatile organic compounds (VOCs) are considered as biomarkers to predict lung cancer. VOCs from seven different sources including breath, blood, urine, cell line, plerual fluid, cancer tissue and lung tissue are targeted to enhance the prediction reliability. Feature selection and models fusion have been focused on during this study. Five in-built and one proposed ensemble machine learning model have been utilised to investigate the different types of VOCs. The idea behind designing one ensemble model is to combine multiple individual models for better performance by using optimal feature sets. This reasoning led to the design of an ensemble model to predict breath VOCs. The AvNNet model has superior performance in predicting blood VOCs, cancer tissue VOCs, cell line VOCs, and urine VOCs compared to four other models, achieving accuracies of 70%, 80%, 70%, and 90% accordingly on the validation dataset. The Blackboost model achieved 90% accuracy on the validation dataset in its prediction of lung tissue VOCs. With 90% accuracy on a validation dataset, the random forest model predicts pleural fluid volatile organic compounds efficiently. When compared to individual models, the proposed ensemble model predicts breath VOCs more effectively and achieves 100% accuracy on the validation dataset. © 2013 IEEE.
引用
收藏
页码:9809 / 9820
相关论文
共 50 条
  • [1] Machine Learning and Feature Selection Methods for EGFR Mutation Status Prediction in Lung Cancer
    Morgado, Joana
    Pereira, Tania
    Silva, Francisco
    Freitas, Claudia
    Negrao, Eduardo
    de Lima, Beatriz Flor
    da Silva, Miguel Correia
    Madureira, Antonio J.
    Ramos, Isabel
    Hespanhol, Venceslau
    Costa, Jose Luis
    Cunha, Antonio
    Oliveira, Helder P.
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (07):
  • [2] A Comparative Study for Breast Cancer Prediction using Machine Learning and Feature Selection
    Dhanya, R.
    Paul, Irene Rose
    Akula, Sai Sindhu
    Sivakumar, Madhumathi
    Nair, Jyothisha J.
    [J]. PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), 2019, : 1049 - 1055
  • [3] Feature selection and classification in breast cancer prediction using IoT and machine learning
    Gopal, V. Nanda
    Al-Turjman, Fadi
    Kumar, R.
    Anand, L.
    Rajesh, M.
    [J]. MEASUREMENT, 2021, 178
  • [4] Lung Cancer Prediction Using Stochastic Diffusion Search (SDS) Based Feature Selection and Machine Learning Methods
    S. Shanthi
    N. Rajkumar
    [J]. Neural Processing Letters, 2021, 53 : 2617 - 2630
  • [5] Lung Cancer Prediction Using Stochastic Diffusion Search (SDS) Based Feature Selection and Machine Learning Methods
    Shanthi, S.
    Rajkumar, N.
    [J]. NEURAL PROCESSING LETTERS, 2021, 53 (04) : 2617 - 2630
  • [6] Feature Extraction Techniques Using Multivariate Analysis for Identification of Lung Cancer Volatile Organic Compounds
    Thriumani, Reena
    Zakaria, Ammar
    Hashim, Yumi Zuhanis Has-Yun
    Helmy, Khaled Mohamed
    Omar, Mohammad Iqbal
    Jeffree, Amanina
    Adom, Abdul Hamid
    Shakaff, Ali Yeon Md
    Kamarudin, Latifah Munirah
    [J]. 11TH ASIAN CONFERENCE ON CHEMICAL SENSORS (ACCS2015), 2017, 1808
  • [7] Prediction of core cancer genes using a hybrid of feature selection and machine learning methods
    Liu, Y. X.
    Zhang, N. N.
    He, Y.
    Lun, L. J.
    [J]. GENETICS AND MOLECULAR RESEARCH, 2015, 14 (03): : 8871 - 8882
  • [8] Breast cancer prediction with transcriptome profiling using feature selection and machine learning methods
    Taghizadeh, Eskandar
    Heydarheydari, Sahel
    Saberi, Alihossein
    JafarpoorNesheli, Shabnam
    Rezaeijo, Seyed Masoud
    [J]. BMC BIOINFORMATICS, 2022, 23 (01)
  • [9] Breast cancer prediction with transcriptome profiling using feature selection and machine learning methods
    Eskandar Taghizadeh
    Sahel Heydarheydari
    Alihossein Saberi
    Shabnam JafarpoorNesheli
    Seyed Masoud Rezaeijo
    [J]. BMC Bioinformatics, 23
  • [10] Eye state Prediction using Ensembled Machine Learning Models
    Singla, Dipali
    Rana, Prashant Singh
    [J]. 2016 INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT), VOL 2, 2016, : 246 - 250