Volatile Organic Compounds for the Prediction of Lung Cancer by Using Ensembled Machine Learning Model and Feature Selection

Authors
Khanna, Divya [1]
Kumar, Arun [2]
Ahmad Bhat, Shahid [3]
Affiliations
[1] Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, Rajpura, 140401, India
[2] Madhav Institute of Technology and Science, Centre for Artificial Intelligence, Madhya Pradesh, Gwalior, 474005, India
[3] LUT University, LUT Business School, Lappeenranta, 53851, Finland
Keywords
Blood - Prediction models;
DOI
10.1109/ACCESS.2025.3527027
Abstract
Lung cancer is a leading cause of death, which makes the development of reliable biomarkers critically important. This study treats volatile organic compounds (VOCs) as biomarkers for predicting lung cancer. To improve the reliability of prediction, VOCs from seven sources are considered: breath, blood, urine, cell line, pleural fluid, cancer tissue, and lung tissue. The study focuses on feature selection and model fusion. Five built-in machine learning models and one proposed ensemble model are used to investigate the different types of VOCs. The ensemble model is designed to combine multiple individual models, using optimal feature sets, for better performance; it is applied to the prediction of breath VOCs. The AvNNet model outperforms the four other models in predicting blood VOCs, cancer tissue VOCs, cell line VOCs, and urine VOCs, achieving accuracies of 70%, 80%, 70%, and 90%, respectively, on the validation dataset. The Blackboost model achieves 90% accuracy on the validation dataset in predicting lung tissue VOCs. The random forest model predicts pleural fluid VOCs with 90% accuracy on the validation dataset. Compared with the individual models, the proposed ensemble model predicts breath VOCs more effectively, achieving 100% accuracy on the validation dataset. © 2013 IEEE.
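The general idea of combining individual models that each see a selected feature subset can be illustrated with a minimal majority-vote sketch. This is not the authors' ensemble: the abstract does not specify the fusion rule, and the threshold rules, feature indices, and toy values below are hypothetical stand-ins chosen only for illustration.

```python
from collections import Counter
from typing import Callable, Sequence

# A classifier maps one sample (a list of feature values) to a class label.
Classifier = Callable[[Sequence[float]], int]


def on_features(indices: Sequence[int], rule: Classifier) -> Classifier:
    """Wrap a rule so that it only sees the selected feature columns."""
    return lambda sample: rule([sample[i] for i in indices])


def majority_vote(models: Sequence[Classifier], sample: Sequence[float]) -> int:
    """Return the label predicted by the largest number of models."""
    votes = [model(sample) for model in models]
    return Counter(votes).most_common(1)[0][0]


def ensemble_accuracy(models, samples, labels) -> float:
    """Fraction of samples for which the majority vote matches the label."""
    correct = sum(majority_vote(models, s) == y for s, y in zip(samples, labels))
    return correct / len(labels)


# Hypothetical base models: simple threshold rules on toy feature values.
# Each one is restricted to its own (assumed) selected feature subset.
models = [
    on_features([0], lambda x: int(x[0] > 0.5)),
    on_features([1], lambda x: int(x[0] > 0.3)),
    on_features([0, 2], lambda x: int(x[0] + x[1] > 1.0)),
]

# Toy validation data, invented for this sketch only.
samples = [[0.9, 0.1, 0.4], [0.2, 0.8, 0.1]]
labels = [1, 0]

if __name__ == "__main__":
    print(ensemble_accuracy(models, samples, labels))
```

With an odd number of binary base models the vote can never tie, which is why three rules are used here; other fusion schemes, such as weighted voting or stacking, would follow the same outline.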
Pages: 9809-9820