Machine learning-based approaches for cancer prediction using microbiome data

被引:2
|
作者
Freitas, Pedro [1 ,2 ]
Silva, Francisco [1 ,3 ]
Sousa, Joana Vale [1 ,2 ]
Ferreira, Rui M. [4 ,5 ]
Figueiredo, Ceu [4 ,5 ,6 ]
Pereira, Tania [1 ]
Oliveira, Helder P. [1 ,3 ]
机构
[1] INESC TEC Inst Syst & Comp Engn Technol & Sci, P-4200465 Porto, Portugal
[2] Univ Porto, FEUP Fac Engn, P-4200465 Porto, Portugal
[3] Univ Porto, FCUP Fac Sci, P-4150177 Porto, Portugal
[4] Univ Porto, Ipatimup Inst Mol Pathol & Immunol, P-4200135 Porto, Portugal
[5] Univ Porto, I3S Inst Invest & Inovacao Saude, P-4200135 Porto, Portugal
[6] Univ Porto, FMUP Fac Med, P-4200319 Porto, Portugal
来源
SCIENTIFIC REPORTS | 2023年 / 13卷 / 01期
关键词
REVEALS; TISSUE; TUMOR;
D O I
10.1038/s41598-023-38670-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Emerging evidence of the relationship between the microbiome composition and the development of numerous diseases, including cancer, has led to an increasing interest in the study of the human microbiome. Technological breakthroughs regarding DNA sequencing methods propelled microbiome studies with a large number of samples, which called for the necessity of more sophisticated data-analytical tools to analyze this complex relationship. The aim of this work was to develop a machine learning-based approach to distinguish the type of cancer based on the analysis of the tissue-specific microbial information, assessing the human microbiome as valuable predictive information for cancer identification. For this purpose, Random Forest algorithms were trained for the classification of five types of cancer-head and neck, esophageal, stomach, colon, and rectum cancers-with samples provided by The Cancer Microbiome Atlas database. One versus all and multi-class classification studies were conducted to evaluate the discriminative capability of the microbial data across increasing levels of cancer site specificity, with results showing a progressive rise in difficulty for accurate sample classification. Random Forest models achieved promising performances when predicting head and neck, stomach, and colon cancer cases, with the latter returning accuracy scores above 90% across the different studies conducted. However, there was also an increased difficulty when discriminating esophageal and rectum cancers, failing to differentiate with adequate results rectum from colon cancer cases, and esophageal from head and neck and stomach cancers. These results point to the fact that anatomically adjacent cancers can be more complex to identify due to microbial similarities. Despite the limitations, microbiome data analysis using machine learning may advance novel strategies to improve cancer detection and prevention, and decrease disease burden.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Machine learning-based prediction of breast cancer growth rate in vivo
    Shristi Bhattarai
    Sergey Klimov
    Mohammed A. Aleskandarany
    Helen Burrell
    Anthony Wormall
    Andrew R. Green
    Padmashree Rida
    Ian O. Ellis
    Remus M. Osan
    Emad A. Rakha
    Ritu Aneja
    [J]. British Journal of Cancer, 2019, 121 : 497 - 504
  • [32] Machine learning-based prediction of breast cancer growth rate in vivo
    Bhattarai, Shristi
    Klimov, Sergey
    Aleskandarany, Mohammed A.
    Burrell, Helen
    Wormall, Anthony
    Green, Andrew R.
    Rida, Padmashree
    Ellis, Ian O.
    Osan, Remus M.
    Rakha, Emad A.
    Aneja, Ritu
    [J]. BRITISH JOURNAL OF CANCER, 2019, 121 (06) : 497 - 504
  • [33] A Machine Learning-Based Framework for the Prediction of Cervical Cancer Risk in Women
    Kaushik, Keshav
    Bhardwaj, Akashdeep
    Bharany, Salil
    Alsharabi, Naif
    Rehman, Ateeq Ur
    Eldin, Elsayed Tag
    Ghamry, Nivin A.
    [J]. SUSTAINABILITY, 2022, 14 (19)
  • [34] Machine learning-based prediction model and visual interpretation for prostate cancer
    Chen, Gang
    Dai, Xuchao
    Zhang, Mengqi
    Tian, Zhujun
    Jin, Xueke
    Mei, Kun
    Huang, Hong
    Wu, Zhigang
    [J]. BMC UROLOGY, 2023, 23 (01)
  • [35] Comparative Analysis of Machine Learning-Based Approaches for Anomaly Detection in Vehicular Data
    Demestichas, Konstantinos
    Alexakis, Theodoros
    Peppes, Nikolaos
    Adamopoulou, Evgenia
    [J]. VEHICLES, 2021, 3 (02): : 171 - 186
  • [36] Lifestyle and occupational risks assessment of bladder cancer using machine learning-based prediction models
    Shakhssalim, Naser
    Talebi, Atefeh
    Pahlevan-Fallahy, Mohammad-Taha
    Sotoodeh, Kasra
    Alavimajd, Hamid
    Borumandnia, Nasrin
    Taheri, Maryam
    [J]. CANCER REPORTS, 2023, 6 (09)
  • [37] Detecting False Data Injection Attacks Using Machine Learning-Based Approaches for Smart Grid Networks
    Abudin, M. D. Jainul
    Thokchom, Surmila
    Naayagi, R. T.
    Panda, Gayadhar
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (11):
  • [38] Electric Vehicle User Behavior Prediction using Learning-based Approaches
    Khan, Sara
    Brandherm, Boris
    Swamy, Anilkumar
    [J]. 2020 IEEE ELECTRIC POWER AND ENERGY CONFERENCE (EPEC), 2020,
  • [39] Machine Learning-based Approaches Comparison for Netflix/DAZN Streaming and Real Traffic Prediction
    Reticcioli, E.
    Di Girolamo, G. D.
    Di Carlo, C.
    Smarra, F.
    D'Innocenzo, A.
    [J]. 2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 3102 - 3107
  • [40] A novel framework for addressing uncertainties in machine learning-based geospatial approaches for flood prediction
    Adnan, Mohammed Sarfaraz Gani
    Siam, Zakaria Shams
    Kabir, Irfat
    Kabir, Zobaidul
    Ahmed, M. Razu
    Hassan, Quazi K.
    Rahman, Rashedur M.
    Dewan, Ashraf
    [J]. JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2023, 326