Machine learning-based approaches for cancer prediction using microbiome data

被引:2
|
作者
Freitas, Pedro [1 ,2 ]
Silva, Francisco [1 ,3 ]
Sousa, Joana Vale [1 ,2 ]
Ferreira, Rui M. [4 ,5 ]
Figueiredo, Ceu [4 ,5 ,6 ]
Pereira, Tania [1 ]
Oliveira, Helder P. [1 ,3 ]
机构
[1] INESC TEC Inst Syst & Comp Engn Technol & Sci, P-4200465 Porto, Portugal
[2] Univ Porto, FEUP Fac Engn, P-4200465 Porto, Portugal
[3] Univ Porto, FCUP Fac Sci, P-4150177 Porto, Portugal
[4] Univ Porto, Ipatimup Inst Mol Pathol & Immunol, P-4200135 Porto, Portugal
[5] Univ Porto, I3S Inst Invest & Inovacao Saude, P-4200135 Porto, Portugal
[6] Univ Porto, FMUP Fac Med, P-4200319 Porto, Portugal
来源
SCIENTIFIC REPORTS | 2023年 / 13卷 / 01期
关键词
REVEALS; TISSUE; TUMOR;
D O I
10.1038/s41598-023-38670-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Emerging evidence of the relationship between the microbiome composition and the development of numerous diseases, including cancer, has led to an increasing interest in the study of the human microbiome. Technological breakthroughs regarding DNA sequencing methods propelled microbiome studies with a large number of samples, which called for the necessity of more sophisticated data-analytical tools to analyze this complex relationship. The aim of this work was to develop a machine learning-based approach to distinguish the type of cancer based on the analysis of the tissue-specific microbial information, assessing the human microbiome as valuable predictive information for cancer identification. For this purpose, Random Forest algorithms were trained for the classification of five types of cancer-head and neck, esophageal, stomach, colon, and rectum cancers-with samples provided by The Cancer Microbiome Atlas database. One versus all and multi-class classification studies were conducted to evaluate the discriminative capability of the microbial data across increasing levels of cancer site specificity, with results showing a progressive rise in difficulty for accurate sample classification. Random Forest models achieved promising performances when predicting head and neck, stomach, and colon cancer cases, with the latter returning accuracy scores above 90% across the different studies conducted. However, there was also an increased difficulty when discriminating esophageal and rectum cancers, failing to differentiate with adequate results rectum from colon cancer cases, and esophageal from head and neck and stomach cancers. These results point to the fact that anatomically adjacent cancers can be more complex to identify due to microbial similarities. Despite the limitations, microbiome data analysis using machine learning may advance novel strategies to improve cancer detection and prevention, and decrease disease burden.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Machine learning-based approaches for cancer prediction using microbiome data
    Pedro Freitas
    Francisco Silva
    Joana Vale Sousa
    Rui M. Ferreira
    Céu Figueiredo
    Tania Pereira
    Hélder P. Oliveira
    [J]. Scientific Reports, 13 (1)
  • [2] Machine learning-based colorectal cancer prediction using global dietary data
    Abdul Rahman, Hanif
    Ottom, Mohammad Ashraf
    Dinov, Ivo D.
    [J]. BMC CANCER, 2023, 23 (01)
  • [3] Machine learning-based colorectal cancer prediction using global dietary data
    Hanif Abdul Rahman
    Mohammad Ashraf Ottom
    Ivo D. Dinov
    [J]. BMC Cancer, 23
  • [4] Machine learning-based approaches for disease gene prediction
    Duc-Hau Le
    [J]. BRIEFINGS IN FUNCTIONAL GENOMICS, 2020, 19 (5-6) : 350 - 363
  • [5] Machine Learning-Based Prediction of Cattle Activity Using Sensor-Based Data
    Hernandez, Guillermo
    Gonzalez-Sanchez, Carlos
    Gonzalez-Arrieta, Angelica
    Sanchez-Brizuela, Guillermo
    Fraile, Juan-Carlos
    [J]. SENSORS, 2024, 24 (10)
  • [6] Machine Learning-Based Cellular Traffic Prediction Using Data Reduction Techniques
    Nashaat, Heba
    Mohammed, Nihal H.
    Abdel-Mageid, Salah M.
    Rizk, Rawya Y.
    [J]. IEEE ACCESS, 2024, 12 : 58927 - 58939
  • [7] Machine Learning-Based Prediction of Hemoglobinopathies Using Complete Blood Count Data
    Schipper, Anoeska
    Rutten, Matthieu
    van Gammeren, Adriaan
    Harteveld, Cornelis L.
    Urrechaga, Eloisa
    Weerkamp, Floor
    den Besten, Gijs
    Krabbe, Johannes
    Slomp, Jennichjen
    Schoonen, Lise
    Broeren, Maarten
    van Wijnen, Merel
    Huijskens, Mirelle J. A. J.
    Koopmann, Tamara
    van Ginneken, Bram
    Kusters, Ron
    Kurstjens, Steef
    [J]. CLINICAL CHEMISTRY, 2024, 70 (08) : 1064 - 1075
  • [8] Machine learning-based prediction of diabetic patients using blood routine data
    Li, Honghao
    Su, Dongqing
    Zhang, Xinpeng
    He, Yuanyuan
    Luo, Xu
    Xiong, Yuqiang
    Zou, Min
    Wei, Huiyan
    Wen, Shaoran
    Xi, Qilemuge
    Zuo, Yongchun
    Yang, Lei
    [J]. METHODS, 2024, 229 : 156 - 162
  • [9] Machine learning-based prediction of cancer immunotherapy response using circulating cytokines
    Wei, Feifei
    Azuma, Koichi
    Nakahara, Yoshiro
    Saito, Haruhiro
    Kouro, Taku
    Himuro, Hidetomo
    Horaguchi, Shun
    Tsuji, Kayoko
    Sasada, Tetsuro
    [J]. CANCER SCIENCE, 2023, 114 : 1013 - 1013
  • [10] BREAST CANCER PREDICTION USING MACHINE LEARNING APPROACHES
    Kiran, B. Kranthi
    [J]. JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES, 2019, 14 (06): : 149 - 155