A review of machine learning methods for cancer characterization from microbiome data

被引:1
|
作者
Teixeira, Marco [1 ,2 ]
Silva, Francisco [1 ,3 ]
Ferreira, Rui M. [4 ,5 ]
Pereira, Tania [1 ,6 ]
Figueiredo, Ceu [4 ,5 ,7 ]
Oliveira, Helder P. [1 ,3 ]
机构
[1] Inst Syst & Comp Engn Technol & Sci, Porto, Portugal
[2] Univ Porto, Fac Engn, Porto, Portugal
[3] Univ Porto, Fac Sci, Porto, Portugal
[4] Univ Porto, Ipatimup Inst Mol Pathol & Immunol, Porto, Portugal
[5] Univ Porto, Inst Invest & Inovacao Saude, Porto, Portugal
[6] Univ Coimbra, Fac Sci & Technol, Coimbra, Portugal
[7] Univ Porto, Fac Med, Porto, Portugal
基金
瑞典研究理事会;
关键词
COLORECTAL-CANCER; FEATURE-SELECTION; DECISION TREES; CLASSIFICATION; MODELS; GENE; BACTERIA; TISSUE; ATLAS; TUMOR;
D O I
10.1038/s41698-024-00617-7
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Recent studies have shown that the microbiome can impact cancer development, progression, and response to therapies suggesting microbiome-based approaches for cancer characterization. As cancer-related signatures are complex and implicate many taxa, their discovery often requires Machine Learning approaches. This review discusses Machine Learning methods for cancer characterization from microbiome data. It focuses on the implications of choices undertaken during sample collection, feature selection and pre-processing. It also discusses ML model selection, guiding how to choose an ML model, and model validation. Finally, it enumerates current limitations and how these may be surpassed. Proposed methods, often based on Random Forests, show promising results, however insufficient for widespread clinical usage. Studies often report conflicting results mainly due to ML models with poor generalizability. We expect that evaluating models with expanded, hold-out datasets, removing technical artifacts, exploring representations of the microbiome other than taxonomical profiles, leveraging advances in deep learning, and developing ML models better adapted to the characteristics of microbiome data will improve the performance and generalizability of models and enable their usage in the clinic.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Machine learning in breast cancer imaging: a review on data, models and methods
    Grinet, Macro S. V. M.
    Gouveia, Ana I. R.
    Gomes, Abel J. P.
    [J]. COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2024, 11 (07):
  • [2] Machine learning methods for microbiome studies
    Namkung, Junghyun
    [J]. JOURNAL OF MICROBIOLOGY, 2020, 58 (03) : 206 - 216
  • [3] Machine learning methods for microbiome studies
    Junghyun Namkung
    [J]. Journal of Microbiology, 2020, 58 : 206 - 216
  • [4] A Review and Tutorial of Machine Learning Methods for Microbiome Host Trait Prediction
    Zhou, Yi-Hui
    Gallins, Paul
    [J]. FRONTIERS IN GENETICS, 2019, 10
  • [5] Machine Learning Methods for Cancer Classification Using Gene Expression Data: A Review
    Alharbi, Fadi
    Vakanski, Aleksandar
    [J]. BIOENGINEERING-BASEL, 2023, 10 (02):
  • [6] A review of machine learning methods used for educational data
    Ersozlu, Zara
    Taheri, Sona
    Koch, Inge
    [J]. EDUCATION AND INFORMATION TECHNOLOGIES, 2024,
  • [7] A Review of Methods Used in Machine Learning and Data Analysis
    Wu, Qingyang
    [J]. ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 37 - 45
  • [8] A Review on Recent Progress in Machine Learning and Deep Learning Methods for Cancer Classification on Gene Expression Data
    Mazlan, Aina Umairah
    Sahabudin, Noor Azida
    Remli, Muhammad Akmal
    Ismail, Nor Syahidatul Nadiah
    Mohamad, Mohd Saberi
    Nies, Hui Wen
    Abd Warif, Nor Bakiah
    [J]. PROCESSES, 2021, 9 (08)
  • [9] Machine learning approaches to identify prognosis indicators from microbiome data
    Madgwick, M.
    Sudhakar, P.
    Korcsmaros, T.
    [J]. JOURNAL OF CROHNS & COLITIS, 2019, 13 : S99 - S100
  • [10] Gut microbiome, big data and machine learning to promote precision medicine for cancer
    Cammarota, Giovanni
    Ianiro, Gianluca
    Ahern, Anna
    Carbone, Carmine
    Temko, Andriy
    Claesson, Marcus J.
    Gasbarrini, Antonio
    Tortora, Giampaolo
    [J]. NATURE REVIEWS GASTROENTEROLOGY & HEPATOLOGY, 2020, 17 (10) : 635 - 648