Biomarker discovery studies for patient stratification using machine learning analysis of omics data: a scoping review

被引:28
|
作者
Glaab, Enrico [1 ]
Rauschenberger, Armin [1 ]
Banzi, Rita [2 ]
Gerardi, Chiara [2 ]
Garcia, Paula [3 ]
Demotes, Jacques [3 ]
机构
[1] Univ Luxembourg, Luxembourg Ctr Syst Biomed, Esch Sur Alzette, Luxembourg
[2] Ist Ric Farmacol Mario Negri IRCCS, Ctr Hlth Regulatory Policies, Milan, Italy
[3] ECRIN, European Clin Res Infrastruct Network, Paris, France
来源
BMJ OPEN | 2021年 / 11卷 / 12期
基金
欧盟地平线“2020”;
关键词
biomarkers; scoping review; omics; machine learning; stratification; GENOMIC CLASSIFIER COLOPRINT; PROSTATE-CANCER CLASSIFIER; GENE-EXPRESSION CLASSIFIER; ONCOTYPE DX; MOLECULAR CLASSIFICATION; 70-GENE SIGNATURE; RECURRENCE SCORE; PRIMARY SITE; VALIDATION; BIOPSY;
D O I
10.1136/bmjopen-2021-053674
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objective To review biomarker discovery studies using omics data for patient stratification which led to clinically validated FDA-cleared tests or laboratory developed tests, in order to identify common characteristics and derive recommendations for future biomarker projects. Design Scoping review. Methods We searched PubMed, EMBASE and Web of Science to obtain a comprehensive list of articles from the biomedical literature published between January 2000 and July 2021, describing clinically validated biomarker signatures for patient stratification, derived using statistical learning approaches. All documents were screened to retain only peer-reviewed research articles, review articles or opinion articles, covering supervised and unsupervised machine learning applications for omics-based patient stratification. Two reviewers independently confirmed the eligibility. Disagreements were solved by consensus. We focused the final analysis on omics-based biomarkers which achieved the highest level of validation, that is, clinical approval of the developed molecular signature as a laboratory developed test or FDA approved tests. Results Overall, 352 articles fulfilled the eligibility criteria. The analysis of validated biomarker signatures identified multiple common methodological and practical features that may explain the successful test development and guide future biomarker projects. These include study design choices to ensure sufficient statistical power for model building and external testing, suitable combinations of non-targeted and targeted measurement technologies, the integration of prior biological knowledge, strict filtering and inclusion/exclusion criteria, and the adequacy of statistical and machine learning methods for discovery and validation. Conclusions While most clinically validated biomarker models derived from omics data have been developed for personalised oncology, first applications for non-cancer diseases show the potential of multivariate omics biomarker design for other complex disorders. Distinctive characteristics of prior success stories, such as early filtering and robust discovery approaches, continuous improvements in assay design and experimental measurement technology, and rigorous multicohort validation approaches, enable the derivation of specific recommendations for future studies.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Transparent Exploration of Machine Learning for Biomarker Discovery from Proteomics and Omics Data
    Torun, Furkan M.
    Winter, Sebastian Virreira
    Doll, Sophia
    Riese, Felix M.
    Vorobyev, Artem
    Mueller-Reif, Johannes B.
    Geyer, Philipp E.
    Strauss, Maximilian T.
    [J]. JOURNAL OF PROTEOME RESEARCH, 2022, : 359 - 367
  • [2] Using machine learning approaches for multi-omics data analysis: A review
    Reel, Parminder S.
    Reel, Smarti
    Pearson, Ewan
    Trucco, Emanuele
    Jefferson, Emily
    [J]. BIOTECHNOLOGY ADVANCES, 2021, 49
  • [3] Individualized Patient Risk Stratification Using Machine Learning and Topological Data Analysis
    Ng, Arnold C. T.
    Delgado, Victoria
    Bax, Jeroen J.
    [J]. JACC-CARDIOVASCULAR IMAGING, 2020, 13 (05) : 1133 - 1134
  • [4] Editorial: Leveraging machine learning for omics-driven biomarker discovery
    Li, Sheng
    Hsu, Charles
    Zhao, Tianyi
    He, Liangcan
    [J]. FRONTIERS IN MOLECULAR BIOSCIENCES, 2023, 9
  • [5] Methodology for biomarker discovery with reproducibility in microbiome data using machine learning
    Rojas-Velazquez, David
    Kidwai, Sarah
    Kraneveld, Aletta D.
    Tonda, Alberto
    Oberski, Daniel
    Garssen, Johan
    Lopez-Rincon, Alejandro
    [J]. BMC BIOINFORMATICS, 2024, 25 (01)
  • [6] Methodology for biomarker discovery with reproducibility in microbiome data using machine learning
    David Rojas-Velazquez
    Sarah Kidwai
    Aletta D. Kraneveld
    Alberto Tonda
    Daniel Oberski
    Johan Garssen
    Alejandro Lopez-Rincon
    [J]. BMC Bioinformatics, 25
  • [7] Machine Learning for Biomarker Discovery in Cancer Pharmacogenomics Data
    Mer, Arvind Singh
    Smirnov, Petr
    Haibe-Kains, Benjamin
    [J]. ACM-BCB'19: PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND HEALTH INFORMATICS, 2019, : 557 - 557
  • [8] Exploratory pharmacovigilance with machine learning in big patient data: A focused scoping review
    Kaas-Hansen, Benjamin Skov
    Gentile, Simona
    Caioli, Alessandro
    Andersen, Stig Ejdrup
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2023, 132 (03) : 233 - 241
  • [9] Heart and brain traumatic stress biomarker analysis with and without machine learning: A scoping review
    Rountree-Harrison, Darius
    Berkovsky, Shlomo
    Kangas, Maria
    [J]. INTERNATIONAL JOURNAL OF PSYCHOPHYSIOLOGY, 2023, 185 : 27 - 49
  • [10] Multi-omics analysis using machine learning and implications for cancer studies
    Asada, Ken
    Takasawa, Ken
    Shiraishi, Kouya
    Horinouchi, Hidehito
    Yoshida, Yukihiro
    Mukai, Masami
    Shinkai, Norio
    Yatabe, Yasushi
    Kohno, Takashi
    Hamamoto, Ryuji
    [J]. CANCER SCIENCE, 2023, 114 : 231 - 231