Multi-Omics Marker Analysis Enables Early Prediction of Breast Tumor Progression

被引:12
|
作者
Xu, Haifeng [1 ,2 ]
Lien, Tonje [1 ]
Bergholtz, Helga [1 ]
Fleischer, Thomas [1 ]
Djerroudi, Lounes [3 ]
Vincent-Salomon, Anne [3 ]
Sorlie, Therese [1 ]
Aittokallio, Tero [1 ,2 ,4 ]
机构
[1] Oslo Univ Hosp, Inst Canc Res, Dept Canc Genet, Oslo, Norway
[2] Univ Oslo, Oslo Ctr Biostat & Epidemiol OCBE, Oslo, Norway
[3] Ensemble Hosp, Inst Curie, Pole Med Diagnost & Theranost, Dept Pathol, Paris, France
[4] Univ Helsinki, Inst Mol Med Finland FIMM, HiLIFE, Helsinki, Finland
基金
芬兰科学院;
关键词
risk signature; breast cancer; disease progression; early detection; machine learning; CARCINOMA IN-SITU; DUCTAL CARCINOMA; LOCAL RECURRENCE; FOLLOW-UP; CANCER; EXPRESSION; GRADE; MODELS; BIOPSY; CELLS;
D O I
10.3389/fgene.2021.670749
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Ductal carcinoma in situ (DCIS) is a preinvasive form of breast cancer with a highly variable potential of becoming invasive and affecting mortality of the patients. Due to the lack of accurate markers of disease progression, many women with detected DCIS are currently overtreated. To distinguish those DCIS cases who are likely to require therapy from those who should be left untreated, there is a need for robust and predictive biomarkers extracted from molecular or genetic profiles. We developed a supervised machine learning approach that implements multi-omics feature selection and model regularization for the identification of biomarker combinations that could be used to distinguish low-risk DCIS lesions from those with a higher likelihood of progression. To investigate the genetic heterogeneity of disease progression, we applied this approach to 40 pure DCIS and 259 invasive breast cancer (IBC) samples profiled with genome-wide transcriptomics, DNA methylation, and DNA copy number variation. Feature selection using the multi-omics Lasso-regularized algorithm identified both known genes involved in breast cancer development, as well as novel markers for early detection. Even though the gene expression-based model features led to the highest classification accuracy alone, methylation data provided a complementary source of features and improved especially the sensitivity of correctly classifying DCIS cases. We also identified a number of repeatedly misclassified DCIS cases when using either the expression or methylation markers. A small panel of 10 gene markers was able to distinguish DCIS and IBC cases with high accuracy in nested cross-validation (AU-ROC = 0.99). The marker panel was not specific to any of the established breast cancer subtypes, suggesting that the 10-gene signature may provide a subtype-agnostic and cost-effective approach for breast cancer detection and patient stratification. We further confirmed high accuracy of the 10-gene signature in an external validation cohort (AU-ROC = 0.95), profiled using distinct transcriptomic assay, hence demonstrating robustness of the risk signature.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Integration of multi-omics data and deep phenotyping enables prediction of cytokine responses
    Bakker, Olivier B.
    Aguirre-Gamboa, Raul
    Sanna, Serena
    Oosting, Marije
    Smeekens, Sanne P.
    Jaeger, Martin
    Zorro, Maria
    Vosa, Urmo
    Withoff, Sebo
    Netea-Maier, Romana T.
    Koenen, Hans J. P. M.
    Joosten, Irma
    Xavier, Ramnik J.
    Franke, Lude
    Joosten, Leo A. B.
    Kumar, Vinod
    Wijmenga, Cisca
    Netea, Mihai G.
    Li, Yang
    NATURE IMMUNOLOGY, 2018, 19 (07) : 776 - +
  • [2] Integration of multi-omics data and deep phenotyping enables prediction of cytokine responses
    Olivier B. Bakker
    Raul Aguirre-Gamboa
    Serena Sanna
    Marije Oosting
    Sanne P. Smeekens
    Martin Jaeger
    Maria Zorro
    Urmo Võsa
    Sebo Withoff
    Romana T. Netea-Maier
    Hans J. P. M. Koenen
    Irma Joosten
    Ramnik J. Xavier
    Lude Franke
    Leo A. B. Joosten
    Vinod Kumar
    Cisca Wijmenga
    Mihai G. Netea
    Yang Li
    Nature Immunology, 2018, 19 : 776 - 786
  • [3] Editorial: Multi-omics analysis in tumor microenvironment and tumor heterogeneity
    Shi, Yuxin
    Zhang, Qinglin
    Mei, Jie
    Liu, Jinhui
    FRONTIERS IN GENETICS, 2023, 14
  • [4] Multiset correlation and factor analysis enables exploration of multi-omics data
    Brown, Brielin C.
    Wang, Collin
    Kasela, Silva
    Aguet, Francois
    Nachun, Daniel C.
    Taylor, Kent D.
    Tracy, Russell P.
    Durda, Peter
    Liu, Yongmei
    Johnson, W. Craig
    Van Den Berg, David
    Gupta, Namrata
    Gabriel, Stacy
    Smith, Joshua D.
    Gerzsten, Robert
    Clish, Clary
    Wong, Quenna
    Papanicolau, George
    Blackwell, Thomas W.
    Rotter, Jerome I.
    Rich, Stephen S.
    Barr, R. Graham
    Ardlie, Kristin G.
    Knowles, David A.
    Lappalainen, Tuuli
    CELL GENOMICS, 2023, 3 (08):
  • [5] Survey on Multi-omics, and Multi-omics Data Analysis, Integration and Application
    Shahrajabian, Mohamad Hesam
    Sun, Wenli
    CURRENT PHARMACEUTICAL ANALYSIS, 2023, 19 (04) : 267 - 281
  • [6] Multi-Omics Analysis in Initiation and Progression of Meningiomas: From Pathogenesis to Diagnosis
    Liu, Jiachen
    Xia, Congcong
    Wang, Gaiqing
    FRONTIERS IN ONCOLOGY, 2020, 10
  • [7] Multi-omics analysis of early leaf development in Arabidopsis thaliana
    Omidbakhshfard, Mohammad Amin
    Sokolowska, Ewelina M.
    Di Vittori, Valerio
    de Souza, Leonardo Perez
    Kuhalskaya, Anastasiya
    Brotman, Yariv
    Alseekh, Saleh
    Fernie, Alisdair R.
    Skirycz, Aleksandra
    PATTERNS, 2021, 2 (04):
  • [8] MSFN: a multi-omics stacked fusion network for breast cancer survival prediction
    Zhang, Ge
    Ma, Chenwei
    Yan, Chaokun
    Luo, Huimin
    Wang, Jianlin
    Liang, Wenjuan
    Luo, Junwei
    FRONTIERS IN GENETICS, 2024, 15
  • [9] SMMART Program: A multi-omics tumor board with a focus on breast cancer.
    Kong, Ben L.
    Johnson, Brett E.
    Keck, Jamie M.
    Mitri, Souraya
    Leyshock, Patrick
    Stommel, Jayne M.
    Siex, Kiara
    Klinger, Marlana
    Zheng, Christina L.
    Williams-Belizaire, Rochelle
    McWeeney, Shannon
    Goecks, Jeremy
    Kolodzie, Annette
    Guimaraes, Alexander R.
    Thomas, George V.
    Corless, Christopher L.
    Mitri, Zahi I.
    Gray, Joe W.
    Mills, Gordon B.
    Bergan, Raymond C.
    CANCER RESEARCH, 2021, 81 (13)
  • [10] Integration of multi-omics datasets enables molecular classification of COPD
    Li, Chuan-Xing
    Wheelock, Craig E.
    Skold, C. Magnus
    Wheelock, Asa M.
    EUROPEAN RESPIRATORY JOURNAL, 2018, 51 (05)