Breast cancer prognosis by combinatorial analysis of gene expression data

被引:215
|
作者
Alexe G. [1 ,2 ,3 ]
Alexe S. [1 ]
Axelrod D.E. [4 ,5 ]
Bonates T.O. [1 ]
Lozina I.I. [1 ]
Reiss M. [5 ,6 ]
Hammer P.L. [1 ]
机构
[1] RUTCOR (Rutgers University Center for Operations Research), Piscataway, NJ
[2] Computational Biology Center, TJ Watson IBM Research, Yorktown Heights, NY
[3] The Simons Center for Systems Biology, Institute for Advanced Study, Princeton, NJ
[4] Department of Genetics, Rutgers University, Piscataway, NJ
[5] The Cancer Institute of New Jersey, New Brunswick, NJ
[6] Division of Medical Oncology, UMDNJ-Robert Wood Johnson Medical School, New Brunswick, NJ
关键词
Breast Cancer; Negative Case; Negative Pattern; Positive Pattern; Prognostic System;
D O I
10.1186/bcr1512
中图分类号
学科分类号
摘要
Introduction: The potential of applying data analysis tools to microarray data for diagnosis and prognosis is illustrated on the recent breast cancer dataset of van't Veer and coworkers. We re-examine that dataset using the novel technique of logical analysis of data (LAD), with the double objective of discovering patterns characteristic for cases with good or poor outcome, using them for accurate and justifiable predictions; and deriving novel information about the role of genes, the existence of special classes of cases. and other factors. Method: Data were analyzed using the combinatorics and optimization-based method of LAD, recently shown to provide highly accurate diagnostic and prognostic systems in cardiology, cancer proteomics, hematology, pulmonology, and other disciplines. Results: LAD identified a subset of 17 of the 25,000 genes, capable of fully distinguishing between patients with poor, respectively good prognoses. An extensive list of 'patterns' or 'combinatorial biomarkers' (that is, combinations of genes and limitations on their expression levels) was generated, and 40 patterns were used to create a prognostic system, shown to have 100% and 92.9% weighted accuracy on the training and test sets, respectively. The prognostic system uses fewer genes than other methods, and has similar or better accuracy than those reported in other studies. Out of the 17 genes identified by LAD, three (respectively, five) were shown to play a significant role in determining poor (respectively, good) prognosis. Two new classes of patients (described by similar sets of covering patterns, gene expression ranges, and clinical features) were discovered. As a by-product of the study, it is shown that the training and the test sets of van't Veer have differing characteristics. Conclusion: The study shows that LAD provides an accurate and fully explanatory prognostic system for breast cancer using genomic data (that is, a system that, in addition to predicting good or poor prognosis, provides an individualized explanation of the reasons for that prognosis for each patient). Moreover, the LAD model provides valuable insights into the roles of individual and combinatorial biomarkers, allows the discovery of new classes of patients, and generates a vast library of biomedical research hypotheses. © 2006 Alexe et al; licensee BioMed Central Ltd.
引用
收藏
相关论文
共 50 条
  • [41] Effects of infiltrating lymphocytes and estrogen receptor on gene expression and prognosis in breast cancer
    Calabro, Alberto
    Beissbarth, Tim
    Kuner, Ruprecht
    Stojanov, Michael
    Benner, Axel
    Asslaber, Martin
    Ploner, Ferdinand
    Zatloukal, Kurt
    Samonigg, Hellmut
    Poustka, Annemarie
    Sueltmann, Holger
    BREAST CANCER RESEARCH AND TREATMENT, 2009, 116 (01) : 69 - 77
  • [42] Intrinsic bias in breast cancer gene expression data sets
    Jonathan D Mosley
    Ruth A Keri
    BMC Cancer, 9
  • [43] Intrinsic bias in breast cancer gene expression data sets
    Mosley, Jonathan D.
    Keri, Ruth A.
    BMC CANCER, 2009, 9
  • [44] Serial analysis of gene expression in breast cancer cells
    Aldaz, CM
    Bednarek, A
    Charpentier, A
    MacLeod, M
    Hawkins, K
    Laflin, K
    HORMONAL CARCINOGENESIS III, 2001, : 113 - 123
  • [45] Towards Improved Cancer Diagnosis and Prognosis Using Analysis of Gene Expression Data and Computer Aided Imaging
    Alexe, Gabriela
    Monaco, James
    Doyle, Scott
    Basavanhally, Ajay
    Reddy, Anupama
    Seiler, Michael
    Ganesan, Shridar
    Bhanot, Gyan
    Madabhushi, Anant
    EXPERIMENTAL BIOLOGY AND MEDICINE, 2009, 234 (08) : 860 - 879
  • [46] Predicting prognosis of breast cancer with gene signatures: are we lost in a sea of data?
    Takayuki Iwamoto
    Lajos Pusztai
    Genome Medicine, 2
  • [47] Predicting prognosis of breast cancer with gene signatures: are we lost in a sea of data?
    Iwamoto, Takayuki
    Pusztai, Lajos
    GENOME MEDICINE, 2010, 2
  • [48] Comprehensive Analysis of the Expression and Prognosis of chromobox Family Members in Breast Cancer
    Wang, Lu
    Zhao, Lin
    Zhang, Yujiao
    Shao, Shan
    Ning, Qian
    Zhao, Xinhan
    Luo, Minna
    CLINICAL BREAST CANCER, 2023, 23 (04) : E206 - E218
  • [49] Expression and prognosis analysis of mitochondrial ribosomal protein family in breast cancer
    Xiaoyi Lin
    Lijuan Guo
    Xin Lin
    Yulei Wang
    Guochun Zhang
    Scientific Reports, 12
  • [50] Comprehensive Analysis of the Expression, Prognosis, and Biological Significance of OVOLs in Breast Cancer
    Chen, Jingsheng
    Tang, Hongjun
    Li, Taidong
    Jiang, Kangwei
    Zhong, Haiming
    Wu, Yuye
    He, Jiangtao
    Li, Dongbing
    Li, Mengzhen
    Cai, Xingsheng
    INTERNATIONAL JOURNAL OF GENERAL MEDICINE, 2021, 14 : 3951 - 3960