Tissue classification with gene expression profiles

被引:448
|
作者
Ben-Dor, A
Bruhn, L
Friedman, N
Nachman, I
Schummer, M
Yakhini, Z
机构
[1] Agilent Labs, Chem & Biol Syst Dept, Palo Alto, CA 94304 USA
[2] Hebrew Univ Jerusalem, Sch Engn & Comp Sci, IL-91904 Jerusalem, Israel
[3] Hebrew Univ Jerusalem, Ctr Computat Neurosci, IL-91904 Jerusalem, Israel
[4] Univ Washington, Seattle, WA 98105 USA
[5] Agilent Labs, IL-31905 Haifa, Israel
关键词
tissue classification; gene expression analysis; ovarian cancer; colon cancer;
D O I
10.1089/106652700750050943
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Constantly improving gene expression profiling technologies are expected to provide understanding and insight into cancer-related cellular processes. Gene expression data is also expected to significantly aid in the development of efficient cancer diagnosis and classification platforms. In this work we examine three sets of gene expression data measured across sets of tumor(s) and normal clinical samples: The first set consists of 2,000 genes, measured in 62 epithelial colon samples (Alon et al., 1999). The second consists of approximate to 100,000 clones, measured in 32 ovarian samples (unpublished extension of data set described in Schummer et al, (1999)). The third set consists of approximate to 7,100 genes, measured in 72 bone marrow and peripheral brood samples (Golub et al., 1999). We examine the use of scoring methods, measuring separation of tissue type (e.g., tumors from normals) using individual gene expression levels. These are then coupled with high-dimensional classification methods to assess the classification power of complete expression profiles. We present results of performing leave-one-one cross validation (LOOCV) experiments on the three data sets, employing nearest neighbor classifier, SVM (Cortes and Vapnik, 1995), AdaBoost (Freund and Schapire, 1997) and a novel clustering-based classification technique. As tumor samples can differ from normal samples in their cell-type composition, we also perform LOOCV experiments using appropriately modified sets of genes, attempting to eliminate the resulting bias. We demonstrate success rate of at least 90% in tumor versus normal classification, using sets of selected genes, with, as well as without, cellular-contamination-related members. These results are insensitive to the exact selection mechanism, over a certain range.
引用
收藏
页码:559 / 583
页数:25
相关论文
共 50 条
  • [41] Gene selection in arthritis classification with large-scale microarray expression profiles
    Sha, N
    Vannucci, M
    Brown, PJ
    Trower, MK
    Amiphlett, G
    Falciani, F
    [J]. COMPARATIVE AND FUNCTIONAL GENOMICS, 2003, 4 (02): : 171 - 181
  • [42] Towards precise classification of cancers based on robust gene functional expression profiles
    Guo, Z
    Zhang, TW
    Li, X
    Wang, Q
    Xu, JZ
    Yu, H
    Zhu, J
    Wang, HY
    Wang, CG
    Topol, EJ
    Wang, Q
    Rao, SQ
    [J]. BMC BIOINFORMATICS, 2005, 6 (1)
  • [43] Subtype dependent biomarker identification and tumor classification from gene expression profiles
    Wang, Aiguo
    An, Ning
    Chen, Guilin
    Liu, Li
    Alterovitz, Gil
    [J]. KNOWLEDGE-BASED SYSTEMS, 2018, 146 : 104 - 117
  • [44] Comparing the characteristics of gene expression profiles derived by univariate and multivariate classification methods
    Zucknick, Manuela
    Richardson, Sylvia
    Stronach, Euan A.
    [J]. STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2008, 7 (01)
  • [45] Towards precise classification of cancers based on robust gene functional expression profiles
    Zheng Guo
    Tianwen Zhang
    Xia Li
    Qi Wang
    Jianzhen Xu
    Hui Yu
    Jing Zhu
    Haiyun Wang
    Chenguang Wang
    Eric J Topol
    Qing Wang
    Shaoqi Rao
    [J]. BMC Bioinformatics, 6
  • [46] CANCER CLASSIFICATION FROM THE GENE EXPRESSION PROFILES BY DISCRIMINANT KERNEL-PLS
    Tang, Kai-Lin
    Yao, Wei-Jia
    Li, Tong-Hua
    Li, Yi-Xue
    Cao, Zhi-Wei
    [J]. JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2010, 8 : 147 - 160
  • [47] Classification of small cell lung cancer and pulmonary carcinoid by gene expression profiles
    Anbazhagan, R
    Tihan, T
    Bornman, DM
    Johnston, JC
    Saltz, JH
    Weigering, A
    Piantadosi, S
    Gabrielson, E
    [J]. CANCER RESEARCH, 1999, 59 (20) : 5119 - 5122
  • [48] Molecular classification of disease based on peripheral gene expression profiles in FTD and AD
    Coppola, Giovanni
    Karydas, Anna
    Suberlak, Matthew N.
    Miller, Bruce L.
    Geschwind, Daniel H.
    [J]. ANNALS OF NEUROLOGY, 2007, 62 : S52 - S52
  • [49] DWT based feature extraction of gene expression data for tissue classification
    Dong, XY
    Sun, GM
    Xu, GD
    [J]. PROCEEDINGS OF THE SECOND IASTED INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND COMPUTATIONAL INTELLIGENCE, 2004, : 37 - 42
  • [50] Using uncorrelated discriminant analysis for tissue classification with gene expression data
    Ye, JP
    Li, T
    Xiong, T
    Janardan, R
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2004, 1 (04) : 181 - 190