Tissue classification with gene expression profiles

被引:448
|
作者
Ben-Dor, A
Bruhn, L
Friedman, N
Nachman, I
Schummer, M
Yakhini, Z
机构
[1] Agilent Labs, Chem & Biol Syst Dept, Palo Alto, CA 94304 USA
[2] Hebrew Univ Jerusalem, Sch Engn & Comp Sci, IL-91904 Jerusalem, Israel
[3] Hebrew Univ Jerusalem, Ctr Computat Neurosci, IL-91904 Jerusalem, Israel
[4] Univ Washington, Seattle, WA 98105 USA
[5] Agilent Labs, IL-31905 Haifa, Israel
关键词
tissue classification; gene expression analysis; ovarian cancer; colon cancer;
D O I
10.1089/106652700750050943
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Constantly improving gene expression profiling technologies are expected to provide understanding and insight into cancer-related cellular processes. Gene expression data is also expected to significantly aid in the development of efficient cancer diagnosis and classification platforms. In this work we examine three sets of gene expression data measured across sets of tumor(s) and normal clinical samples: The first set consists of 2,000 genes, measured in 62 epithelial colon samples (Alon et al., 1999). The second consists of approximate to 100,000 clones, measured in 32 ovarian samples (unpublished extension of data set described in Schummer et al, (1999)). The third set consists of approximate to 7,100 genes, measured in 72 bone marrow and peripheral brood samples (Golub et al., 1999). We examine the use of scoring methods, measuring separation of tissue type (e.g., tumors from normals) using individual gene expression levels. These are then coupled with high-dimensional classification methods to assess the classification power of complete expression profiles. We present results of performing leave-one-one cross validation (LOOCV) experiments on the three data sets, employing nearest neighbor classifier, SVM (Cortes and Vapnik, 1995), AdaBoost (Freund and Schapire, 1997) and a novel clustering-based classification technique. As tumor samples can differ from normal samples in their cell-type composition, we also perform LOOCV experiments using appropriately modified sets of genes, attempting to eliminate the resulting bias. We demonstrate success rate of at least 90% in tumor versus normal classification, using sets of selected genes, with, as well as without, cellular-contamination-related members. These results are insensitive to the exact selection mechanism, over a certain range.
引用
收藏
页码:559 / 583
页数:25
相关论文
共 50 条
  • [1] Functional embedding for the classification of gene expression profiles
    Wu, Ping-Shi
    Mueller, Hans-Georg
    [J]. BIOINFORMATICS, 2010, 26 (04) : 509 - 517
  • [2] Gene boosting for cancer classification based on gene expression profiles
    Hong, Jin-Hyuk
    Cho, Sung-Bae
    [J]. PATTERN RECOGNITION, 2009, 42 (09) : 1761 - 1767
  • [3] PCP: a program for supervised classification of gene expression profiles
    Buturovic, LJ
    [J]. BIOINFORMATICS, 2006, 22 (02) : 245 - 247
  • [4] Gene classification using expression profiles: A feasibility study
    Kuramochi, M
    Karypis, G
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2005, 14 (04) : 641 - 660
  • [5] A Classification Framework Applied to Cancer Gene Expression Profiles
    Hijazi, Hussein
    Chan, Christina
    [J]. JOURNAL OF HEALTHCARE ENGINEERING, 2013, 4 (02) : 255 - 283
  • [6] Gene classification using expression profiles: A feasibility study
    Kuramochi, M
    Karypis, G
    [J]. 2ND ANNUAL IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2001, : 191 - 200
  • [7] Classification of human cancer diseases by gene expression profiles
    Salem, Hanaa
    Attiya, Gamal
    El-Fishawy, Nawal
    [J]. APPLIED SOFT COMPUTING, 2017, 50 : 124 - 134
  • [8] GENE EXPRESSION PROFILES FOR MOLECULAR CLASSIFICATION OF MUTLIPLE MYELOMA
    Broyl, A.
    Hose, D.
    de Knegt, Y.
    Lokhorst, H.
    Goldschmidt, H.
    van Duin, M.
    Sonneveld, P.
    [J]. HAEMATOLOGICA-THE HEMATOLOGY JOURNAL, 2008, 93 : 77 - 77
  • [9] Gene expression profiles of human fetal nasopharyngeal tissue
    He, ZW
    Xu, LG
    Xie, L
    Zhang, L
    Lan, K
    Ren, CP
    Yao, KT
    [J]. ACTA BIOCHIMICA ET BIOPHYSICA SINICA, 1999, 31 (06) : 711 - 714
  • [10] Cancer Classification Ensemble System Based on Gene Expression Profiles
    Tarek, Sara
    Elwahab, Reda Abd
    Shoman, Mahmoud
    [J]. 2016 5TH INTERNATIONAL CONFERENCE ON ELECTRONIC DEVICES, SYSTEMS AND APPLICATIONS (ICEDSA), 2016,