Tumor classification by partial least squares using microarray gene expression data

Cited by: 574
Authors
Nguyen, DV
Rocke, DM [1]
Affiliations
[1] Univ Calif Davis, Dept Appl Sci, Davis, CA 95616 USA
[2] Univ Calif Davis, Ctr Image Proc & Integrated Comp, Davis, CA 95616 USA
DOI
10.1093/bioinformatics/18.1.39
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Motivation: One important application of gene expression microarray data is classification of samples into categories, such as the type of tumor. The use of microarrays allows simultaneous monitoring of thousands of gene expressions per sample. This ability to measure gene expression en masse has resulted in data with the number of variables p (genes) far exceeding the number of samples N. Standard statistical methodologies in classification and prediction do not work well, or even at all, when N < p. Modification of existing statistical methodologies or development of new methodologies is needed for the analysis of microarray data.
Results: We propose a novel analysis procedure for classifying (predicting) human tumor samples based on microarray gene expressions. This procedure involves dimension reduction using Partial Least Squares (PLS) and classification using Logistic Discrimination (LD) and Quadratic Discriminant Analysis (QDA). We compare PLS to the well-known dimension reduction method of Principal Components Analysis (PCA). Under many circumstances PLS proves superior; we illustrate a condition when PCA particularly fails to predict well relative to PLS. The proposed methods were applied to five different microarray data sets involving various human tumor samples: (1) normal versus ovarian tumor; (2) Acute Myeloid Leukemia (AML) versus Acute Lymphoblastic Leukemia (ALL); (3) Diffuse Large B-cell Lymphoma (DLBCL) versus B-cell Chronic Lymphocytic Leukemia (BCLL); (4) normal versus colon tumor; and (5) Non-Small-Cell-Lung-Carcinoma (NSCLC) versus renal samples. Stability of classification results and methods was further assessed by re-randomization studies.
Pages: 39-50
Number of pages: 12
Related papers
50 in total
  • [21] Borrowing information from relevant microarray studies for sample classification using weighted partial least squares
    Huang, XH
    Pan, W
    Han, XQ
    Chen, YJ
    Miller, LW
    Hall, J
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2005, 29 (03) : 204 - 211
  • [22] Missing value estimation for DNA microarray gene expression data: local least squares imputation
    Kim, H
    Golub, GH
    Park, H
    [J]. BIOINFORMATICS, 2005, 21 (02) : 187 - 198
  • [23] Gene Expression Profile Analysis in Epilepsy by Using the Partial Least Squares Method
    Wang, Dong
    Song, Xixiao
    Wang, Yan
    Li, Xia
    Jia, Shanshan
    Wang, Zhijing
    [J]. SCIENTIFIC WORLD JOURNAL, 2014,
  • [24] Mapping microarray gene expression data into dissimilarity spaces for tumor classification
    Garcia, Vicente
    Sanchez, J. Salvador
    [J]. INFORMATION SCIENCES, 2015, 294 : 362 - 375
  • [25] Sparse Partial Least Squares Classification for High Dimensional Data
    Chung, Dongjun
    Keles, Sunduz
    [J]. STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2010, 9 (01)
  • [26] Optimization Based Tumor Classification from Microarray Gene Expression Data
    Dagliyan, Onur
    Uney-Yuksektepe, Fadime
    Kavakli, I. Halil
    Turkay, Metin
    [J]. PLOS ONE, 2011, 6 (02):
  • [27] Predicting survival from gene expression data by generalized partial least squares regression
    Storvold, HL
    Lingjaerde, OC
    [J]. BREAST CANCER RESEARCH, 2005, 7 (Suppl 2) : S52 - S52
  • [28] Predicting survival from gene expression data by generalized partial least squares regression
    HL Størvold
    OC Lingjærde
    [J]. Breast Cancer Research, 7
  • [29] Multi-class cancer classification via partial least squares with gene expression profiles
    Nguyen, DV
    Rocke, DM
    [J]. BIOINFORMATICS, 2002, 18 (09) : 1216 - 1226
  • [30] Feature selection and ranking of key genes for tumor classification: Using microarray gene expression data
    Mukkamala, Srinivas
    Liu, Qingzhong
    Veeraghattam, Rajeev
    Sung, Andrew H.
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING - ICAISC 2006, PROCEEDINGS, 2006, 4029 : 951 - 961