Tumor classification by partial least squares using microarray gene expression data

Cited by: 574
Authors
Nguyen, DV
Rocke, DM [1]
Affiliations
[1] Univ Calif Davis, Dept Appl Sci, Davis, CA 95616 USA
[2] Univ Calif Davis, Ctr Image Proc & Integrated Comp, Davis, CA 95616 USA
DOI
10.1093/bioinformatics/18.1.39
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
Motivation: One important application of gene expression microarray data is the classification of samples into categories, such as the type of tumor. Microarrays allow simultaneous monitoring of thousands of gene expressions per sample. This ability to measure gene expression en masse has resulted in data with the number of variables p (genes) far exceeding the number of samples N. Standard statistical methodologies in classification and prediction do not work well, or even at all, when N < p. Modification of existing statistical methodologies or development of new methodologies is needed for the analysis of microarray data.
Results: We propose a novel analysis procedure for classifying (predicting) human tumor samples based on microarray gene expressions. This procedure involves dimension reduction using Partial Least Squares (PLS) and classification using Logistic Discrimination (LD) and Quadratic Discriminant Analysis (QDA). We compare PLS to the well-known dimension reduction method of Principal Components Analysis (PCA). Under many circumstances PLS proves superior; we illustrate a condition in which PCA particularly fails to predict well relative to PLS. The proposed methods were applied to five different microarray data sets involving various human tumor samples: (1) normal versus ovarian tumor; (2) Acute Myeloid Leukemia (AML) versus Acute Lymphoblastic Leukemia (ALL); (3) Diffuse Large B-cell Lymphoma (DLBCLL) versus B-cell Chronic Lymphocytic Leukemia (BCLL); (4) normal versus colon tumor; and (5) Non-Small-Cell-Lung-Carcinoma (NSCLC) versus renal samples. Stability of the classification results and methods was further assessed by re-randomization studies.
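The two-stage idea in the abstract (reduce p genes to a few PLS components, then classify on those components) can be illustrated with a minimal sketch. This is not the authors' procedure: it extracts only the first PLS component, uses simulated data rather than the tumor data sets, and fits the logistic model by plain gradient ascent. All function names and parameter values below are hypothetical and chosen for illustration only.

```python
import math
import random

def make_data(rng, n_per_class, p, n_informative, shift):
    """Simulate N = 2 * n_per_class samples with p genes (assumed toy data)."""
    X, y = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            row = [rng.gauss(0.0, 1.0) for _ in range(p)]
            if label == 1:  # only the first n_informative genes carry class signal
                for j in range(n_informative):
                    row[j] += shift
            X.append(row)
            y.append(label)
    return X, y

def first_pls_component(X, y):
    """Return gene means and the unit weight vector w of the first PLS component.

    w is proportional to Xc^T yc, the covariance of each centered gene
    with the centered class label."""
    n, p = len(X), len(X[0])
    means = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(y) / n
    w = [sum((X[i][j] - means[j]) * (y[i] - y_mean) for i in range(n))
         for j in range(p)]
    norm = math.sqrt(sum(v * v for v in w))
    return means, [v / norm for v in w]

def score(row, means, w):
    """Project one sample onto the PLS component."""
    return sum((x - m) * v for x, m, v in zip(row, means, w))

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def fit_logistic_1d(t, y, lr=0.1, steps=500):
    """Fit P(y = 1 | t) = sigmoid(a + b * t) by gradient ascent on the log-likelihood."""
    a = b = 0.0
    n = len(t)
    for _ in range(steps):
        grad_a = grad_b = 0.0
        for ti, yi in zip(t, y):
            resid = yi - sigmoid(a + b * ti)
            grad_a += resid
            grad_b += resid * ti
        a += lr * grad_a / n
        b += lr * grad_b / n
    return a, b

rng = random.Random(0)
# N = 20 training samples, p = 200 genes, so N < p as in the abstract.
X_train, y_train = make_data(rng, 10, 200, 20, 2.0)
X_test, y_test = make_data(rng, 10, 200, 20, 2.0)

means, w = first_pls_component(X_train, y_train)
t_train = [score(row, means, w) for row in X_train]
a, b = fit_logistic_1d(t_train, y_train)

pred = [1 if sigmoid(a + b * score(row, means, w)) >= 0.5 else 0 for row in X_test]
accuracy = sum(ph == yt for ph, yt in zip(pred, y_test)) / len(y_test)
print("held-out accuracy:", accuracy)
```

Because the weight vector is built from the covariance between genes and the class label, the component is supervised. A PCA component, by contrast, is chosen from gene variance alone and can miss class-related structure when the high-variance directions are unrelated to the label.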
Pages: 39-50
Page count: 12
Related papers
50 in total
  • [1] Multi-class tumor classification by discriminant partial least squares using microarray gene expression data and assessment of classification models
    Tan, YX
    Shi, LB
    Tong, WD
    Hwang, GTG
    Wang, C
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2004, 28 (03) : 235 - 244
  • [2] A Comparative Study of Two Multiple Classification Methods Based on Partial Least Squares Using Tumor Microarray Gene Expression Data
    Jin Zhichao
    Gao Qingbin
    He Jia
    [J]. COMPREHENSIVE EVALUATION OF ECONOMY AND SOCIETY WITH STATISTICAL SCIENCE, 2009, : 1212 - 1222
  • [3] Kernelized partial least squares for feature reduction and classification of gene microarray data
    Land, Walker H.
    Qiao, Xingye
    Margolis, Daniel E.
    Ford, William S.
    Paquette, Christopher T.
    Perez-Rogers, Joseph F.
    Borgia, Jeffrey A.
    Yang, Jack Y.
    Deng, Youping
    [J]. BMC SYSTEMS BIOLOGY, 2011, 5
  • [4] Partial least squares dimension reduction for microarray gene expression data with a censored response
    Nguyen, DV
    [J]. MATHEMATICAL BIOSCIENCES, 2005, 193 (01) : 119 - 137
  • [5] A Partial Least Squares Algorithm for Microarray Data Analysis Using the VIP Statistic for Gene Selection and Binary Classification
    Burguillo, Francisco J.
    Corchete, Luis A.
    Martin, Javier
    Barrera, Inmaculada
    Bardsley, William G.
    [J]. CURRENT BIOINFORMATICS, 2014, 9 (03) : 348 - 359
  • [6] Classification from microarray data using probabilistic discriminant partial least squares with reject option
    Botella, Cristina
    Ferre, Joan
    Boque, Ricard
    [J]. TALANTA, 2009, 80 (01) : 321 - 328
  • [7] Naive Bayes combined with partial least squares for classification of high dimensional microarray data
    Mehmood, Tahir
    Kanwal, Arzoo
    Butt, Muhammad Moeen
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2022, 222
  • [8] Gene selection for tumor classification using microarray gene expression data
    Yendrapalli, K.
    Basnet, R.
    Mukkamala, S.
    Sung, A. H.
    [J]. WORLD CONGRESS ON ENGINEERING 2007, VOLS 1 AND 2, 2007, : 290 - +
  • [9] Partial least squares based dimension reduction with gene selection for tumor classification
    Li, Guo-Zheng
    Zeng, Xue-Qiang
    Yang, Jack Y.
    Yang, Mary Qu
    [J]. PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, : 1439 - +
  • [10] MULTIVARIATE FUNCTIONAL PARTIAL LEAST SQUARES FOR CLASSIFICATION USING LONGITUDINAL DATA
    Dembowska, Sonia
    Frangi, Alex
    Houwing-Duistermaat, Jeanine
    Liu, Haiyan
    [J]. THEORETICAL BIOLOGY FORUM, 2021, 114 (01) : 75 - 88