Simultaneous Class Discovery and Classification of Microarray Data Using Spectral Analysis

被引:7
|
作者
Qiu, Peng [1 ]
Plevritis, Sylvia K. [1 ]
机构
[1] Stanford Univ, Dept Radiol, Stanford, CA 94305 USA
关键词
algorithms; computational molecular biology; machine learning; GENE-EXPRESSION DATA; NETWORK ANALYSIS; DNA ARRAYS; CANCER; PREDICTION;
D O I
10.1089/cmb.2008.0227
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Classification methods are commonly divided into two categories: unsupervised and supervised. Unsupervised methods have the ability to discover new classes by grouping data into clusters or tree structures without using the class labels, but they carry the risk of producing noninterpretable results. On the other hand, supervised methods always find decision rules that discriminate samples with different class labels. However, the class label information plays such an important role that it confines supervised methods by defining the possible classes. Consequently, supervised methods do not have the ability to discover new classes. To overcome the limitations of unsupervised and supervised methods, we propose a new method, which utilizes the class labels to a less important role so as to perform class discovery and classification simultaneously. The proposed method is called SPACC (SPectral Analysis for Class discovery and Classification). In SPACC, the training samples are nodes of an undirected weighted network. Using spectral analysis, SPACC iteratively partitions the network into a top-down binary tree. Each partitioning step is unsupervised, and the class labels are only used to define the stopping criterion. When the partitioning ends, the training samples have been divided into several subsets, each corresponding to one class label. Because multiple subsets can correspond to the same class label, SPACC may identify biologically meaningful subclasses, and minimize the impact of outliers and mislabeled data. We demonstrate the effectiveness of SPACC for class discovery and classification on microarray data of lymphomas and leukemias. SPACC software is available at http://icbp.stanford.edu/software/SPACC/.
引用
收藏
页码:935 / 944
页数:10
相关论文
共 50 条
  • [1] Spectral Methods for Cancer Classification using Microarray Data
    Kim, Saejoon
    [J]. INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL SCIENCES AND OPTIMIZATION, VOL 1, PROCEEDINGS, 2009, : 588 - 592
  • [2] Simultaneous classification and feature clustering using discriminant vector quantization with applications to microarray data analysis
    Li, J
    Zha, HY
    [J]. CSB2002: IEEE COMPUTER SOCIETY BIOINFORMATICS CONFERENCE, 2002, : 246 - 255
  • [3] Semantic Subgroup Discovery: Using Ontologies in Microarray Data Analysis
    Lavrac, Nada
    Novak, Petra Kralj
    Mozetic, Igor
    Podpecan, Vid
    Motaln, Helena
    Petek, Marko
    Gruden, Kristina
    [J]. 2009 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-20, 2009, : 5613 - +
  • [4] Microarray-Based Class Discovery for Molecular Classification of Breast Cancer: Analysis of Interobserver Agreement
    Mackay, Alan
    Weigelt, Britta
    Grigoriadis, Anita
    Kreike, Bas
    Natrajan, Rachael
    A'Hern, Roger
    Tan, David S. P.
    Dowsett, Mitch
    Ashworth, Alan
    Reis-Filho, Jorge S.
    [J]. JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2011, 103 (08) : 662 - 673
  • [5] An Adaptive Classification Model for Microarray Analysis using Big Data
    Jenifer, X. R.
    Lawrance, R.
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTING TECHNOLOGIES AND INTELLIGENT DATA ENGINEERING (ICCTIDE'16), 2016,
  • [6] Microarray Data Classification Using the Spectral-Feature-Based TLS Ensemble Algorithm
    Sun, Zhan-Li
    Wang, Han
    Lau, Wai-Shing
    Seet, Gerald
    Wang, Danwei
    Lam, Kin-Man
    [J]. IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2014, 13 (03) : 289 - 299
  • [7] Classification of Multi-class Microarray Cancer Data Using Ensemble Learning Method
    Shekar, B. H.
    Dagnew, Guesh
    [J]. DATA ANALYTICS AND LEARNING, 2019, 43 : 279 - 292
  • [8] Classification and diagnostic prediction of cancers using gene microarray data analysis
    Osareh, Alireza
    Shadgar, Bita
    [J]. Journal of Applied Sciences, 2009, 9 (03) : 459 - 468
  • [9] Class Specific Gene Expression Estimation and Classification in Microarray Data
    Islam, Atiq
    Iftekharuddin, Khan M.
    George, E. Olusegun
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 1678 - +
  • [10] CONSTRAINED LATENT CLASS ANALYSIS - SIMULTANEOUS CLASSIFICATION AND SCALING OF DISCRETE CHOICE DATA
    BOCKENHOLT, U
    BOCKENHOLT, I
    [J]. PSYCHOMETRIKA, 1991, 56 (04) : 699 - 716