Mapping microarray gene expression data into dissimilarity spaces for tumor classification

被引:31
|
作者
Garcia, Vicente [1 ]
Sanchez, J. Salvador [2 ]
机构
[1] Univ Autonoma Ciudad Juarez, Inst Ingn & Tecnol, Dept Elect & Comp Engn, Ciudad Juarez 32310, Chihuahua, Mexico
[2] Univ Jaume 1, Inst New Imaging Technol, Dept Comp Languages & Syst, Castellon de La Plana 12071, Spain
关键词
Gene expression; Dissimilarity space; Feature selection; Classification; VECTOR MACHINE CLASSIFICATION; COMPUTATIONAL INTELLIGENCE; CANCER CLASSIFICATION; FEATURE-SELECTION; SAMPLE-SIZE; VALIDATION; PREDICTION; TESTS;
D O I
10.1016/j.ins.2014.09.064
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Microarray gene expression data sets usually contain a large number of genes, but a small number of samples. In this article, we present a two-stage classification model by combining feature selection with the dissimilarity-based representation paradigm. In the preprocessing stage, the ReliefF algorithm is used to generate a subset with a number of top-ranked genes; in the learning/classification stage, the samples represented by the previously selected genes are mapped into a dissimilarity space, which is then used to construct a classifier capable of separating the classes more easily than a feature-based model. The ultimate aim of this paper is not to find the best subset of genes, but to analyze the performance of the dissimilarity-based models by means of a comprehensive collection of experiments for the classification of microarray gene expression data. To this end, we compare the classification results of an artificial neural network, a support vector machine and the Fisher's linear discriminant classifier built on the feature (gene) space with those on the dissimilarity space when varying the number of genes selected by ReliefF, using eight different microarray databases. The results show that the dissimilarity-based classifiers systematically outperform the feature-based models. In addition, classification through the proposed representation appears to be more robust (i.e. less sensitive to the number of genes) than that with the conventional feature-based representation. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:362 / 375
页数:14
相关论文
共 50 条
  • [1] Recursive partitioning for tumor classification with gene expression microarray data
    Zhang, HP
    Yu, CY
    Singer, B
    Xiong, MM
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (12) : 6730 - 6735
  • [2] Optimization Based Tumor Classification from Microarray Gene Expression Data
    Dagliyan, Onur
    Uney-Yuksektepe, Fadime
    Kavakli, I. Halil
    Turkay, Metin
    [J]. PLOS ONE, 2011, 6 (02):
  • [3] Gene selection for tumor classification using microarray gone expression data
    Yendrapalli, K.
    Basnet, R.
    Mukkamala, S.
    Sung, A. H.
    [J]. WORLD CONGRESS ON ENGINEERING 2007, VOLS 1 AND 2, 2007, : 290 - +
  • [4] A Comparison on Score Spaces for Expression Microarray Data Classification
    Perina, Alessandro
    Lovato, Pietro
    Cristani, Marco
    Bicego, Manuele
    [J]. PATTERN RECOGNITION IN BIOINFORMATICS, 2011, 7036 : 202 - +
  • [5] Tumor classification by partial least squares using microarray gene expression data
    Nguyen, DV
    Rocke, DM
    [J]. BIOINFORMATICS, 2002, 18 (01) : 39 - 50
  • [6] On the classification of microarray gene-expression data
    Basford, Kaye E.
    McLachlan, Geoffrey J.
    Rathnayake, Suren I.
    [J]. BRIEFINGS IN BIOINFORMATICS, 2013, 14 (04) : 402 - 410
  • [7] Classification of Microarray Gene Expression Data using Associative Classification
    Alagukumar, S.
    Lawrance, R.
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTING TECHNOLOGIES AND INTELLIGENT DATA ENGINEERING (ICCTIDE'16), 2016,
  • [8] Classification of normal and tumor tissues using geometric representation of gene expression microarray data
    Kim, Saejoon
    Shin, Donghyuk
    [J]. MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4617 : 393 - +
  • [9] A STUDY ON GENE SELECTION AND CLASSIFICATION ALGORITHMS FOR CLASSIFICATION OF MICROARRAY GENE EXPRESSION DATA
    Chin, Yeo Lee
    Deris, Safaai
    [J]. JURNAL TEKNOLOGI, 2005, 43
  • [10] An efficient approach for classification of gene expression microarray data
    Sreepada, Rama Syamala
    Vipsita, Swati
    Mohapatra, Puspanjali
    [J]. 2014 FOURTH INTERNATIONAL CONFERENCE OF EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT), 2014, : 344 - 348