Mapping microarray gene expression data into dissimilarity spaces for tumor classification

被引:31
|
作者
Garcia, Vicente [1 ]
Sanchez, J. Salvador [2 ]
机构
[1] Univ Autonoma Ciudad Juarez, Inst Ingn & Tecnol, Dept Elect & Comp Engn, Ciudad Juarez 32310, Chihuahua, Mexico
[2] Univ Jaume 1, Inst New Imaging Technol, Dept Comp Languages & Syst, Castellon de La Plana 12071, Spain
关键词
Gene expression; Dissimilarity space; Feature selection; Classification; VECTOR MACHINE CLASSIFICATION; COMPUTATIONAL INTELLIGENCE; CANCER CLASSIFICATION; FEATURE-SELECTION; SAMPLE-SIZE; VALIDATION; PREDICTION; TESTS;
D O I
10.1016/j.ins.2014.09.064
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Microarray gene expression data sets usually contain a large number of genes, but a small number of samples. In this article, we present a two-stage classification model by combining feature selection with the dissimilarity-based representation paradigm. In the preprocessing stage, the ReliefF algorithm is used to generate a subset with a number of top-ranked genes; in the learning/classification stage, the samples represented by the previously selected genes are mapped into a dissimilarity space, which is then used to construct a classifier capable of separating the classes more easily than a feature-based model. The ultimate aim of this paper is not to find the best subset of genes, but to analyze the performance of the dissimilarity-based models by means of a comprehensive collection of experiments for the classification of microarray gene expression data. To this end, we compare the classification results of an artificial neural network, a support vector machine and the Fisher's linear discriminant classifier built on the feature (gene) space with those on the dissimilarity space when varying the number of genes selected by ReliefF, using eight different microarray databases. The results show that the dissimilarity-based classifiers systematically outperform the feature-based models. In addition, classification through the proposed representation appears to be more robust (i.e. less sensitive to the number of genes) than that with the conventional feature-based representation. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:362 / 375
页数:14
相关论文
共 50 条
  • [21] A graph-theoretic technique for classification of normal and tumor tissues using gene expression microarray data
    Kim, Saejoon
    [J]. 2007 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-16, 2007, : 4621 - 4624
  • [22] A New Multi-objective Hybrid Gene Selection Algorithm for Tumor Classification Based on Microarray Gene Expression Data
    Li, Min
    Wu, Bangyu
    Deng, Shaobo
    Lou, Mingzhu
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2023, 22 (04)
  • [23] Tumor classification based on gene microarray data and hybrid learning method
    Liu, J
    Zhou, HB
    [J]. 2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 2275 - 2280
  • [24] Collaborative representation-based classification of microarray gene expression data
    Shen, Lizhen
    Jiang, Hua
    He, Mingfang
    Liu, Guoqing
    [J]. PLOS ONE, 2017, 12 (12):
  • [25] Classification of breast cancer using microarray gene expression data: A survey
    Abd-Elnaby, Muhammed
    Alfonse, Marco
    Roushdy, Mohamed
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 117
  • [26] Transductive Support Vector Machines for classification of microarray gene expression data
    Semolini, R
    Von Zuben, FJ
    [J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 2946 - 2951
  • [27] MIDClass: Microarray Data Classification by Association Rules and Gene Expression Intervals
    Giugno, Rosalba
    Pulvirenti, Alfredo
    Cascione, Luciano
    Pigola, Giuseppe
    Ferro, Alfredo
    [J]. PLOS ONE, 2013, 8 (08):
  • [28] Cancer Classification by Sparse Representation using Microarray Gene Expression Data
    Hang, Xiyi
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS, PROCEEDINGS, 2008, : 174 - 177
  • [29] Informative gene discovery for cancer classification from microarray expression data
    Ng, M
    Chan, LW
    [J]. 2005 IEEE WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2005, : 393 - 398
  • [30] A graph-theoretic classification of gene expression microarray data of cancer
    Kim, Saejoon
    [J]. PROCEEDINGS OF THE FRONTIERS IN THE CONVERGENCE OF BIOSCIENCE AND INFORMATION TECHNOLOGIES, 2007, : 179 - 182