Systematic Evaluation of Scaling Methods for Gene Expression Data

被引:2
|
作者
Pandey, Gaurav [1 ]
Ramakrishnan, Lakshmi Naarayanan [1 ]
Steinbach, Michael [1 ]
Kumar, Vipin [1 ]
机构
[1] Univ Minnesota, Dept Comp Sci & Engn, St Paul, MN USA
关键词
D O I
10.1109/BIBM.2008.33
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Even after an experimentally prepared gene expression data set has been pre-processed to account for variations in the microarray technology, there may be inconsistencies between the scales of measurements in different conditions. This may happen for reasons such as the accumulation of gene expression data prepared by different laboratories into a single data set. A variety of scaling and transformation methods have been used for addressing these scale inconsistencies in different studies on the analysis of gene expression data sets. However; a quantitative estimation of their relative performance has been lacking. In this paper, we report an extensive evaluation of scaling and transformation methods for their effectiveness with respect to the important problem of protein function prediction. We consider several such commonly, used methods for gene expression data, such as z-score scaling, quantile normalization, diff transformation, and two new scaling methods, sigmoid and double sigmoid, that have not been used previously in this domain to the best of our knowledge. We show that the performance of these methods can vary significantly across data sets, but Dsigmoid scaling and z-score transformation generally perform well for the two types of gene expression data, namely temporal and non-temporal, respectively.
引用
收藏
页码:376 / 381
页数:6
相关论文
共 50 条
  • [11] Feature selection methods on gene expression microarray data for cancer classification: A systematic review
    Alhenawi, Esra'a
    Al-Sayyed, Rizik
    Hudaib, Amjad
    Mirjalili, Seyedali
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 140
  • [12] Comparison and evaluation of pathway-level aggregation methods of gene expression data
    Hwang, Seungwoo
    [J]. BMC GENOMICS, 2012, 13
  • [13] Comparison and evaluation of pathway-level aggregation methods of gene expression data
    Seungwoo Hwang
    [J]. BMC Genomics, 13
  • [14] Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data
    Kim, Mi Hyeon
    Seo, Hwa Jeong
    Joung, Je-Gun
    Kim, Ju Han
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [15] Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data
    Franck Rapaport
    Raya Khanin
    Yupu Liang
    Mono Pirun
    Azra Krek
    Paul Zumbo
    Christopher E Mason
    Nicholas D Socci
    Doron Betel
    [J]. Genome Biology, 14
  • [16] Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data
    Mi Hyeon Kim
    Hwa Jeong Seo
    Je-Gun Joung
    Ju Han Kim
    [J]. BMC Bioinformatics, 12
  • [17] Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data
    Rapaport, Franck
    Khanin, Raya
    Liang, Yupu
    Pirun, Mono
    Krek, Azra
    Zumbo, Paul
    Mason, Christopher E.
    Socci, Nicholas D.
    Betel, Doron
    [J]. GENOME BIOLOGY, 2013, 14 (09):
  • [18] Methods and approaches in the analysis of gene expression data
    Dopazo, J
    Zanders, E
    Dragoni, I
    Amphlett, C
    Falciani, F
    [J]. JOURNAL OF IMMUNOLOGICAL METHODS, 2001, 250 (1-2) : 93 - 112
  • [19] Clustering methods for microarray gene expression data
    Belacel, Nabil
    Wang, Qian
    Cuperlovic-Culf, Miroslava
    [J]. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2006, 10 (04) : 507 - 531
  • [20] Gene expression data modeling and validation of gene selection methods
    Ruffino, Francesca
    [J]. BIOLOGICAL AND ARTIFICIAL INTELLIGENCE ENVIRONMENTS, 2005, : 73 - 79