Comparing transformation methods for DNA microarray data

被引:9
|
作者
Thygesen, HH [1 ]
Zwinderman, AH [1 ]
机构
[1] Univ Amsterdam, Acad Med Centrum, NL-1100 DD Amsterdam, Netherlands
关键词
Reference Signal; Transformation Method; Variance Ratio; Baseline Shift; Simple Permutation;
D O I
10.1186/1471-2105-5-77
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: When DNA microarray data are used for gene clustering, genotype/phenotype correlation studies, or tissue classification the signal intensities are usually transformed and normalized in several steps in order to improve comparability and signal/noise ratio. These steps may include subtraction of an estimated background signal, subtracting the reference signal, smoothing ( to account for nonlinear measurement effects), and more. Different authors use different approaches, and it is generally not clear to users which method they should prefer. Results: We used the ratio between biological variance and measurement variance (which is an F-like statistic) as a quality measure for transformation methods, and we demonstrate a method for maximizing that variance ratio on real data. We explore a number of transformations issues, including Box-Cox transformation, baseline shift, partial subtraction of the log-reference signal and smoothing. It appears that the optimal choice of parameters for the transformation methods depends on the data. Further, the behavior of the variance ratio, under the null hypothesis of zero biological variance, appears to depend on the choice of parameters. Conclusions: The use of replicates in microarray experiments is important. Adjustment for the null-hypothesis behavior of the variance ratio is critical to the selection of transformation method.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Support Feature Machine for DNA Microarray Data
    Maszczyk, Tomasz
    Duch, Wlodzislaw
    ROUGH SETS AND CURRENT TRENDS IN COMPUTING, PROCEEDINGS, 2010, 6086 : 178 - 186
  • [42] Classification of DNA microarray data with random forests
    Stokowy T.
    Advances in Intelligent and Soft Computing, 2010, 69 : 305 - 308
  • [43] Metric learning for DNA microarray data analysis
    Takeuchi, Ichiro
    Nakagawa, Masao
    Seto, Masao
    INTERNATIONAL WORKSHOP ON STATISTICAL-MECHANICAL INFORMATICS 2009 (IW-SMI 2009), 2009, 197
  • [44] Estimating the number of clusters in DNA microarray data
    Bolshakova, N
    Azuaje, F
    METHODS OF INFORMATION IN MEDICINE, 2006, 45 (02) : 153 - 157
  • [45] An Enumerative Biclustering Algorithm for DNA microarray Data
    Haifa, Haifa Ben Saber
    Elloumi, Mourad
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2015, : 132 - 138
  • [46] Computational approaches to analysis of DNA microarray data
    Quackenbush, J.
    METHODS OF INFORMATION IN MEDICINE, 2006, 45 : 91 - 103
  • [47] Folate system correlations in DNA microarray data
    Radivoyevitch, T
    BMC CANCER, 2005, 5 (1)
  • [48] Fuzzy Biclustering for DNA Microarray Data Analysis
    Han, Lixin
    Yan, Hong
    2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 1134 - +
  • [49] Control analysis of DNA microarray expression data
    Curtis, RK
    Brand, MD
    MOLECULAR BIOLOGY REPORTS, 2002, 29 (1-2) : 67 - 71
  • [50] Identification of significant features in DNA microarray data
    Bair, Eric
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2013, 5 (04): : 309 - 325