Reproducibility of microarray data: a further analysis of microarray quality control (MAQC) data

被引:94
|
作者
Chen, James J. [1 ]
Hsueh, Huey-Miin [2 ]
Delongchamp, Robert R. [3 ]
Lin, Chien-Ju [1 ]
Tsai, Chen-An [4 ,5 ]
机构
[1] US FDA, Natl Ctr Toxicol Res, Div Personalized Nutr & Med, Jefferson, AR 72079 USA
[2] Natl Chengchi Univ, Dept Stat, Taipei 11623, Taiwan
[3] Univ Arkansas Med Sci, Dept Epidemiol, Little Rock, AR 72205 USA
[4] China Med Univ, Dept Publ Hlth, Taichung, Taiwan
[5] China Med Univ, Ctr Biostat, Taichung, Taiwan
关键词
D O I
10.1186/1471-2105-8-412
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Many researchers are concerned with the comparability and reliability of microarray gene expression data. Recent completion of the MicroArray Quality Control (MAQC) project provides a unique opportunity to assess reproducibility across multiple sites and the comparability across multiple platforms. The MAQC analysis presented for the conclusion of inter- and intra-platform comparability/reproducibility of microarray gene expression measurements is inadequate. We evaluate the reproducibility/comparability of the MAQC data for 12901 common genes in four titration samples generated from five high-density one-color microarray platforms and the TaqMan technology. We discuss some of the problems with the use of correlation coefficient as metric to evaluate the inter- and intra-platform reproducibility and the percent of overlapping genes (POG) as a measure for evaluation of a gene selection procedure by MAQC. Results: A total of 293 arrays were used in the intra- and inter-platform analysis. A hierarchical cluster analysis shows distinct differences in the measured intensities among the five platforms. A number of genes show a small fold-change in one platform and a large fold-change in another platform, even though the correlations between platforms are high. An analysis of variance shows thirty percent of gene expressions of the samples show inconsistent patterns across the five platforms. We illustrated that POG does not reflect the accuracy of a selected gene list. A non-overlapping gene can be truly differentially expressed with a stringent cut, and an overlapping gene can be non-differentially expressed with non-stringent cutoff. In addition, POG is an unusable selection criterion. POG can increase or decrease irregularly as cutoff changes; there is no criterion to determine a cutoff so that POG is optimized. Conclusion: Using various statistical methods we demonstrate that there are differences in the intensities measured by different platforms and different sites within platform. Within each platform, the patterns of expression are generally consistent, but there is site-by-site variability. Evaluation of data analysis methods for use in regulatory decision should take no treatment effect into consideration, when there is no treatment effect, "a fold-change cutoff with a non-stringent p-value cutoff" could result in 100% false positive error selection.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 50 条
  • [1] Reproducibility of microarray data: a further analysis of microarray quality control (MAQC) data
    James J Chen
    Huey-Miin Hsueh
    Robert R Delongchamp
    Chien-Ju Lin
    Chen-An Tsai
    [J]. BMC Bioinformatics, 8
  • [2] MAQC: The microarray quality control project
    Shi, Leming
    [J]. MOLECULAR & CELLULAR TOXICOLOGY, 2007, 3 (04) : 32 - 32
  • [3] The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements
    Shi, Leming
    Reid, Laura H.
    Jones, Wendell D.
    Shippy, Richard
    Warrington, Janet A.
    Baker, Shawn C.
    Collins, Patrick J.
    de Longueville, Francoise
    Kawasaki, Ernest S.
    Lee, Kathleen Y.
    Luo, Yuling
    Sun, Yongming Andrew
    Willey, James C.
    Setterquist, Robert A.
    Fischer, Gavin M.
    Tong, Weida
    Dragan, Yvonne P.
    Dix, David J.
    Frueh, Felix W.
    Goodsaid, Federico M.
    Herman, Damir
    Jensen, Roderick V.
    Johnson, Charles D.
    Lobenhofer, Edward K.
    Puri, Raj K.
    Scherf, Uwe
    Thierry-Mieg, Jean
    Wang, Charles
    Wilson, Mike
    Wolber, Paul K.
    Zhang, Lu
    Amur, Shashi
    Bao, Wenjun
    Barbacioru, Catalin C.
    Lucas, Anne Bergstrom
    Bertholet, Vincent
    Boysen, Cecilie
    Bromley, Bud
    Brown, Donna
    Brunner, Alan
    Canales, Roger
    Cao, Xiaoxi Megan
    Cebula, Thomas A.
    Chen, James J.
    Cheng, Jing
    Chu, Tzu-Ming
    Chudin, Eugene
    Corson, John
    Corton, J. Christopher
    Croner, Lisa J.
    [J]. NATURE BIOTECHNOLOGY, 2006, 24 (09) : 1151 - 1161
  • [5] Quick and simple: quality control of microarray data
    Sauer, U
    Preininger, C
    Hany-Schmatzberger, R
    [J]. BIOINFORMATICS, 2005, 21 (08) : 1572 - 1578
  • [6] The analysis of microarray data
    Hariharan, R
    [J]. PHARMACOGENOMICS, 2003, 4 (04) : 477 - 497
  • [7] Control of selection bias in microarray data analysis
    Furlanello, C
    Serafini, M
    Merler, S
    Jurman, G
    [J]. MINERVA BIOTECNOLOGICA, 2003, 15 (04) : 217 - 222
  • [8] Control analysis of DNA microarray expression data
    Curtis, RK
    Brand, MD
    [J]. MOLECULAR BIOLOGY REPORTS, 2002, 29 (1-2) : 67 - 71
  • [9] Control Analysis of DNA Microarray Expression Data
    R. Keira Curtis
    Martin D. Brand
    [J]. Molecular Biology Reports, 2002, 29 : 67 - 71
  • [10] The MicroArray Quality Control (MAQC) Project: Enabling Personalized Medicine with Genomics and Bioinformatics
    Shi, Leming
    [J]. MOLECULAR & CELLULAR TOXICOLOGY, 2009, 5 (03) : 41 - 41