Statistical analysis of high-density oligonucleotide arrays: a multiplicative noise model

Cited by: 66
Authors
Sásik, R
Calvo, E
Corbeil, J
Affiliations
[1] Univ Calif San Diego, Sch Med, La Jolla, CA 92093 USA
[2] CHU Laval, Res Ctr, Quebec City, PQ G1V 4G2, Canada
[3] Quebec Genome Ctr, Quebec City, PQ G1V 4G2, Canada
DOI
10.1093/bioinformatics/18.12.1633
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Motivation: High-density oligonucleotide arrays (GeneChip, Affymetrix, Santa Clara, CA) have become a standard research tool in many areas of biomedical research. They quantitatively monitor the expression of thousands of genes simultaneously by measuring fluorescence from gene-specific targets or probes. The relationship between signal intensities and transcript abundance, as well as normalization issues, have been the focus of much recent attention (Hill et al., 2001; Chudin et al., 2002; Naef et al., 2002a). Researchers need the best possible analytical tools to make the most of the information that this powerful technology has to offer. At present there are three analytical methods available: the newly released Affymetrix Microarray Suite 5.0 (AMS) software that accompanies the GeneChip product, the method of Li and Wong (LW; Li and Wong, 2001), and the method of Naef et al. (FN; Naef et al., 2001). The AMS method is tailored for analysis of a single microarray, and can therefore be used with any experimental design. The LW method, on the other hand, depends on a large number of microarrays in an experiment and cannot be used for an isolated microarray, and the FN method is particular to paired microarrays, such as those resulting from an experiment in which each 'treatment' sample has a corresponding 'control' sample. Our focus is on analysis of experiments in which there is a series of samples. In this case only the AMS method, the LW method, and the method described in this paper can be used. The present method is model-based, like the LW method, but assumes multiplicative rather than additive noise, and employs elimination of statistically significant outliers for improved results. Unlike LW and AMS, we do not assume a probe-specific background (measured by the so-called mismatch probes). Rather, we assume a uniform background, whose level is estimated using both the mismatch and perfect match probe intensities.
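To make the multiplicative-noise assumption concrete, the following is a minimal sketch and not the authors' algorithm. It assumes a hypothetical model in which each probe intensity equals a uniform background plus the product of a sample expression level and a probe affinity, scaled by log-normal noise. Under that assumption the noise becomes additive after background subtraction and a log transform, so a geometric mean over probes gives a simple expression estimate. The names `simulate_probe_set` and `estimate_expression`, and all parameter values, are illustrative assumptions.

```python
import math
import random


def simulate_probe_set(theta, phi, background, sigma, rng):
    """Simulate intensities under an ASSUMED multiplicative-noise model.

    intensity[i][j] = background + theta[i] * phi[j] * exp(eps),
    where eps ~ Normal(0, sigma). theta holds per-sample expression
    levels and phi holds per-probe affinities (both hypothetical).
    """
    return [
        [background + t * p * math.exp(rng.gauss(0.0, sigma)) for p in phi]
        for t in theta
    ]


def estimate_expression(intensity, background):
    """Estimate per-sample expression up to a common scale factor.

    After subtracting the uniform background, multiplicative noise is
    additive in log space, so the mean of the logs (a geometric mean)
    averages it out. Requires every intensity to exceed the background.
    """
    estimates = []
    for row in intensity:
        logs = [math.log(value - background) for value in row]
        estimates.append(math.exp(sum(logs) / len(logs)))
    return estimates
```

The estimates are defined only up to a common factor (the geometric mean of the probe affinities), so ratios between samples are the meaningful quantity. Because the estimate is an exponential of a mean of logs, it is always positive.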
Results: We present a new method for GeneChip analysis, based on a statistical model with multiplicative noise. We demonstrate that this method yields results superior to those obtained by the Affymetrix Microarray Suite 5.0 software and to those obtained by the model-based method of Li and Wong (Li and Wong, 2001). The present method eliminates the hard-to-interpret negative expression indices, and the binary 'presence' calls (present or absent) are replaced by the statistical significance (p-value) of gene expression. We have found that thresholding the p-values at the (0.1)^16 level (that is, 10^-16) produces about the same number of 'present' calls as the AMS software. By testing our method on a pair of replicate GeneChips (hybridized with the same cRNA), we found that 95.6% of data points lie within the 1.25-fold interval. In other words, our method had a 4.4% type I error rate at the 1.25-fold level. The error rate of the LW method was 15%, and that of the AMS method was 29%. There were no points outside the 2-fold interval with the present method. Analysis of variance (ANOVA) of another experiment with multiple replicates shows that this reduction of variance is not accompanied by a corresponding reduction of signal. On the contrary, the signal-to-noise ratio (as measured by the distribution of F-statistics) of the present method is on average 3.4 times better than that of AMS, and 1.4 times better than that of Li and Wong.
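The replicate-based error rate quoted above can be computed with a short calculation. The sketch below is an illustration rather than the authors' code: it assumes two lists of positive expression indices from replicate arrays, and counts the fraction of genes whose ratio falls outside the interval [1/fold, fold]. The function name `fraction_outside_fold` and the sample values are hypothetical.

```python
def fraction_outside_fold(rep_a, rep_b, fold):
    """Fraction of genes whose replicate ratio lies outside [1/fold, fold].

    rep_a and rep_b are equal-length sequences of positive expression
    indices from two replicate arrays. Since replicates should agree,
    this fraction is an empirical type I error rate at the given fold.
    """
    if fold <= 1.0:
        raise ValueError("fold must be greater than 1")
    if len(rep_a) != len(rep_b) or not rep_a:
        raise ValueError("replicates must be non-empty and equal in length")
    outside = 0
    for a, b in zip(rep_a, rep_b):
        if a <= 0 or b <= 0:
            raise ValueError("expression indices must be positive")
        ratio = a / b
        if ratio > fold or ratio < 1.0 / fold:
            outside += 1
    return outside / len(rep_a)


# Made-up example values, not data from the paper.
replicate_1 = [100.0, 200.0, 300.0, 400.0]
replicate_2 = [100.0, 260.0, 290.0, 300.0]
error_rate = fraction_outside_fold(replicate_1, replicate_2, 1.25)
```

The positivity check reflects the point made in the abstract: a ratio-based comparison is only well defined when expression indices are never negative.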
Pages: 1633-1640
Number of pages: 8