Microarray data analysis: a practical approach for selecting differentially expressed genes

被引:0
|
作者
Mutch, David M. [1 ]
Berger, Alvin [1 ]
Mansourian, Robert [1 ]
Rytz, Andreas [1 ]
Roberts, Matthew-Alan [1 ]
机构
[1] Nestle Res Ctr, CH-1000 Lausanne 26, Switzerland
来源
GENOME BIOLOGY | 2001年 / 2卷 / 12期
关键词
D O I
暂无
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background The biomedical community is rapidly developing new methods of data analysis for microarray experiments, with the goal of establishing new standards to objectively process the massive datasets produced from functional genomic experiments. Each microarray experiment measures thousands of genes simultaneously producing an unprecedented amount of biological information across increasingly numerous experiments; however, in general, only a very small percentage of the genes present on any given array are identified as differentially regulated. The challenge then is to process this information objectively and efficiently in order to obtain knowledge of the biological system under study and by which to compare information gained across multiple experiments. In this context, systematic and objective mathematical approaches, which are simple to apply across a large number of experimental designs, become fundamental to correctly handle the mass of data and to understand the true complexity of the biological systems under study. Results The present report develops a method of extracting differentially expressed genes across any number of experimental samples by first evaluating the maximum fold change (FC) across all experimental parameters and across the entire range of absolute expression levels. The model developed works by first evaluating the FC across the entire range of absolute expression levels in any number of experimental conditions. The selection of those genes within the top X% of highest FCs observed within absolute expression bins was evaluated both with and without the use of replicates. Lastly, the FC model was validated by both real time polymerase chain reaction (RT-PCR) and variance data. Semi-quantitative RT-PCR analysis demonstrated 73% concordance with the microarray data from Mu11K Affymetrix GeneChips. Furthermore, 94.1% of those genes selected by the 5% FC model were found to lie above measurement variability using a SDwithin confidence level of 99.9%. Conclusion As evidenced by the high rate of validation, the FC model has the potential to minimize the number of required replicates in expensive microarray experiments by extracting information on gene expression patterns (e.g. characterizing biological and/or measurement variance) within an experiment. The simplicity of the overall process allows the analyst to easily select model limits which best describe the data. The genes selected by this process can be compared between experiments and are shown to objectively extract information which is biologically & statistically significant.
引用
收藏
页数:29
相关论文
共 50 条
  • [21] Mixture distribution approach for identifying differentially expressed genes in microarray data of Arabidopsis thaliana
    Anjum, Arfa
    Jaggi, Seema
    Varghese, Eldho
    Lall, Shwetank
    Rai, Anil
    Bhowmik, Arpan
    Mishra, Dwijesh Chandra
    Sarika
    [J]. INDIAN JOURNAL OF AGRICULTURAL SCIENCES, 2020, 90 (10): : 139 - 143
  • [22] SPRING: A METHOD FOR IDENTIFYING DIFFERENTIALLY EXPRESSED GENES IN MICROARRAY DATA
    Tian, Yuan
    Liu, Guixia
    Wu, Chunguo
    Rong, Guang
    Sun, An
    [J]. BIOTECHNOLOGY & BIOTECHNOLOGICAL EQUIPMENT, 2013, 27 (05) : 4150 - 4156
  • [23] Adaptive thresholds to detect differentially expressed genes in microarray data
    Fukuoka, Yutaka
    Inaoka, Hidenori
    Noshiro, Makoto
    [J]. BIOINFORMATION, 2011, 7 (01) : 33 - 37
  • [24] Identifying Differentially Expressed Genes in Time Course Microarray Data
    Ma, Ping
    Zhong, Wenxuan
    Liu, Jun S.
    [J]. STATISTICS IN BIOSCIENCES, 2009, 1 (02) : 144 - 159
  • [25] MicroArray data simulator for improved selection of differentially expressed genes
    Singhal, S
    Kyvernitis, CG
    Johnson, SW
    Kaiser, LR
    Liebman, MN
    Albelda, SM
    [J]. CANCER BIOLOGY & THERAPY, 2003, 2 (04) : 383 - 391
  • [26] Identifying Differentially Expressed Genes in Time Course Microarray Data
    Ping Ma
    Wenxuan Zhong
    Jun S. Liu
    [J]. Statistics in Biosciences, 2009, 1 (2) : 144 - 159
  • [27] A Comparison of Algorithms to Find Differentially Expressed Genes in Microarray Data
    Ultsch, Alfred
    Pallasch, Christian
    Bergmann, Eckhard
    Christiansen, Holger
    [J]. ADVANCES IN DATA ANALYSIS, DATA HANDLING AND BUSINESS INTELLIGENCE, 2010, : 685 - +
  • [28] Nonparametric methods for identifying differentially expressed genes in microarray data
    Troyanskaya, OG
    Garber, ME
    Brown, PO
    Botstein, D
    Altman, RB
    [J]. BIOINFORMATICS, 2002, 18 (11) : 1454 - 1461
  • [29] Microarray data analysis to identify differentially expressed genes and biological pathways associated with asthma
    Qi, Shanshan
    Liu, Guanghui
    Dong, Xiang
    Huang, Nan
    Li, Wenjing
    Chen, Hao
    [J]. EXPERIMENTAL AND THERAPEUTIC MEDICINE, 2018, 16 (03) : 1613 - 1620
  • [30] Testing for differentially-expressed genes by maximum-likelihood analysis of microarray data
    Ideker, T
    Thorsson, V
    Siegel, AF
    Hood, LE
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (06) : 805 - 817