Significance analysis of lexical bias in microarray data

被引:23
|
作者
Kim, CC [1 ]
Falkow, S [1 ]
机构
[1] Stanford Univ, Med Ctr, Stanford, CA 94305 USA
关键词
D O I
10.1186/1471-2105-4-12
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Genes that are determined to be significantly differentially regulated in microarray analyses often appear to have functional commonalities, such as being components of the same biochemical pathway. This results in certain words being under- or overrepresented in the list of genes. Distinguishing between biologically meaningful trends and artifacts of annotation and analysis procedures is of the utmost importance, as only true biological trends are of interest for further experimentation. A number of sophisticated methods for identification of significant lexical trends are currently available, but these methods are generally too cumbersome for practical use by most microarray users. Results: We have developed a tool, LACK, for calculating the statistical significance of apparent lexical bias in microarray datasets. The frequency of a user-specified list of search terms in a list of genes which are differentially regulated is assessed for statistical significance by comparison to randomly generated datasets. The simplicity of the input files and user interface targets the average microarray user who wishes to have a statistical measure of apparent lexical trends in analyzed datasets without the need for bioinformatics skills. The software is available as Perl source or a Windows executable. Conclusion: We have used LACK in our laboratory to generate biological hypotheses based on our microarray data. We demonstrate the program's utility using an example in which we confirm significant upregulation of SP1-2 pathogenicity island of Salmonella enterica serovar Typhimurium by the cation chelator dipyridyl.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] From microarray data to results - Workshop on genomic approaches to microarray data analysis
    Schlitt, T
    Kemmeren, P
    EMBO REPORTS, 2004, 5 (05) : 459 - 463
  • [42] RNA amplification results in reproducible microarray data with slight ratio bias
    Puskás, LG
    Zvara, A
    Hackler, L
    Van Hummelen, P
    BIOTECHNIQUES, 2002, 32 (06) : 1330 - +
  • [43] Normalization of dye bias in microarray data using the mixture of splines model
    Joo, Yongsung
    Casella, George
    Booth, James
    Lee, Keunbaik
    Enkemann, Steven
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2007, 6
  • [44] Chipping away at the chip bias: RNA degradation in microarray analysis
    Herbert Auer
    Sandya Lyianarachchi
    David Newsom
    Marko I Klisovic
    uido Marcucci
    Karl Kornacker
    Nature Genetics, 2003, 35 : 292 - 293
  • [45] Chipping away at the chip bias: RNA degradation in microarray analysis
    Auer, H
    Lyianarachchi, S
    Newsom, D
    Klisovic, MI
    Marcucci, U
    Kornacker, K
    NATURE GENETICS, 2003, 35 (04) : 292 - 293
  • [46] Comparison of different microarray data analysis programs and description of a database for microarray data management
    Xu, LZ
    Maresh, GA
    Giardina, J
    Pincus, SH
    DNA AND CELL BIOLOGY, 2004, 23 (10) : 643 - 651
  • [47] Fuzzy clustering analysis of microarray data
    Han, Lixin
    Zeng, Xiaoqin
    Yan, Hong
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART H-JOURNAL OF ENGINEERING IN MEDICINE, 2008, 222 (H7) : 1143 - 1148
  • [48] Analysis of microarray gene expression data
    Pham, Tuan D.
    Wells, Christine
    Crane, Denis I.
    CURRENT BIOINFORMATICS, 2006, 1 (01) : 37 - 53
  • [49] Correspondence analysis applied to microarray data
    Fellenberg, K
    Hauser, NC
    Brors, B
    Neutzner, A
    Hoheisel, JD
    Vingron, M
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (19) : 10781 - 10786
  • [50] Methods of microarray data analysis IV
    Shoemaker, JS
    Lin, SM
    METHODS OF MICROARRAY DATA ANALYSIS IV, 2005, : 1 - 8