Significance analysis of lexical bias in microarray data

被引:23
|
作者
Kim, CC [1 ]
Falkow, S [1 ]
机构
[1] Stanford Univ, Med Ctr, Stanford, CA 94305 USA
关键词
D O I
10.1186/1471-2105-4-12
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Genes that are determined to be significantly differentially regulated in microarray analyses often appear to have functional commonalities, such as being components of the same biochemical pathway. This results in certain words being under- or overrepresented in the list of genes. Distinguishing between biologically meaningful trends and artifacts of annotation and analysis procedures is of the utmost importance, as only true biological trends are of interest for further experimentation. A number of sophisticated methods for identification of significant lexical trends are currently available, but these methods are generally too cumbersome for practical use by most microarray users. Results: We have developed a tool, LACK, for calculating the statistical significance of apparent lexical bias in microarray datasets. The frequency of a user-specified list of search terms in a list of genes which are differentially regulated is assessed for statistical significance by comparison to randomly generated datasets. The simplicity of the input files and user interface targets the average microarray user who wishes to have a statistical measure of apparent lexical trends in analyzed datasets without the need for bioinformatics skills. The software is available as Perl source or a Windows executable. Conclusion: We have used LACK in our laboratory to generate biological hypotheses based on our microarray data. We demonstrate the program's utility using an example in which we confirm significant upregulation of SP1-2 pathogenicity island of Salmonella enterica serovar Typhimurium by the cation chelator dipyridyl.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Microarray data analysis for beginners
    Terry Speed
    Nature Cell Biology, 2003, 5 (12) : 1039 - 1039
  • [32] Computational analysis of microarray data
    John Quackenbush
    Nature Reviews Genetics, 2001, 2 : 418 - 427
  • [33] Outliers in microarray data analysis
    Pearson, RK
    Gonye, GE
    Schwaber, JS
    METHODS OF MICROARRAY DATA ANALYSIS III, 2003, : 41 - 55
  • [34] Microarray data management and analysis
    Prakash, PJ
    Shams, S
    GENETIC ENGINEERING NEWS, 2005, 25 (11): : 38 - 39
  • [35] Optimal significance analysis of microarray data in a class of tests whose null statistic can be constructed
    Fujisawa, Hironori
    Sakaguchi, Takayuki
    TEST, 2012, 21 (02) : 280 - 300
  • [36] Significance analysis and improved discovery of differentially co-expressed gene sets in microarray data
    Li, Haixia
    Karuturi, R. Krishna Murthy
    ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 196 - +
  • [37] Optimal significance analysis of microarray data in a class of tests whose null statistic can be constructed
    Hironori Fujisawa
    Takayuki Sakaguchi
    TEST, 2012, 21 : 280 - 300
  • [38] Lexical data augmentation for sentiment analysis
    Xiang, Rong
    Chersoni, Emmanuele
    Lu, Qin
    Huang, Chu-Ren
    Li, Wenjie
    Long, Yunfei
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2021, 72 (11) : 1432 - 1447
  • [39] Reproducibility of microarray data: a further analysis of microarray quality control (MAQC) data
    James J Chen
    Huey-Miin Hsueh
    Robert R Delongchamp
    Chien-Ju Lin
    Chen-An Tsai
    BMC Bioinformatics, 8
  • [40] Reproducibility of microarray data: a further analysis of microarray quality control (MAQC) data
    Chen, James J.
    Hsueh, Huey-Miin
    Delongchamp, Robert R.
    Lin, Chien-Ju
    Tsai, Chen-An
    BMC BIOINFORMATICS, 2007, 8 (1) : 1 - 14