Biological assessment of robust noise models in microarray data analysis

Cited by: 23
Authors
Posekany, A. [1]
Felsenstein, K. [2]
Sykacek, P. [1]
Affiliations
[1] Univ Nat Resources & Life Sci, Dept Biotechnol, Chair Bioinformat, A-1180 Vienna, Austria
[2] Vienna Univ Technol, Dept Stat, A-1040 Vienna, Austria
Keywords
DIFFERENTIALLY EXPRESSED GENES; NONPARAMETRIC METHODS; NORMALIZATION; TRANSCRIPTOME; MECHANISMS; ONTOLOGY; OBESITY; MICE; TOOL
DOI
10.1093/bioinformatics/btr018
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Motivation: Although several recently proposed analysis packages for microarray data can cope with heavy-tailed noise, many applications rely on Gaussian assumptions. Gaussian noise models foster computational efficiency. This comes, however, at the expense of increased sensitivity to outlying observations. Assessing potential insufficiencies of Gaussian noise in microarray data analysis is thus important and of general interest.

Results: To this end, we propose assessing different noise models on a large number of microarray experiments. The goodness of fit of noise models is quantified by a hierarchical Bayesian analysis of variance model, which predicts normalized expression values as a mixture of a Gaussian density and t-distributions with adjustable degrees of freedom. Inference of differentially expressed genes is taken into consideration at a second mixing level. To attain far-reaching validity, our investigations cover a wide range of analysis platforms and experimental settings. As the most striking result, we find in all experiments, irrespective of the chosen preprocessing and normalization method, that a heavy-tailed noise model is a better fit than a simple Gaussian. Further investigations revealed that an appropriate choice of noise model has a considerable influence on biological interpretations drawn at the level of inferred genes and gene ontology terms. We conclude from our investigation that neglecting the overdispersed noise in microarray data can mislead scientific discovery, and we suggest that the convenience of Gaussian-based modelling should be replaced by non-parametric approaches or other methods that account for heavy-tailed noise.
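The idea of comparing a Gaussian with a heavy-tailed noise model can be illustrated with a much simpler calculation than the one the abstract describes. The sketch below is not the authors' hierarchical Bayesian analysis of variance model: it assumes simulated residuals, a Student-t distribution with a fixed (not inferred) number of degrees of freedom, and plain maximum-likelihood fits. The function names, the sample size, and the choice of 3 degrees of freedom are all illustrative assumptions.

```python
import math
import random


def gaussian_loglik(x):
    """Maximised Gaussian log-likelihood (mean and variance set to their MLEs)."""
    n = len(x)
    mu = sum(x) / n
    var = sum((v - mu) ** 2 for v in x) / n
    return -0.5 * n * (math.log(2 * math.pi * var) + 1.0)


def student_t_loglik(x, nu, iters=200):
    """Log-likelihood of a location-scale Student-t with fixed degrees of
    freedom nu, with location and scale fitted by the standard EM updates."""
    n = len(x)
    mu = sum(x) / n
    s2 = sum((v - mu) ** 2 for v in x) / n
    for _ in range(iters):
        # E-step: observations far from mu receive small weights.
        w = [(nu + 1.0) / (nu + (v - mu) ** 2 / s2) for v in x]
        # M-step: weighted location, then weighted squared scale.
        mu = sum(wi * v for wi, v in zip(w, x)) / sum(w)
        s2 = sum(wi * (v - mu) ** 2 for wi, v in zip(w, x)) / n
    const = (math.lgamma((nu + 1) / 2) - math.lgamma(nu / 2)
             - 0.5 * math.log(nu * math.pi * s2))
    return sum(const - 0.5 * (nu + 1) * math.log1p((v - mu) ** 2 / (nu * s2))
               for v in x)


# Simulated heavy-tailed "residuals" (assumption: t-distributed with 3 df).
rng = random.Random(0)
nu_true = 3
data = [rng.gauss(0.0, 1.0)
        / math.sqrt(rng.gammavariate(nu_true / 2, 2.0) / nu_true)
        for _ in range(2000)]

ll_gauss = gaussian_loglik(data)
ll_t = student_t_loglik(data, nu=3)
print(f"Gaussian log-likelihood:  {ll_gauss:.1f}")
print(f"Student-t log-likelihood: {ll_t:.1f}")
```

On heavy-tailed data such as this, the t model attains the higher log-likelihood, because the Gaussian has to inflate its variance to accommodate the outlying values. Both models here have two fitted parameters, so the raw log-likelihoods are directly comparable; a model with adjustable degrees of freedom would need a comparison that accounts for the extra flexibility.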
Pages: 807-814
Page count: 8