Motivation: Although several recently proposed analysis packages for microarray data can cope with heavy-tailed noise, many applications still rely on Gaussian assumptions. Gaussian noise models foster computational efficiency, but at the expense of increased sensitivity to outlying observations. Assessing potential insufficiencies of Gaussian noise in microarray data analysis is thus important and of general interest.

Results: To this end, we propose assessing different noise models on a large number of microarray experiments. The goodness of fit of the noise models is quantified by a hierarchical Bayesian analysis of variance model, which predicts normalized expression values as a mixture of a Gaussian density and t-distributions with adjustable degrees of freedom. Inference of differentially expressed genes is taken into consideration at a second mixing level. To attain far-reaching validity, our investigations cover a wide range of analysis platforms and experimental settings. As the most striking result, we find that a heavy-tailed noise model fits better than a simple Gaussian in all experiments, irrespective of the chosen preprocessing and normalization method. Further investigations revealed that the choice of noise model has a considerable influence on biological interpretations drawn at the level of inferred genes and gene ontology terms. We conclude that neglecting the overdispersed noise in microarray data can mislead scientific discovery, and we suggest that the convenience of Gaussian-based modelling be replaced by non-parametric approaches or other methods that account for heavy-tailed noise.
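The contrast between a Gaussian and a heavy-tailed noise model can be illustrated with a small sketch. This is not the hierarchical Bayesian analysis of variance model described in the abstract: it is a simplified stand-in that compares maximized log-likelihoods of a Gaussian and a location-scale Student-t on simulated residuals. The function names, the coarse grid over degrees of freedom and scale, and the simulated data (t-distributed with 3 degrees of freedom) are all assumptions made for illustration.

```python
import math
import random
import statistics


def gaussian_loglik(x, mu, sigma):
    """Log-likelihood of the sample x under N(mu, sigma^2)."""
    return sum(
        -0.5 * math.log(2 * math.pi * sigma ** 2) - (v - mu) ** 2 / (2 * sigma ** 2)
        for v in x
    )


def student_t_loglik(x, mu, scale, nu):
    """Log-likelihood of x under a location-scale Student-t with nu degrees of freedom."""
    const = (
        math.lgamma((nu + 1) / 2)
        - math.lgamma(nu / 2)
        - 0.5 * math.log(nu * math.pi)
        - math.log(scale)
    )
    return sum(
        const - (nu + 1) / 2 * math.log1p(((v - mu) / scale) ** 2 / nu) for v in x
    )


def fit_gaussian(x):
    """Gaussian log-likelihood at the maximum-likelihood mean and standard deviation."""
    return gaussian_loglik(x, statistics.fmean(x), statistics.pstdev(x))


def fit_student_t(x, nus=(1, 2, 4, 8, 16, 32),
                  scale_factors=(0.75, 1.0, 1.25, 1.5, 2.0, 3.0)):
    """Crude grid search (illustrative only): location fixed at the median,
    scale tried as multiples of the median absolute deviation.
    Returns (best log-likelihood, degrees of freedom, scale)."""
    mu = statistics.median(x)
    mad = statistics.median([abs(v - mu) for v in x])
    best = None
    for nu in nus:
        for factor in scale_factors:
            ll = student_t_loglik(x, mu, factor * mad, nu)
            if best is None or ll > best[0]:
                best = (ll, nu, factor * mad)
    return best


def simulate_heavy_tailed(n, nu=3, seed=0):
    """Draw n Student-t variates as normal / sqrt(chi-square / nu)."""
    rng = random.Random(seed)
    return [
        rng.gauss(0, 1) / math.sqrt(rng.gammavariate(nu / 2, 2) / nu)
        for _ in range(n)
    ]


residuals = simulate_heavy_tailed(2000)
gauss_ll = fit_gaussian(residuals)
t_ll, best_nu, best_scale = fit_student_t(residuals)
print(f"Gaussian log-likelihood:  {gauss_ll:.1f}")
print(f"Student-t log-likelihood: {t_ll:.1f} (nu={best_nu}, scale={best_scale:.2f})")
```

On heavy-tailed data such as this, the Student-t fit should reach a clearly higher log-likelihood with a small number of degrees of freedom. A raw likelihood comparison ignores the extra parameter of the t model, so it is only a rough proxy for the Bayesian model comparison the abstract refers to.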
Kah, Wong Sou
Affiliation: Univ Teknol Malaysia, Fac Comp, Artificial Intelligence & Bioinformat Res Grp, Skudai 81310, Johor, Malaysia
Moorthy, Kohbalan
Affiliation: Univ Teknol Malaysia, Fac Comp, Artificial Intelligence & Bioinformat Res Grp, Skudai 81310, Johor, Malaysia
Mohamad, Mohd Saberi
Affiliation: Univ Teknol Malaysia, Fac Comp, Artificial Intelligence & Bioinformat Res Grp, Skudai 81310, Johor, Malaysia
Kasim, Shahreen
Affiliation: Univ Tun Hussein Onn Malaysia, Fac Informat Technol & Multimedia, Dept Informat Syst, Batu Pahat 86400, Malaysia
Deris, Safaai
Affiliation: Univ Teknol Malaysia, Fac Comp, Artificial Intelligence & Bioinformat Res Grp, Skudai 81310, Johor, Malaysia
Omatu, Sigeru
Yoshioka, Michifumi
Affiliation: Osaka Prefecture Univ, Fac Engn, Naka Ku, Sakai, Osaka 5998531, Japan