EDISA: extracting biclusters from multiple time-series of gene expression profiles

被引:38
|
作者
Supper, Jochen
Strauch, Martin
Wanke, Dierk
Harter, Klaus
Zell, Andreas
机构
[1] Univ Tubingen, Ctr Bioinformat Tubingen, ZBIT, D-72076 Tubingen, Germany
[2] Univ Tubingen, Ctr Plant Mol Biol, ZMBP, D-72076 Tubingen, Germany
关键词
D O I
10.1186/1471-2105-8-334
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Cells dynamically adapt their gene expression patterns in response to various stimuli. This response is orchestrated into a number of gene expression modules consisting of co-regulated genes. A growing pool of publicly available microarray datasets allows the identification of modules by monitoring expression changes over time. These time-series datasets can be searched for gene expression modules by one of the many clustering methods published to date. For an integrative analysis, several time-series datasets can be joined into a three-dimensional gene-condition-time dataset, to which standard clustering or biclustering methods are, however, not applicable. We thus devise a probabilistic clustering algorithm for gene-condition-time datasets. Results: In this work, we present the EDISA ( Extended Dimension Iterative Signature Algorithm), a novel probabilistic clustering approach for 3D gene-condition-time datasets. Based on mathematical definitions of gene expression modules, the EDISA samples initial modules from the dataset which are then refined by removing genes and conditions until they comply with the module definition. A subsequent extension step ensures gene and condition maximality. We applied the algorithm to a synthetic dataset and were able to successfully recover the implanted modules over a range of background noise intensities. Analysis of microarray datasets has lead us to define three biologically relevant module types: 1) We found modules with independent response profiles to be the most prevalent ones. These modules comprise genes which are co-regulated under several conditions, yet with a different response pattern under each condition. 2) Coherent modules with similar responses under all conditions occurred frequently, too, and were often contained within these modules. 3) A third module type, which covers a response specific to a single condition was also detected, but rarely. All of these modules are essentially different types of biclusters. Conclusion: We successfully applied the EDISA to different 3D datasets. While previous studies were mostly aimed at detecting coherent modules only, our results show that coherent responses are often part of a more general module type with independent response profiles under different conditions. Our approach thus allows for a more comprehensive view of the gene expression response. After subsequent analysis of the resulting modules, the EDISA helped to shed light on the global organization of transcriptional control. An implementation of the algorithm is available at http://www-ra.informatik.uni-tuebingen.de/software/IAGEN/.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] EDISA: extracting biclusters from multiple time-series of gene expression profiles
    Jochen Supper
    Martin Strauch
    Dierk Wanke
    Klaus Harter
    Andreas Zell
    [J]. BMC Bioinformatics, 8
  • [2] Microarray Time-Series Data Clustering via Multiple Alignment of Gene Expression Profiles
    Subhani, Numanul
    Ngom, Alioune
    Rueda, Luis
    Burden, Conrad
    [J]. PATTERN RECOGNITION IN BIOINFORMATICS, PROCEEDINGS, 2009, 5780 : 377 - +
  • [3] Implications of time-series gene expression profiles of replicative senescence
    Kim, You-Mi
    Byun, Hae-Ok
    Jee, Byul A.
    Cho, Hyunwoo
    Seo, Yong-Hak
    Kim, You-Sun
    Park, Min Hi
    Chung, Hae-Young
    Woo, Hyun Goo
    Yoon, Gyesoon
    [J]. AGING CELL, 2013, 12 (04): : 622 - 634
  • [4] Dynamic Identification and Visualization of Gene Regulatory Networks from Time-Series Gene Expression Profiles
    Chen, Yu
    Han, Kyungsook
    [J]. EMERGING INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PROCEEDINGS, 2009, 5754 : 65 - 74
  • [5] Method for inferring and extracting reliable genetic interactions from time-series profile of gene expression
    Nakatsui, Masahiko
    Ueda, Takanori
    Maki, Yukihiro
    Ono, Isao
    Okamoto, Masahiro
    [J]. MATHEMATICAL BIOSCIENCES, 2008, 215 (01) : 105 - 114
  • [6] EXTRACTING FRACTAL COMPONENTS FROM TIME-SERIES
    YAMAMOTO, Y
    HUGHSON, RL
    [J]. PHYSICA D, 1993, 68 (02): : 250 - 264
  • [7] A Time-Series Analysis of Severe Burned Injury of Skin Gene Expression Profiles
    Xu, Hai-Ting
    Guo, Jian-Chun
    Liu, Hua-Zhen
    Jin, Wan-Wan
    [J]. CELLULAR PHYSIOLOGY AND BIOCHEMISTRY, 2018, 49 (04) : 1492 - 1498
  • [8] Time-series gene expression profiles in AGS cells stimulated with Helicobacter pylori
    You, Yuan-Hai
    Song, Yan-Yan
    Meng, Fan-Liang
    He, Li-Hua
    Zhang, Mao-Jun
    Yan, Xiao-Mei
    Zhang, Jian-Zhong
    [J]. WORLD JOURNAL OF GASTROENTEROLOGY, 2010, 16 (11) : 1385 - 1396
  • [9] Extracting parametric dynamics from time-series data
    Huimei Ma
    Xiaofan Lu
    Linan Zhang
    [J]. Nonlinear Dynamics, 2023, 111 : 15177 - 15199
  • [10] Extracting parametric dynamics from time-series data
    Ma, Huimei
    Lu, Xiaofan
    Zhang, Linan
    [J]. NONLINEAR DYNAMICS, 2023, 111 (16) : 15177 - 15199