A linear time biclustering algorithm for time series gene expression data

被引:0
|
作者
Madeira, SC [1 ]
Oliveira, AL
机构
[1] INESC, ID, Lisbon, Portugal
[2] Univ Tecn Lisboa, IST, Lisbon, Portugal
[3] Univ Beira Interior, Covilha, Portugal
来源
关键词
D O I
暂无
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Several non-supervised machine learning methods have been used in the analysis of gene expression data obtained from microarray experiments. Recently, biclustering, a non-supervised approach that performs simultaneous clustering on the row and column dimensions of the data matrix, has been shown to be remarkably effective in a variety of applications. The goal of biclustering is to find subgroups of genes and subgroups of conditions, where the genes exhibit highly correlated behaviors. In the most common settings, biclustering is an NP-complete problem, and heuristic approaches are used to obtain sub-optimal solutions using reasonable computational resources. In this work, we examine a particular setting of the problem, where we are concerned with finding biclusters in time series expression data. In this context, we are interested in finding biclusters with consecutive columns. For this particular version of the problem, we propose an algorithm that finds and reports all relevant biclusters in time linear on the size of the data matrix. This complexity is obtained by manipulating a discretized version of the matrix and by using string processing techniques based on suffix trees. We report results in both synthetic and real data that show the effectiveness of the approach.
引用
下载
收藏
页码:39 / 52
页数:14
相关论文
共 50 条
  • [21] Biclustering of gene expression data using genetic algorithm
    Chakraborty, A
    Maka, H
    PROCEEDINGS OF THE 2005 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2005, : 17 - 24
  • [22] Biclustering gene expression data by an improved optimal algorithm
    Wang, MingQian
    Tian, Wei
    Kang, Hao
    Gao, WenJu
    MECHATRONICS AND INDUSTRIAL INFORMATICS, PTS 1-4, 2013, 321-324 : 2223 - 2226
  • [23] Biclustering of gene expression data using biclustering iterative signature algorithm and biclustering coherent column
    Kumar, E. Saravana
    Vengatesan, K.
    Singh, R. P.
    Rajan, C.
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2018, 26 (3-4) : 341 - 352
  • [24] Analyzing time series gene expression data
    Bar-Joseph, Z
    BIOINFORMATICS, 2004, 20 (16) : 2493 - 2503
  • [25] Biclustering of ARMA time series
    Jeonghwa LEE
    Chi-Hyuck JUN
    Journal of Zhejiang University-Science A(Applied Physics & Engineering), 2010, 11 (12) : 959 - 965
  • [26] Biclustering Multivariate Time Series
    Cachucho, Ricardo
    Nijssen, Siegfried
    Knobbe, Arno
    ADVANCES IN INTELLIGENT DATA ANALYSIS XVI, IDA 2017, 2017, 10584 : 27 - 39
  • [27] Biclustering of ARMA time series
    Jeonghwa Lee
    Chi-Hyuck Jun
    Journal of Zhejiang University-SCIENCE A, 2010, 11 : 959 - 965
  • [28] Biclustering of ARMA time series
    Lee, Jeonghwa
    Jun, Chi-Hyuck
    JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE A, 2010, 11 (12): : 959 - 965
  • [29] Biclustering of ARMA time series
    Jeonghwa LEE
    Chi-Hyuck JUN
    Journal of Zhejiang University-Science A(Applied Physics & Engineering), 2010, (12) : 959 - 965
  • [30] Biclustering analysis on tree-shaped time-series single cell gene expression data of Caenorhabditis elegans
    Guan, Qi
    Yan, Xianzhong
    Wu, Yida
    Zhou, Da
    Hu, Jie
    BMC BIOINFORMATICS, 2024, 25 (01)