A linear time biclustering algorithm for time series gene expression data

Authors
Madeira, SC [1]
Oliveira, AL
Affiliations
[1] INESC, ID, Lisbon, Portugal
[2] Univ Tecn Lisboa, IST, Lisbon, Portugal
[3] Univ Beira Interior, Covilha, Portugal
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Several unsupervised machine learning methods have been used to analyze gene expression data obtained from microarray experiments. Recently, biclustering, an unsupervised approach that clusters the rows and columns of the data matrix simultaneously, has been shown to be remarkably effective in a variety of applications. The goal of biclustering is to find subgroups of genes and subgroups of conditions in which the genes exhibit highly correlated behavior. In the most common settings, biclustering is an NP-complete problem, and heuristic approaches are used to obtain sub-optimal solutions with reasonable computational resources. In this work, we examine a particular setting of the problem: finding biclusters in time series expression data. In this context, we are interested in biclusters with consecutive columns. For this version of the problem, we propose an algorithm that finds and reports all relevant biclusters in time linear in the size of the data matrix. This complexity is obtained by manipulating a discretized version of the matrix and by using string processing techniques based on suffix trees. We report results on both synthetic and real data that show the effectiveness of the approach.
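The idea of discretizing the matrix and then looking for shared patterns over consecutive columns can be illustrated with a minimal Python sketch. This is not the authors' algorithm: the three-symbol alphabet (`U`, `D`, `N`), the `threshold` parameter, and the function names are assumptions made for illustration, and the suffix tree is replaced by a brute-force scan over column windows, so the sketch is not linear time and also reports non-maximal biclusters.

```python
from collections import defaultdict


def discretize(matrix, threshold=0.5):
    """Encode each gene's change between consecutive time points as a symbol.

    'U' = up, 'D' = down, 'N' = no change. The alphabet and the threshold
    are illustrative assumptions; the abstract only says "discretized".
    """
    rows = []
    for row in matrix:
        symbols = []
        for prev, curr in zip(row, row[1:]):
            delta = curr - prev
            if delta > threshold:
                symbols.append("U")
            elif delta < -threshold:
                symbols.append("D")
            else:
                symbols.append("N")
        rows.append("".join(symbols))
    return rows


def contiguous_biclusters(rows, min_genes=2, min_cols=2):
    """Group genes that share one symbol pattern over the same column window.

    Brute force over every contiguous window [start, end); a suffix tree
    would avoid this repeated scanning, which this sketch does not attempt.
    Returns tuples (genes, start, end, pattern).
    """
    n_cols = len(rows[0]) if rows else 0
    found = []
    for start in range(n_cols):
        for end in range(start + min_cols, n_cols + 1):
            groups = defaultdict(list)
            for gene, symbols in enumerate(rows):
                groups[symbols[start:end]].append(gene)
            for pattern, genes in groups.items():
                if len(genes) >= min_genes:
                    found.append((tuple(genes), start, end, pattern))
    return found


# Toy data: genes 0 and 1 rise twice and then fall; gene 2 does the opposite.
expression = [[0, 1, 2, 1], [5, 6, 7, 6], [3, 2, 1, 2]]
patterns = discretize(expression)
biclusters = contiguous_biclusters(patterns)
```

On the toy data, genes 0 and 1 share every window of the discretized pattern even though their absolute expression levels differ, which is the kind of coherent behavior over consecutive time points that the abstract describes.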
Pages: 39-52
Number of pages: 14