An upper bound on the hardness of exact matrix based motif discovery

被引：2

作者：

Horton, Paul ^{[1
]}

Fujibuchi, Wataru ^{[1
]}

机构：

[1] AIST, Computat Biol Res Ctr, Tokyo, Japan

来源：

JOURNAL OF DISCRETE ALGORITHMS | 2007年 / 5卷 / 04期

关键词：

Motif discovery; Computational complexity; Combinatorics; Transcription factor binding site prediction; String algorithm;

D O I：

10.1016/j.jda.2006.10.006

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

Motif discovery is the problem of finding local patterns or motifs from a set of unlabeled sequences. One common representation of a motif is a Markov model known as a score matrix. Matrix based motif discovery has been extensively studied but no positive results have been known regarding its theoretical hardness. We present the first non-trivial upper bound on the complexity (worst-case computation time) of this problem. Other than linear terms, our bound depends only on the motif width w (which is typically 5-20) and is a dramatic improvement relative to previously known bounds. We prove this bound by relating the motif discovery problem to a search problem over permutations of strings of length w, in which the permutations have a particular property. We give a constructive proof of an upper bound on the number of such permutations. For an alphabet size of sigma (typically 4) the trivial bound is n! approximate to (n/e)(n), n = sigma(w). Our bound is roughly n(sigma log(sigma) n)(n). We relate this theoretical result to the exact motif discovery program, TsukubaBB, whose algorithm contains ideas which inspired the result. We describe a recent improvement to the TsukubaBB program which can give a speed up of nine or more and use a dataset of REB1 transcription factor binding sites to illustrate that exact methods can indeed be used in some practical situations. (C) 2006 Published by Elsevier B.V.

引用

页码：706 / 713

页数：8

共 50 条

[31] The upper bound for the index of nilpotency for a matrix commuting with a given nilpotent matrix
Oblak, Polona
LINEAR & MULTILINEAR ALGEBRA, 2008, 56 (06): : 701 - 711
[32] An Amortized O(1) Lower Bound for Dynamic Time Warping in Motif Discovery
Chao, Zemin
Gao, Hong
Miao, Dongjing
Li, Jianzhong
Wang, Hongzhi
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2025, 37 (05) : 2239 - 2252
[33] A linear matrix inequality-based approach for estimating upper bound of settling time
Shahbazzadeh, Majid
Sadati, Seyed Jalil
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2023, 237 (03) : 490 - 500
[34] Graph-based Approaches for Motif Discovery
Zaslavsky, Elena
CLUSTER CHALLENGES IN BIOLOGICAL NETWORKS, 2009, : 83 - 99
[35] A new upper bound for eigenvalues of the Laplacian matrix of a graph
Li, JS
Zhang, XD
LINEAR ALGEBRA AND ITS APPLICATIONS, 1997, 265 : 93 - 100
[36] An upper bound for the condition number of a matrix in spectral norm
Piazza, G
Politi, T
JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2002, 143 (01) : 141 - 144
[37] An upper bound for the spectral condition number of a diagonalizable matrix
Jiang, Erxiong
Lam, Peter C. B.
Linear Algebra and Its Applications, 1997, 262 (1-3): : 165 - 178
[38] UPPER BOUND FOR THE NUMBER OF DISTINCT EIGENVALUES OF A PERTURBED MATRIX
Moon, Sunyo
Park, Seungkook
ELECTRONIC JOURNAL OF LINEAR ALGEBRA, 2018, 34 : 115 - 124
[39] A new upper bound for eigenvalues of the Laplacian matrix of a graph
Li, Jiong-Sheng
Zhang, Xiao-Dong
Linear Algebra and Its Applications, 1997, 265 (1-3): : 93 - 100
[40] Upper matrix bound of the solution for the discrete Riccati equation
Lee, CH
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1997, 42 (06) : 840 - 842

← 1 2 3 4 5 →