AN EFFICIENT ALGORITHM FOR CHINESE POSTMAN WALK ON BI-DIRECTED DE BRUIJN GRAPHS

被引:2
|
作者
Kundeti, Vamsi [1 ]
Rajasekaran, Sanguthevar [1 ]
Dinh, Heiu [1 ]
机构
[1] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA
基金
美国国家科学基金会;
关键词
Sequence assembly algorithms; bioinformatics; Chinese Postman problem; bi-directed graphs;
D O I
10.1142/S179383091250019X
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Sequence assembly from short reads is an important problem in biology. It is known that solving the sequence assembly problem exactly on a bi-directed de Bruijn graph or a string graph is intractable. However, finding a shortest double stranded DNA string (SDDNA) containing all the k-long words in the reads seems to be a good heuristic to get close to the original genome. This problem is equivalent to finding a cyclic Chinese Postman (CP) walk on the underlying unweightedbi- directed de Bruijn graph built from the reads. The Chinese Postman walk Problem (CPP) is solved by reducing it to a general bi-directed flow on this graph which runs in O(|E| (2)log(2)(|V |)) time. In this paper we show that the cyclic CPP on bi-directed graphs can be solved without reducing it to bi-directed flow. We present a Xi(p(|V |+| E|) log(|V |)+(dmaxp)(3)) time algorithm to solve the cyclic CPP on a weighted bi-directed de Bruijn graph, where p = max{|{v| d(in)(v) - d(out)( v) > 0}|, |{v| d(in)(v) - d(out)(v) < 0}|} and d(max) = max{| d(in)(v) - d(out)(v)}. Our algorithm performs asymptotically better than the bi-directed flow algorithm when the number of imbalanced nodes p is much less than the nodes in the bi-directed graph. From our experimental results on various datasets, we have noticed that the value of p/|V | lies between 0.08% and 0.13% with 95% probability. Many practical bi-directed de Bruijn graphs do not have cyclic CP walks. In such cases it is not clear how the bi-directed flow can be useful in identifying contigs. Our algorithm can handle such situations and identify maximal bi-directed sub-graphs that have CP walks. A Xi(p(|V | + | E|)) time heuristic algorithm based on these ideas has been implemented for the SDDNA problem. This algorithm was tested on short reads from a plant genome and achieves an approximation ratio of at most 1.0134. We also present a Xi((|V | + | E|) log(V)) time algorithm for the single source shortest path problem on bi-directed de Bruijn graphs, which may be of independent interest.
引用
收藏
页数:16
相关论文
共 33 条
  • [1] An Efficient Algorithm for Chinese Postman Walk on Bi-directed de Bruijn Graphs
    Kundeti, Vamsi
    Rajasekaran, Sanguthevar
    Heiu Dinh
    [J]. COMBINATORIAL OPTIMIZATION AND APPLICATIONS, PT 1, 2010, 6508 : 184 - 196
  • [2] Efficient parallel and out of core algorithms for constructing large bi-directed de Bruijn graphs
    Vamsi K Kundeti
    Sanguthevar Rajasekaran
    Hieu Dinh
    Matthew Vaughn
    Vishal Thapar
    [J]. BMC Bioinformatics, 11
  • [3] Efficient parallel and out of core algorithms for constructing large bi-directed de Bruijn graphs
    Kundeti, Vamsi K.
    Rajasekaran, Sanguthevar
    Dinh, Hieu
    Vaughn, Matthew
    Thapar, Vishal
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [4] A de novo Genome Assembler based on MapReduce and Bi-directed de Bruijn Graph
    Zhang, Yuehua
    Xuan, Pengfei
    Wang, Yunsheng
    Srimani, Pradip K.
    Luo, Feng
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 65 - 71
  • [5] Identifying codes on directed de Bruijn graphs
    Boutin, Debra
    Goliber, Victoria Horan
    Pelto, Mikko
    [J]. DISCRETE APPLIED MATHEMATICS, 2019, 262 : 29 - 41
  • [6] A NEW ALGORITHM FOR THE DIRECTED CHINESE POSTMAN PROBLEM
    LIN, YX
    ZHAO, YC
    [J]. COMPUTERS & OPERATIONS RESEARCH, 1988, 15 (06) : 577 - 584
  • [7] Compression algorithm for colored de Bruijn graphs
    Rahman, Amatur
    Dufresne, Yoann
    Medvedev, Paul
    [J]. ALGORITHMS FOR MOLECULAR BIOLOGY, 2024, 19 (01)
  • [8] Space Efficient Merging of de Bruijn Graphs and Wheeler Graphs
    Egidi, Lavinia
    Louza, Felipe A.
    Manzini, Giovanni
    [J]. ALGORITHMICA, 2022, 84 (03) : 639 - 669
  • [9] Space Efficient Merging of de Bruijn Graphs and Wheeler Graphs
    Lavinia Egidi
    Felipe A. Louza
    Giovanni Manzini
    [J]. Algorithmica, 2022, 84 : 639 - 669
  • [10] Min-Deviation-Flow in Bi-directed Graphs for T-Mesh Quantization
    Heistermann, Martin
    Warnett, Jethro
    Bommes, David
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2023, 42 (04):