An Efficient Algorithm for Chinese Postman Walk on Bi-directed de Bruijn Graphs

被引:0
|
作者
Kundeti, Vamsi [1 ]
Rajasekaran, Sanguthevar [1 ]
Heiu Dinh [1 ]
机构
[1] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA
基金
美国国家科学基金会;
关键词
SEQUENCE;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Sequence assembly from short reads is an important problem in biology. It is known that solving the sequence assembly problem exactly on a bi-directed de Bruijn graph or a string graph is intractable. However finding a Shortest Double stranded DNA string (SDDNA) containing all the k-long words in the reads seems to he a good heuristic to get close to the original genome. This problem is equivalent to finding a cyclic Chinese Postman (CP) walk on the underlying un-weighted hi-directed de Bruijn graph built from the reads. The Chinese Postman walk Problem (CPP) is solved by reducing it to a general bi-directed flow on this graph which runs in O(vertical bar E vertical bar(2) log(2)(vertical bar V vertical bar)) time. In this paper we show that the cyclic CPP on bi-directed graphs can be solved without reducing it to hi-directed flow. We present a Theta(p(vertical bar V vertical bar+vertical bar E vertical bar) log(vertical bar V vertical bar)+(d(max)p)(3)) time algorithm to solve the cyclic CPP on a weighted bi-directed de Bruijn graph, where p = max{vertical bar{v vertical bar d(in)(v) d(out)(v) > 0}vertical bar, vertical bar{v vertical bar d(in) (v) - d(out)(v) < 0}vertical bar} and d(max) = max{vertical bar d(in)(v) - d(out)(v)}. Our algorithm performs asymptotically better than the bi-directed flow algorithm when the number of imbalanced nodes p is much less than the nodes in the bi-directed graph. From our experimental results on various datasets, we have noticed that the value of p/vertical bar V vertical bar lies between 0.08% and 0.13% with 95% probability. Many practical hi-directed de Bruijn graphs do not have cyclic CP walks. In such cases it is not clear how the hi-directed flow can be useful in identifying contigs. Our algorithm can handle such situations and identify maximal bi-directed sub-graphs that have CP walks. We also present a Theta((vertical bar V vertical bar + vertical bar E vertical bar) log(V)) time algorithm for the single source shortest path problem on bi-directed de Bruijn graphs, which may be of independent interest.
引用
收藏
页码:184 / 196
页数:13
相关论文
共 34 条
  • [1] AN EFFICIENT ALGORITHM FOR CHINESE POSTMAN WALK ON BI-DIRECTED DE BRUIJN GRAPHS
    Kundeti, Vamsi
    Rajasekaran, Sanguthevar
    Dinh, Heiu
    [J]. DISCRETE MATHEMATICS ALGORITHMS AND APPLICATIONS, 2012, 4 (02)
  • [2] Efficient parallel and out of core algorithms for constructing large bi-directed de Bruijn graphs
    Vamsi K Kundeti
    Sanguthevar Rajasekaran
    Hieu Dinh
    Matthew Vaughn
    Vishal Thapar
    [J]. BMC Bioinformatics, 11
  • [3] Efficient parallel and out of core algorithms for constructing large bi-directed de Bruijn graphs
    Kundeti, Vamsi K.
    Rajasekaran, Sanguthevar
    Dinh, Hieu
    Vaughn, Matthew
    Thapar, Vishal
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [4] A de novo Genome Assembler based on MapReduce and Bi-directed de Bruijn Graph
    Zhang, Yuehua
    Xuan, Pengfei
    Wang, Yunsheng
    Srimani, Pradip K.
    Luo, Feng
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 65 - 71
  • [5] Identifying codes on directed de Bruijn graphs
    Boutin, Debra
    Goliber, Victoria Horan
    Pelto, Mikko
    [J]. DISCRETE APPLIED MATHEMATICS, 2019, 262 : 29 - 41
  • [6] A NEW ALGORITHM FOR THE DIRECTED CHINESE POSTMAN PROBLEM
    LIN, YX
    ZHAO, YC
    [J]. COMPUTERS & OPERATIONS RESEARCH, 1988, 15 (06) : 577 - 584
  • [7] Compression algorithm for colored de Bruijn graphs
    Rahman, Amatur
    Dufresne, Yoann
    Medvedev, Paul
    [J]. ALGORITHMS FOR MOLECULAR BIOLOGY, 2024, 19 (01)
  • [8] Space Efficient Merging of de Bruijn Graphs and Wheeler Graphs
    Egidi, Lavinia
    Louza, Felipe A.
    Manzini, Giovanni
    [J]. ALGORITHMICA, 2022, 84 (03) : 639 - 669
  • [9] Space Efficient Merging of de Bruijn Graphs and Wheeler Graphs
    Lavinia Egidi
    Felipe A. Louza
    Giovanni Manzini
    [J]. Algorithmica, 2022, 84 : 639 - 669
  • [10] Min-Deviation-Flow in Bi-directed Graphs for T-Mesh Quantization
    Heistermann, Martin
    Warnett, Jethro
    Bommes, David
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2023, 42 (04):