Codes Correcting Long Duplication Errors

被引:0
|
作者
Goshkoder, Daniil [1 ]
Polyanskii, Nikita [2 ]
Vorobyev, Ilya [3 ]
机构
[1] Lomonosov Moscow State Univ, Chair Probabil Theory, Moscow 119991, Russia
[2] IOTA Fdn, D-10437 Berlin, Germany
[3] Tech Univ Munich, Chair Theoret Informat Technol, D-80333 Munich, Germany
关键词
Error correction codes; Computational complexity; Duplication errors; Tandem duplications; Long duplications; CONSTRUCTION; STORAGE; DNA; DELETION; NUMBER;
D O I
10.1109/TMBMC.2024.3403755
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We consider the problem of constructing codes capable of correcting long tandem duplication errors of variable length. We present a subquadratic-complexity algorithm that uses only one symbol of redundancy to encode q-ary length-n words into codewords, which can correct a single duplication of length at least K = 4 center dot [log(q) n ] + 1. We enhance the error-correcting capability by introducing codes without efficient encoding, leading to an improved value of K = [log(q) n] + phi(n), where phi(n) is an arbitrary function such that phi(n) -> infinity as n -> infinity. In the class of codes correcting a single long duplication with redundancy 1, the value K in our constructions is order-optimal. Finally, k-repeat-free codes, in which every codeword contains any k-tuple at most once, are shown to correct any number of independent long duplications, each of length at least K = 2k, occurring simultaneously without any mutual interference.
引用
收藏
页码:272 / 288
页数:17
相关论文
共 50 条