Minimum common string partition;
Bioinformatics;
Integer linear programming;
Computational comparison;
GREEDY ALGORITHM;
EDIT DISTANCE;
D O I:
10.1007/s11590-015-0921-4
中图分类号:
C93 [管理学];
O22 [运筹学];
学科分类号:
070105 ;
12 ;
1201 ;
1202 ;
120202 ;
摘要:
In the minimum common string partition (MCSP) problem two related input strings are given. "Related" refers to the property that both strings consist of the same set of letters appearing the same number of times in each of the two strings. The MCSP seeks a minimum cardinality partitioning of one string into non-overlapping substrings that is also a valid partitioning for the second string. This problem has applications in bioinformatics e.g. in analyzing related DNA or protein sequences. For strings with lengths less than about 1000 letters, a previously published integer linear programming (ILP) formulation yields, when solved with a state-of-the-art solver such as CPLEX, satisfactory results. In this work, we propose a new, alternative ILP model that is compared to the former one. While a polyhedral study shows the linear programming relaxations of the two models to be equally strong, a comprehensive experimental comparison using real-world as well as artificially created benchmark instances indicates substantial computational advantages of the new formulation.
机构:
Univ Aquila, Dipartimento Sci Ingn Informaz & Matemat, Via Vetoio, I-67010 Laquila, ItalyUniv Aquila, Dipartimento Sci Ingn Informaz & Matemat, Via Vetoio, I-67010 Laquila, Italy
Arbib, Claudio
Servilio, Mara
论文数: 0引用数: 0
h-index: 0
机构:
Ist Anal Sistemi & Informat A Ruberti CNR, Via Taurini 19, I-00185 Rome, ItalyUniv Aquila, Dipartimento Sci Ingn Informaz & Matemat, Via Vetoio, I-67010 Laquila, Italy
Servilio, Mara
Ventura, Paolo
论文数: 0引用数: 0
h-index: 0
机构:
Ist Anal Sistemi & Informat A Ruberti CNR, Via Taurini 19, I-00185 Rome, ItalyUniv Aquila, Dipartimento Sci Ingn Informaz & Matemat, Via Vetoio, I-67010 Laquila, Italy