A branch and bound irredundant graph algorithm for large-scale MLCS problems

被引:11
|
作者
Wang, Chunyang [1 ]
Wang, Yuping [1 ]
Cheung, Yiuming [2 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Shaanxi, Peoples R China
[2] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
关键词
Multiple longest common subsequences; Small DAG; Branch and bound; Gene alignment;
D O I
10.1016/j.patcog.2021.108059
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finding the multiple longest common subsequences (MLCS) among many long sequences (i.e., the large scale MLCS problem) has many important applications, such as gene alignment, disease diagnosis, and documents similarity check, etc. It is an NP-hard problem (Maier et al., 1978). The key bottle neck of this problem is that the existing state-of-the-art algorithms must construct a huge graph (called direct acyclic graph, briefly DAG), and the computer usually has no enough space to store and handle this graph. Thus the existing algorithms cannot solve the large scale MLCS problem. In order to quickly solve the large-scale MLCS problem within limited computer resources, this paper therefore proposes a branch and bound irredundant graph algorithm called Big-MLCS, which constructs a much smaller DAG (called SmallDAG) than the existing algorithms do by a branch and bound method, and designs a new data structure to efficiently store and handle Small-DAG. By these schemes, Big-MLCS is more efficient than the existing algorithms. Also, we compare the proposed algorithm with two state-of-the-art algorithms through the experiments, and the results show that the proposed algorithm outperforms the compared algorithms and is more suitable to large-scale MLCS problems. (c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] A new algorithm for large-scale scheduling problems: Sequence branch algorithm
    Lee, S
    Bok, JK
    Park, S
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 1998, 37 (10) : 4049 - 4058
  • [2] A branch-and-bound algorithm with growing datasets for large-scale estimation
    Sass, Susanne
    Mitsos, Alexander
    Bongartz, Dominik
    Bell, Ian H.
    Nikolov, Nikolay I.
    Tsoukalas, Angelos
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2024, 316 (01) : 36 - 45
  • [3] A GRAPH-COLORING ALGORITHM FOR LARGE-SCALE SCHEDULING PROBLEMS
    LOTFI, V
    SARIN, S
    COMPUTERS & OPERATIONS RESEARCH, 1986, 13 (01) : 27 - 32
  • [4] Parameter estimation in branch and bound algorithms for large-scale discrete optimization problems
    Sigal, IK
    JOURNAL OF COMPUTER AND SYSTEMS SCIENCES INTERNATIONAL, 2005, 44 (04) : 594 - 599
  • [5] A Parallel Branch and Bound Algorithm for Solving Large Scale Integer Programming Problems
    Ismail, Mahmoud M.
    Abd el-Raoof, Osama
    Abd El-Wahed, Waiel F.
    APPLIED MATHEMATICS & INFORMATION SCIENCES, 2014, 8 (04): : 1691 - 1698
  • [6] A revisited branch-and-cut algorithm for large-scale orienteering problems
    Kobeaga, Gorka
    Rojas-Delgado, Jairo
    Merino, Maria
    Lozano, Jose A.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2024, 313 (01) : 44 - 68
  • [7] A branch and bound algorithm to solve large-scale multistage stochastic programs with endogenous uncertainty
    Christian, Brianna
    Cremaschi, Selen
    AICHE JOURNAL, 2018, 64 (04) : 1262 - 1271
  • [8] A Branch and Bound Algorithm to Scale Alignment of Large Ontologies
    Stoutenburg, Suzette
    Ewing, Kaily
    Hines, Lisa
    Kalita, Jugal
    2009 IEEE THIRD INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2009), 2009, : 349 - 354
  • [9] A Study on Reachability Problems of Large-scale Graph
    Ma, Jing-yan
    Zhang, Ke-hong
    2015 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND TECHNOLOGY (ICCST 2015), 2015, : 244 - 254
  • [10] Petascale computing for large-scale graph problems
    Bader, David A.
    PARALLEL PROCESSING AND APPLIED MATHEMATICS, 2008, 4967 : 166 - 169