A branch and bound irredundant graph algorithm for large-scale MLCS problems

被引:11
|
作者
Wang, Chunyang [1 ]
Wang, Yuping [1 ]
Cheung, Yiuming [2 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Shaanxi, Peoples R China
[2] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
关键词
Multiple longest common subsequences; Small DAG; Branch and bound; Gene alignment;
D O I
10.1016/j.patcog.2021.108059
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finding the multiple longest common subsequences (MLCS) among many long sequences (i.e., the large scale MLCS problem) has many important applications, such as gene alignment, disease diagnosis, and documents similarity check, etc. It is an NP-hard problem (Maier et al., 1978). The key bottle neck of this problem is that the existing state-of-the-art algorithms must construct a huge graph (called direct acyclic graph, briefly DAG), and the computer usually has no enough space to store and handle this graph. Thus the existing algorithms cannot solve the large scale MLCS problem. In order to quickly solve the large-scale MLCS problem within limited computer resources, this paper therefore proposes a branch and bound irredundant graph algorithm called Big-MLCS, which constructs a much smaller DAG (called SmallDAG) than the existing algorithms do by a branch and bound method, and designs a new data structure to efficiently store and handle Small-DAG. By these schemes, Big-MLCS is more efficient than the existing algorithms. Also, we compare the proposed algorithm with two state-of-the-art algorithms through the experiments, and the results show that the proposed algorithm outperforms the compared algorithms and is more suitable to large-scale MLCS problems. (c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] An efficient outcome-space branch-and-bound algorithm for solving a class of large-scale linear multiplicative programs
    Jing, Xia
    Ma, Xiaohua
    Gao, Yuelin
    Liu, Xia
    NUMERICAL ALGORITHMS, 2024,
  • [42] A graph partitioning strategy for solving large-scale crew scheduling problems
    Silke Jütte
    Ulrich W. Thonemann
    OR Spectrum, 2015, 37 : 137 - 170
  • [43] A graph partitioning strategy for solving large-scale crew scheduling problems
    Juette, Silke
    Thonemann, Ulrich W.
    OR SPECTRUM, 2015, 37 (01) : 137 - 170
  • [44] PARALLEL EMPIRICAL STOCHASTIC BRANCH AND BOUND FOR LARGE-SCALE DISCRETE OPTIMIZATION VIA SIMULATION
    Rosen, Scott
    Salemi, Peter
    Wickham, Brian
    Williams, Ashley
    Harvey, Christine
    Catlett, Erin
    Taghiyeh, Sajjad
    Xu, Jie
    2016 WINTER SIMULATION CONFERENCE (WSC), 2016, : 626 - 637
  • [45] AN ADAPTIVE GRADIENT ALGORITHM FOR LARGE-SCALE NONLINEAR BOUND CONSTRAINED OPTIMIZATION
    Cheng, Wanyou
    Cao, Erbao
    ASIA-PACIFIC JOURNAL OF OPERATIONAL RESEARCH, 2013, 30 (03)
  • [46] Large-Scale Dynamic Graph Updating Algorithm in Distributed Computing System
    Rong Xuanyu
    Cui Huanqing
    PROCEEDINGS OF 2019 2ND INTERNATIONAL CONFERENCE ON BIG DATA TECHNOLOGIES (ICBDT 2019), 2019, : 248 - 251
  • [48] A Scalable Distributed Louvain Algorithm for Large-scale Graph Community Detection
    Zeng, Jianping
    Yu, Hongfeng
    2018 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2018, : 268 - 278
  • [49] A Sampling-Based Graph Clustering Algorithm for Large-Scale Networks
    Zhang J.-P.
    Chen H.-C.
    Wang K.
    Zhu K.-J.
    Wang Y.-W.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (08): : 1731 - 1737
  • [50] A parallel SVM training algorithm on large-scale classification problems
    Zhang, JP
    Li, ZW
    Yang, J
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 1637 - 1641