Unified Formal Construction and Isabelle Verification of the Dynamic Programming Algorithms for Biological Sequence Alignment

被引:0
|
作者
Shi, Haihe [1 ]
Lan, Sunwen [1 ]
Liu, Riming [1 ]
Shi, Haipeng [2 ]
Wang, Lan [1 ]
Zhong, Linhui [1 ]
机构
[1] School of Computer Information Engineering, Jiangxi Normal University, Nanchang,330022, China
[2] School of Software, Jiangxi Normal University, Nanchang,330022, China
基金
中国国家自然科学基金;
关键词
Application programs - Bioinformatics - Formal specification - Formal verification;
D O I
10.7544/issn1000-1239.202330698
中图分类号
学科分类号
摘要
Sequence alignment is a classical problem in biological sequence analysis, aiming to find out the similarity between sequences, which is of great significance for discovering functional, structural and evolutionary information in biological sequences. The problem can be divided into two categories: pairwise sequence alignment and multiple sequence alignment. The existing work is focused on specific algorithms, and no general solution is designed. In addition, there are few researches on the trustworthy of algorithms. By deeply analyzing the properties of sequence alignment problem and describing the essential characteristics of problem solving, a unified construction framework of sequence alignment dynamic programming algorithm seqAlign is designed based on the problem formal specification and formal method PAR. The process of constructing a multiple sequence alignment algorithm with three sequences by using the framework is further demonstrated, and the constructed results are formally verified by Isabelle theorem prover. Finally, the C++ executable program of the algorithm is generated by the code generation system of PAR platform. The process of mechanized construction of other sequence alignment algorithms using seqAlign framework is analyzed. Through strict specification refinement and formal verification, the reliability of the generated algorithm is effectively guaranteed. The developed seqAlign framework provides a general solution for the class of sequence alignment problems, which significantly improves the efficiency of generating sequence alignment algorithm families. The successful application of the designed seqAlign framework to sequence alignment problem in biological sequence analysis can provide a reference for the construction of highly reliable algorithms in complex bioinformatics field from the perspective of methodology and practice. © 2025 Science Press. All rights reserved.
引用
收藏
页码:119 / 131
相关论文
共 50 条
  • [21] A novel shape matching method using biological sequence dynamic alignment
    Zhang, SJ
    Ma, KK
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 343 - 346
  • [22] Multiple sequence alignment using modified dynamic programming and particle swarm optimization
    Juang, Wang-Sheng
    Su, Shun-Feng
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2008, 31 (04) : 659 - 673
  • [23] DASH: Localising dynamic programming for order of magnitude faster, accurate sequence alignment
    Gardner-Stephen, P
    Knowles, G
    2004 IEEE COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE, PROCEEDINGS, 2004, : 732 - 735
  • [24] Design and Implementation of Pairwise Sequence Alignment Algorithm Components Based on Dynamic Programming
    Shi H.
    Zhou W.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (09): : 1907 - 1917
  • [25] A BIT-PARALLEL DYNAMIC PROGRAMMING ALGORITHM SUITABLE FOR DNA SEQUENCE ALIGNMENT
    Kimura, Kouichi
    Koike, Asako
    Nakai, Kenta
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2012, 10 (04)
  • [26] Local sequence alignment using parallel memory-efficient dynamic programming
    Pichl, L
    Arai, M
    Hanabusa, K
    Hayashi, T
    Proceedings of the 8th Joint Conference on Information Sciences, Vols 1-3, 2005, : 1269 - 1272
  • [27] Memory-efficient dynamic programming backtrace and pairwise local sequence alignment
    Newberg, Lee A.
    BIOINFORMATICS, 2008, 24 (16) : 1772 - 1778
  • [28] MASH - AN INTERACTIVE PROGRAM FOR MULTIPLE ALIGNMENT AND CONSENSUS SEQUENCE CONSTRUCTION FOR BIOLOGICAL SEQUENCES
    CHAPPEY, C
    DANCKAERT, A
    DESSEN, P
    HAZOUT, S
    COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1991, 7 (02): : 195 - 202
  • [29] Unified formal derivation and automatic verification of three binary-tree traversal non-recursive algorithms
    You, Zhen
    Xue, Jinyun
    Zuo, Zhengkang
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (04): : 2145 - 2156
  • [30] Unified formal derivation and automatic verification of three binary-tree traversal non-recursive algorithms
    Zhen You
    Jinyun Xue
    Zhengkang Zuo
    Cluster Computing, 2016, 19 : 2145 - 2156