Aligning DNA sequences to minimize the change in protein - (Extended abstract)

被引:0
|
作者
Hua, YF [1 ]
Jiang, T
Wu, B
机构
[1] IBM Canada, Dept 659, Toronto, ON M3C 1W3, Canada
[2] McMaster Univ, Dept Comp Sci, Hamilton, ON L8S 4K1, Canada
来源
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We study an alignment model for coding DNA sequences recently proposed by J. Hein that takes into account both DNA and protein information, and attempts to minimize the total amount of evolution at both DNA and protein levels. Assuming that the gap penalty function is affine, we design a quadratic time dynamic programming algorithm for the model. Although the algorithm theoretically solves an open question of Hein, its running time is impractical because of the large constant factor embedded in the quadratic time complexity function. We therefore consider a mild simplification of Hein's model and present a much more efficient algorithm for the simplified model. The algorithms have been implemented and tested on both real and simulated sequences, and it is found that they produce almost identical alignments in most cases.
引用
收藏
页码:221 / 234
页数:14
相关论文
共 50 条
  • [31] A Pattern Matching Extended Compression Algorithm for DNA Sequences
    Murugan, A.
    Punitha, K.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (08): : 196 - 202
  • [32] Effectively Predicting Protein Functions by Collective Classification - An Extended Abstract
    Xiong, Wei
    Liu, Hui
    Guan, Jihong
    Zhou, Shuigeng
    2012 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2012,
  • [33] Extended Abstract: Engineering Technical Communication Programs: A Change in Writing Instruction
    Scarff, Kelly
    2023 IEEE INTERNATIONAL PROFESSIONAL COMMUNICATION CONFERENCE, PROCOMM, 2023, : 57 - 59
  • [34] FAST ALIGNMENT OF DNA AND PROTEIN SEQUENCES
    LANDAU, GM
    VISHKIN, U
    NUSSINOV, R
    METHODS IN ENZYMOLOGY, 1990, 183 : 487 - 502
  • [35] Ordinal Regression with Explainable Distance Metric Learning Based on Ordered Sequences: Extended Abstract
    Luis Suarez, Juan
    Garcia, Salvador
    Herrera, Francisco
    2021 IEEE 8TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2021,
  • [36] DNA Sequences Are as Useful as Protein Sequences for Inferring Deep Phylogenies
    Kapli, Paschalia
    Kotari, Ioanna
    Telford, Maximilian J.
    Goldman, Nick
    Yang, Ziheng
    SYSTEMATIC BIOLOGY, 2023, 72 (05) : 1119 - 1135
  • [37] Approximate protein folding in the HP side chain model on extended cubic lattices - (Extended abstract)
    Heun, V
    ALGORITHMS - ESA'99, 1999, 1643 : 212 - 223
  • [38] An extended backus-system for the representation and analysis of DNA sequences
    Hofestaedt, R.
    Proceedings of the Fifth International Conference on Bioinformatics of Genome Regulation and Structure, Vol 1, 2006, : 48 - 51
  • [39] TOPAL: recombination detection in DNA and protein sequences
    McGuire, G
    Wright, F
    BIOINFORMATICS, 1998, 14 (02) : 219 - 220
  • [40] COMPUTER-ANALYSIS OF DNA AND PROTEIN SEQUENCES
    VONHEIJNE, G
    EUROPEAN JOURNAL OF BIOCHEMISTRY, 1991, 199 (02): : 253 - 256