A tool for aligning very similar DNA sequences

被引:0
|
作者
Chao, KM
Zhang, JH
Ostell, J
Miller, W
机构
[1] PENN STATE UNIV, DEPT COMP SCI & ENGN, UNIVERSITY PK, PA 16802 USA
[2] PROVIDENCE UNIV, DEPT COMP SCI & INFORMAT MANAGEMENT, TAICHUNG 43309, TAIWAN
[3] NIH, NATL CTR BIOTECHNOL INFORMAT, NATL LIB MED, BETHESDA, MD 20892 USA
来源
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Results: We have produced a computer program, named sim3, that solves the following computational problem. Two DNA sequences are given, where the shorter sequence is very similar to some contiguous region of the longer sequence. Sim3 determines such a similar region of the longer sequence, and then computes an optimal set of single-nucleotide changes (i.e., insertions, deletions or substitutions) that will convert the shorter sequence to that region. Thus, the alignment scoring scheme is designed to model sequencing errors, rather than evolutionary processes. The program can align a 100 kb sequence to a I megabase sequence in a few seconds on a workstation, provided that there are very few differences between the shorter sequence and some region in the longer sequence. The program has been used to assemble sequence data for the Genomes Division at the National Center for Biotechnology Information. Availability: A version of sim3 for UNIX machines can be obtained by anonymous ftp from ncbi. nlm. nih, gov, in the pub/sim3 directory. Contact: For portable versions for Macs and PCs, contact zjing@sunset. nlm. nih. gov.
引用
收藏
页码:75 / 80
页数:6
相关论文
共 50 条
  • [31] Aligning multiple sequences by genetic algorithm
    Liu, LF
    Huo, HW
    Wang, BS
    2004 INTERNATIONAL CONFERENCE ON COMMUNICATION, CIRCUITS, AND SYSTEMS, VOLS 1 AND 2: VOL 1: COMMUNICATION THEORY AND SYSTEMS, 2004, : 994 - 998
  • [32] Aligning sequences from multiple cameras
    Korah, T
    Rasmussen, C
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 941 - 944
  • [33] GLProbs: Aligning Multiple Sequences Adaptively
    Ye, Yongtao
    Cheung, David Wai-Lok
    Wang, Yadong
    Yiu, Siu-Ming
    Zhang, Qing
    Lam, Tak-Wah
    Ting, Hing-Fung
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (01) : 67 - 78
  • [34] Phylogenetic relationships of Chaetomium and similar genera based on ribosomal DNA sequences
    Lee, S
    Hanlin, RT
    MYCOLOGIA, 1999, 91 (03) : 434 - 442
  • [35] Aligning non-overlapping sequences
    Caspi, Y
    Irani, M
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2002, 48 (01) : 39 - 51
  • [36] Highly similar noncoding genomic DNA sequences: ultraconserved, or merely widespread?
    Hickey, Donal A.
    GENOME, 2008, 51 (05) : 396 - 397
  • [37] A Guided Dynamic Programming Approach for Searching a Set of Similar DNA Sequences
    Nordin, A. R. M.
    Osman, M. T. A.
    Yazid, M. S. M.
    Aziz, A.
    2009 SECOND INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF DIGITAL INFORMATION AND WEB TECHNOLOGIES (ICADIWT 2009), 2009, : 512 - +
  • [38] GS-aligner: A novel tool for aligning genomic sequences using bit-level operations
    Shih, ACC
    Li, WH
    MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (08) : 1299 - 1309
  • [39] CoVrimer: A tool for aligning SARS-CoV-2 primer sequences and selection of conserved/degenerate primers
    Vural-Ozdeniz, Merve
    Akturk, Aslinur
    Demirdizen, Mert
    Leka, Ronaldo
    Acar, Rana
    Konu, Ozlen
    GENOMICS, 2021, 113 (05) : 3174 - 3184
  • [40] INTEGRATION SITE PREFERENCES OF THE ALU FAMILY AND SIMILAR REPETITIVE DNA-SEQUENCES
    DANIELS, GR
    DEININGER, PL
    NUCLEIC ACIDS RESEARCH, 1985, 13 (24) : 8939 - 8954