FramePlus: aligning DNA to protein sequences

Cited by: 11
Authors
Halperin, E [1]
Faigler, S [1]
Gill-More, R [1]
Affiliation
[1] Compugen Ltd, IL-69512 Tel Aviv, Israel
DOI
10.1093/bioinformatics/15.11.867
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: Automated annotation of Expressed Sequence Tags (ESTs) is becoming increasingly important as EST databases continue to grow rapidly. A common approach to annotation is to align the gene fragments against well-documented databases of protein sequences. The sensitivity of the alignment algorithm is key to the success of such methods. Results: This paper introduces a new algorithm, FramePlus, for DNA-protein sequence alignment. The SCOP database was used to develop a general framework for testing the sensitivity of such alignment algorithms when searching large databases. Using this framework, the performance of FramePlus was found to be somewhat better than that of other algorithms in the presence of moderate and high rates of frameshift errors, and comparable to that of Translated Search in the absence of sequencing errors. Availability: The source code for FramePlus and the testing datasets are freely available at ftp.compugen.co.il/pub/research. Contact: raveh@compugen.co.il.
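To make the frameshift problem mentioned in the abstract concrete, the following is a minimal Python sketch of six-frame translation, the step on which a translated search is generally based. It is not the FramePlus algorithm and is not taken from the paper's source code; the function names (`translate`, `six_frame_translations`) and the example sequences are hypothetical, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code (assumed here), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate DNA in its first reading frame; unknown codons become 'X'."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frame_translations(dna: str) -> dict:
    """Map frame labels (+1..+3 forward, -1..-3 reverse) to protein strings."""
    rc = reverse_complement(dna)
    frames = {}
    for offset in range(3):
        frames[offset + 1] = translate(dna[offset:])
        frames[-(offset + 1)] = translate(rc[offset:])
    return frames


if __name__ == "__main__":
    original = "ATGGCCAAGGTT"          # hypothetical error-free fragment
    shifted = "ATGGCC" + "C" + "AAGGTT"  # same fragment with one inserted base
    print(translate(original))
    for frame, protein in sorted(six_frame_translations(shifted).items()):
        print(frame, protein)
```

In this toy example the inserted base moves the downstream codons from frame +1 to frame +2, so no single frame contains the whole protein. A plain translated search scores each frame separately, which is one way to see why frameshift-aware DNA-protein alignment can matter for error-prone EST data.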
Pages: 867-873
Page count: 7