FramePlus: aligning DNA to protein sequences

Cited by: 11
Authors
Halperin, E [1]
Faigler, S [1]
Gill-More, R [1]
Affiliations
[1] Compugen Ltd, IL-69512 Tel Aviv, Israel
DOI
10.1093/bioinformatics/15.11.867
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: Automated annotation of Expressed Sequence Tags (ESTs) is becoming increasingly important as EST databases continue to grow rapidly. A common approach to annotation is to align the gene fragments against well-documented databases of protein sequences. The sensitivity of the alignment algorithm is key to the success of such methods. Results: This paper introduces a new algorithm, FramePlus, for DNA-protein sequence alignment. The SCOP database was used to develop a general framework for testing the sensitivity of such alignment algorithms when searching large databases. Using this framework, the performance of FramePlus was found to be somewhat better than that of other algorithms in the presence of moderate and high rates of frameshift errors, and comparable to Translated Search in the absence of sequencing errors. Availability: The source code for FramePlus and the testing datasets are freely available at ftp.compugen.co.il/pub/research. Contact: raveh@compugen.co.il.
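To illustrate why frameshift errors matter for DNA-protein alignment, the following is a minimal sketch of six-frame translation, the step that a translated search typically relies on. It is not the FramePlus algorithm and is not taken from the paper; the function names and the example sequences are hypothetical, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: no
# alternative code tables are needed for this illustration).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna: str, offset: int = 0) -> str:
    """Translate complete codons starting at `offset`; unknown codons become 'X'."""
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def six_frame_translations(dna: str) -> dict:
    """Translate a DNA sequence in the three forward and three reverse frames."""
    dna = dna.upper()
    rc = reverse_complement(dna)
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(dna, offset)
        frames[f"-{offset + 1}"] = translate(rc, offset)
    return frames


if __name__ == "__main__":
    clean = "ATGGCCAAAGGT"      # hypothetical error-free fragment
    shifted = "ATGGCCTAAAGGT"   # same fragment with one inserted base (T)
    print(six_frame_translations(clean)["+1"])    # MAKG
    print(six_frame_translations(shifted)["+1"])  # MA*R
    print(six_frame_translations(shifted)["+2"])  # WPKG
```

In the shifted fragment, the start of the protein is readable only in frame +1 and its end only in frame +2, so a search that scores each translated frame independently sees two short pieces instead of one match. An aligner that allows frame changes inside a single alignment can, in principle, join them.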
Pages: 867-873
Page count: 7