Drosophila genomic sequence annotation using the BLOCKS plus database

被引:3
|
作者
Henikoff, JG [1 ]
Henikoff, S
机构
[1] Howard Hughes Med Inst, Seattle, WA 98109 USA
[2] Fred Hutchinson Canc Res Ctr, Seattle, WA 98109 USA
关键词
D O I
10.1101/gr.10.4.543
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A simple and general homology-based method for gene finding was applied to the 2.9-Mb Drosophila melanogaster Adh region, the target sequence of the Genome Annotation Assessment Project (GASP). Each strand of the entire sequence was used as query of the BLOCKS database of conserved regions of proteins. This led to functional assignments For more than one-third of the genes and two-thirds of the transposons. Considering the enormous size of the query, the fact that only two false-positive matches were reported emphasizes the high selectivity of protein family-based methods for gene finding. We used the search results to improve BLOCKS+ by identifying compositionally biased blocks. Our results confirm that protein family databases can be used effectively in automated sequence annotation efforts.
引用
收藏
页码:543 / 546
页数:4
相关论文
共 50 条
  • [1] Genomic sequence, splicing, and gene annotation
    Mount, SM
    AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 67 (04) : 788 - 792
  • [2] GAIA: Framework annotation of genomic sequence
    Bailey, LC
    Fischer, S
    Schug, J
    Crabtree, J
    Gibson, M
    Overton, GC
    GENOME RESEARCH, 1998, 8 (03) : 234 - 250
  • [3] Using blocks plus database in Needleman-Wunsch algorithm
    Du, ZH
    Lin, F
    NAFIPS 2004: ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY, VOLS 1AND 2: FUZZY SETS IN THE HEART OF THE CANADIAN ROCKIES, 2004, : 854 - 857
  • [4] Biological and computational annotation of the Drosophila genome sequence
    Rubin, G
    FASEB JOURNAL, 2001, 15 (05): : A1223 - A1223
  • [5] GARSA:: genomic analysis resources for sequence annotation
    Dávila, AMR
    Lorenzini, DM
    Mendes, PN
    Satake, TS
    Sousa, GR
    Campos, LM
    Mazzoni, CJ
    Wagner, G
    Pires, PF
    Grisard, EC
    Cavalcanti, MCR
    Campos, MLM
    BIOINFORMATICS, 2005, 21 (23) : 4302 - 4303
  • [6] CHPVDB - a sequence annotation database for Chandipura Virus
    Dikhit, Manas Ranjan
    Rana, Sindhu Prava
    Das, Pradeep
    Sahoo, Ganesh Chandra
    BIOINFORMATION, 2009, 3 (07) : 299 - 302
  • [7] UniSave: the UniProtKB Sequence/Annotation Version database
    Leinonen, R
    Nardone, F
    Zhu, WM
    Apweiler, R
    BIOINFORMATICS, 2006, 22 (10) : 1284 - 1285
  • [8] Annotation of a Newly Sequenced Genomic Region in Drosophila ananassae
    Lopez, Mateo Antonio
    Merkhofer, Evan
    FASEB JOURNAL, 2022, 36
  • [9] SELEX_DB: an activated database on selected randomized DNA/RNA sequences addressed to genomic sequence annotation
    Ponomarenko, JV
    Orlova, GV
    Ponomarenko, MP
    Lavryushev, SV
    Frolov, AS
    Zybova, SV
    Kolchanov, NA
    NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 205 - 208
  • [10] Complete sequence and genomic annotation of carrot torradovirus 1
    Rozado-Aguirre, Zurine
    Adams, Ian
    Fox, Adrian
    Dickinson, Matthew
    Boonham, Neil
    ARCHIVES OF VIROLOGY, 2017, 162 (09) : 2815 - 2819