Motivation: Searching DNA sequences against a DNA database is an essential element of sequence analysis. However, few systematic studies have been carried out to determine when a match between two DNA sequences has biological significance, and this limits the use that can be made of DNA searching algorithms.

Results: A test set of DNA sequences, consisting of artificially evolved and real sequences, has been constructed. This set has been used to test various database searching algorithms (BLAST, BLAST2, FASTA and Smith-Waterman) on a subset of the EMBL database. The results of this analysis have been used to determine the sensitivity and coverage of all of the algorithms. Guidelines have been produced which can be used to assess the significance of DNA database search results. The Smith-Waterman algorithm was shown to have the best coverage, but the worst sensitivity, whereas the default BLASTN algorithm (word length set to 11) was shown to have good sensitivity, but poor coverage. A sensible compromise between speed, sensitivity and coverage can be obtained using either the FASTA or BLAST (word length set to 6) algorithms. However, analysis of the results also showed that no algorithm works well when the length of the probe sequence is <200 bases. In general, matches can accurately be identified between coding regions of DNA sequences when there is >35% sequence identity between the corresponding proteins. Searching a DNA sequence against a DNA sequence database can, therefore, be a useful tool in sequence analysis.
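
The link between word length and coverage noted above can be illustrated with a minimal sketch. It assumes, as a simplification, that a word-based nucleotide search can only start a match where the query and the database sequence share an exact word of the chosen length. The function name `shared_words` and the two short sequences are hypothetical and are not taken from the study's test set.

```python
def shared_words(query: str, subject: str, word_length: int) -> set:
    """Return the exact words of the given length found in both sequences."""
    def words(seq: str) -> set:
        # Every overlapping substring of length `word_length`.
        return {seq[i:i + word_length] for i in range(len(seq) - word_length + 1)}
    return words(query) & words(subject)


# Hypothetical pair: the subject differs from the query by a substitution
# at every eighth base, so no identical run is longer than 7 bases.
query = "ACGTTGCAAGCTTAGGCATCGATC"
subject = "ACGTTGCTAGCTTAGCCATCGATG"

seeds_11 = shared_words(query, subject, 11)  # word length 11 (BLASTN default in the text)
seeds_6 = shared_words(query, subject, 6)    # word length 6 (the suggested compromise)

print(len(seeds_11), "shared words of length 11")
print(len(seeds_6) > 0, "-> at least one shared word of length 6")
```

In this toy pair the longer word finds no seed at all, while the shorter word does, which is consistent with the reported trade-off: a shorter word length can detect more diverged matches, at the cost of more candidate hits to evaluate.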