Assessment of BOLD and GenBank - Their accuracy and reliability for the identification of biological materials

被引:152
|
作者
Meiklejohn, Kelly A. [1 ,3 ]
Damaso, Natalie [1 ]
Robertson, James M. [2 ]
机构
[1] Fed Bur Invest Lab Div, Counterterrorism & Forens Sci Res Unit, Visiting Scientist Program, Quantico, VA USA
[2] Fed Bur Invest Lab Div, Counterterrorism & Forens Sci Res Unit, Quantico, VA USA
[3] North Carolina State Univ, Dept Populat Hlth & Pathobiol, Raleigh, NC 27695 USA
来源
PLOS ONE | 2019年 / 14卷 / 06期
关键词
INTERNAL TRANSCRIBED SPACER; DNA BARCODE; SEQUENCE DATABASES; REGION; COI; VALIDATION; SYSTEM; FUNGI; GENUS; ITS2;
D O I
10.1371/journal.pone.0217084
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Taxonomic identification of biological materials can be achieved through DNA barcoding, where an unknown "barcode" sequence is compared to a reference database. In many disciplines, obtaining accurate taxonomic identifications can be imperative (e.g., evolutionary biology, food regulatory compliance, forensics). The Barcode of Life DataSystems (BOLD) and GenBank are the main public repositories of DNA barcode sequences. In this study, an assessment of the accuracy and reliability of sequences in these databases was performed. To achieve this, 1) curated reference materials for plants, macro-fungi and insects were obtained from national collections, 2) relevant barcode sequences (rbcL, matK, trnH-psbA, ITS and COI) from these reference samples were generated and used for searching against both databases, and 3) optimal search parameters were determined that ensure the best match to the known species in either database. While GenBank outperformed BOLD for species-level identification of insect taxa (53% and 35%, respectively), both databases performed comparably for plants and macro-fungi (similar to 81% and similar to 57%, respectively). Results illustrated that using a multi-locus barcode approach increased identification success. This study outlines the utility of the BLAST search tool in GenBank and the BOLD identification engine for taxonomic identifications and identifies some precautions needed when using public sequence repositories in applied scientific disciplines.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Taxonomic identification accuracy from BOLD and GenBank databases using over a thousand insect DNA barcodes from Colombia
    Baena-Bejarano, Nathalie
    Reina, Catalina
    Martinez-Revelo, Diego Esteban
    Medina, Claudia A.
    Tovar, Eduardo
    Uribe-Soto, Sandra
    Neita-Moreno, Jhon Cesar
    Gonzalez, Mailyn A.
    [J]. PLOS ONE, 2023, 18 (04):
  • [2] BOLD and GenBank revisited - Do identification errors arise in the lab or in the sequence libraries?
    Pentinsaari, Mikko
    Ratnasingham, Sujeevan
    Miller, Scott E.
    Hebert, Paul D. N.
    [J]. PLOS ONE, 2020, 15 (04):
  • [3] Assessing the reliability of medicinal Dendrobium sequences in GenBank for botanical species identification
    Wu, Hoi-Yan
    Chan, Kwun-Tin
    But, Grace Wing-Chiu
    Shaw, Pang-Chui
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [4] Assessing the reliability of medicinal Dendrobium sequences in GenBank for botanical species identification
    Hoi-Yan Wu
    Kwun-Tin Chan
    Grace Wing-Chiu But
    Pang-Chui Shaw
    [J]. Scientific Reports, 11
  • [5] An assessment of the accuracy and reliability of hair identification of south-east Australian mammals
    Lobert, B
    Lumsden, L
    Brunner, H
    Triggs, B
    [J]. WILDLIFE RESEARCH, 2001, 28 (06) : 637 - 641
  • [6] Utility of GenBank and the Barcode of Life Data Systems (BOLD) for the identification of forensically important Diptera from Belgium and France
    Sonet, Gontran
    Jordaens, Kurt
    Braet, Yves
    Bourguignon, Luc
    Dupont, Erena
    Backeljau, Thierry
    De Meyer, Marc
    Desmyter, Stijn
    [J]. ZOOKEYS, 2013, (365) : 307 - 328
  • [7] Assessing the accuracy and reliability of satellite-derived precipitation products in the Kosi River basin (India<bold>)</bold>
    Singh, Aditya Kumar
    Singh, Vivekanand
    [J]. ENVIRONMENTAL MONITORING AND ASSESSMENT, 2024, 196 (07)
  • [8] ACCURACY IN ANALYSES OF BIOLOGICAL-MATERIALS
    MARCHANDISE, H
    [J]. FRESENIUS ZEITSCHRIFT FUR ANALYTISCHE CHEMIE, 1987, 326 (07): : 613 - 617
  • [9] ACCURACY AND RELIABILITY OF VISUAL POSTURAL ASSESSMENT
    ZIMNY, N
    EMERY, M
    [J]. PHYSICAL THERAPY, 1985, 65 (05): : 671 - 671
  • [10] Metrological assessment of the high-accuracy RNAA method for determination of cobalt in biological materials
    H. Polkowska-Motrenko
    B. Danko
    R. Dybczyński
    [J]. Analytical and Bioanalytical Chemistry, 2004, 379 : 221 - 226