RefSeq: an update on mammalian reference sequences

被引:708
|
作者
Pruitt, Kim D. [1 ]
Brown, Garth R. [1 ]
Hiatt, Susan M. [1 ]
Thibaud-Nissen, Francoise [1 ]
Astashyn, Alexander [1 ]
Ermolaeva, Olga [1 ]
Farrell, Catherine M. [1 ]
Hart, Jennifer [1 ]
Landrum, Melissa J. [1 ]
McGarvey, Kelly M. [1 ]
Murphy, Michael R. [1 ]
O'Leary, Nuala A. [1 ]
Pujar, Shashikant [1 ]
Rajput, Bhanu [1 ]
Rangwala, Sanjida H. [1 ]
Riddick, Lillian D. [1 ]
Shkeda, Andrei [1 ]
Sun, Hanzhen [1 ]
Tamez, Pamela [1 ]
Tully, Raymond E. [1 ]
Wallin, Craig [1 ]
Webb, David [1 ]
Weber, Janet [1 ]
Wu, Wendy [1 ]
DiCuccio, Michael [1 ]
Kitts, Paul [1 ]
Maglott, Donna R. [1 ]
Murphy, Terence D. [1 ]
Ostell, James M. [1 ]
机构
[1] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20894 USA
基金
美国国家卫生研究院;
关键词
DATABASE; RESOURCES;
D O I
10.1093/nar/gkt1114
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database is a collection of annotated genomic, transcript and protein sequence records derived from data in public sequence archives and from computation, curation and collaboration (http://www.ncbi.nlm.nih.gov/refseq/). We report here on growth of the mammalian and human subsets, changes to NCBI's eukaryotic annotation pipeline and modifications affecting transcript and protein records. Recent changes to NCBI's eukaryotic genome annotation pipeline provide higher throughput, and the addition of RNAseq data to the pipeline results in a significant expansion of the number of transcripts and novel exons annotated on mammalian RefSeq genomes. Recent annotation changes include reporting supporting evidence for transcript records, modification of exon feature annotation and the addition of a structured report of gene and sequence attributes of biological interest. We also describe a revised protein annotation policy for alternatively spliced transcripts with more divergent predicted proteins and we summarize the current status of the RefSeqGene project.
引用
收藏
页码:D756 / D763
页数:8
相关论文
共 50 条
  • [1] RefSeq: REFERENCE SEQUENCES FOR ORDERS OF FUNGI
    不详
    IMA FUNGUS, 2014, 5 (02) : 28 - 28
  • [2] Update on RefSeq microbial genomes resources
    Tatusova, Tatiana
    Ciufo, Stacy
    Federhen, Scott
    Fedorov, Boris
    McVeigh, Richard
    O'Neill, Kathleen
    Tolstoy, Igor
    Zaslavsky, Leonid
    NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) : D599 - D605
  • [3] NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy
    Pruitt, Kim D.
    Tatusova, Tatiana
    Brown, Garth R.
    Maglott, Donna R.
    NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) : D130 - D135
  • [4] RefSeq: an update on prokaryotic genome annotation and curation
    Haft, Daniel H.
    DiCuccio, Michael
    Badretdin, Azat
    Brover, Vyacheslav
    Chetvernin, Vyacheslav
    O'Neill, Kathleen
    Li, Wenjun
    Chitsaz, Farideh
    Derbyshire, Myra K.
    Gonzales, Noreen R.
    Gwadz, Marc
    Lu, Fu
    Marchler, Gabriele H.
    Song, James S.
    Thanki, Narmada
    Yamashita, Roxanne A.
    Zheng, Chanjuan
    Thibaud-Nissen, Francoise
    Geer, Lewis Y.
    Marchler-Bauer, Aron
    Pruitt, Kim D.
    NUCLEIC ACIDS RESEARCH, 2018, 46 (D1) : D851 - D860
  • [5] NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins
    Pruitt, Kim D.
    Tatusova, Tatiana
    Maglott, Donna R.
    NUCLEIC ACIDS RESEARCH, 2007, 35 : D61 - D65
  • [6] Update on cpnDB: a reference database of chaperonin sequences
    Vancuren, Sarah J.
    Hill, Janet E.
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2019,
  • [7] High efficiency on prediction of translation initiation site (TIS) of RefSeq sequences
    Nobre, Cristiane N.
    Ortega, J. Miguel
    Braga, Antonio de Padua
    ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, PROCEEDINGS, 2007, 4643 : 138 - +
  • [8] A survey of mRNA sequences with a non-AUG start codon in RefSeq database
    Tikole, Suhas
    Sankararamakrishnan, Ramasubbu
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2006, 24 (01): : 33 - 41
  • [9] NCBI RefSeq: reference sequence standards through 25 years of curation and annotation
    Goldfarb, Tamara
    Kodali, Vamsi K.
    Pujar, Shashikant
    Brover, Vyacheslav
    Robbertse, Barbara
    Farrell, Catherine M.
    Oh, Dong-Ha
    Astashyn, Alexander
    Ermolaeva, Olga
    Haddad, Diana
    Hlavina, Wratko
    Hoffman, Jinna
    Jackson, John D.
    Joardar, Vinita S.
    Kristensen, David
    Masterson, Patrick
    Mcgarvey, Kelly M.
    Mcveigh, Richard
    Mozes, Eyal
    Murphy, Michael R.
    Schafer, Susan S.
    Souvorov, Alexander
    Spurrier, Brett
    Strope, Pooja K.
    Sun, Hanzhen
    Vatsan, Anjana R.
    Wallin, Craig
    Webb, David
    Brister, J. Rodney
    Hatcher, Eneida
    Kimchi, Avi
    Klimke, William
    Marchler-Bauer, Aron
    Pruitt, Kim D.
    Thibaud-Nissen, Francoise
    Murphy, Terence D.
    NUCLEIC ACIDS RESEARCH, 2024, 53 (D1) : D243 - D257
  • [10] SCANellome V2: Update of the Primate Anellovirus Reference Sequences Database
    Laubscher, Florian
    Kaiser, Laurent
    Cordey, Samuel
    VIRUSES-BASEL, 2024, 16 (09):