Update on RefSeq microbial genomes resources
Cited by: 100
Authors:
Tatusova, Tatiana [1]
Ciufo, Stacy [1]
Federhen, Scott [1]
Fedorov, Boris [1]
McVeigh, Richard [1]
O'Neill, Kathleen [1]
Tolstoy, Igor [1]
Zaslavsky, Leonid [1]
Affiliations:
[1] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
Funding:
National Institutes of Health (USA);
Keywords:
DATABASE;
DOI:
10.1093/nar/gku1062
Chinese Library Classification (CLC):
Q5 [Biochemistry];
Q7 [Molecular Biology];
Discipline classification codes:
071010;
081704;
Abstract:
NCBI RefSeq genome collection (http://www.ncbi.nlm.nih.gov/genome) represents all three major domains of life: Eukarya, Bacteria and Archaea as well as Viruses. Prokaryotic genome sequences are the most rapidly growing part of the collection. During the year of 2014 more than 10 000 microbial genome assemblies have been publicly released bringing the total number of prokaryotic genomes close to 30 000. We continue to improve the quality and usability of the microbial genome resources by providing easy access to the data and the results of the pre-computed analysis, and improving analysis and visualization tools. A number of improvements have been incorporated into the Prokaryotic Genome Annotation Pipeline. Several new features have been added to RefSeq prokaryotic genomes data processing pipeline including the calculation of genome groups (clades) and the optimization of protein clusters generation using pan-genome approach.
Pages: D599-D605
Page count: 7