An effective approach for analyzing "prefinished" genomic sequence data

被引:0
|
作者
Kuehl, PM
Weisemann, JM
Touchman, JW
Green, ED
Boguski, MS [1 ]
机构
[1] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
[2] NIH, Natl Human Genome Res Inst, Genome Technol Branch, Bethesda, MD 20892 USA
[3] Univ Maryland, Dept Mol & Cell Biol, Baltimore, MD 21201 USA
关键词
D O I
暂无
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence contigs ("prefinished" sequence) and more slowly produces the final, completely assembled sequence ("finished" sequence). Thus, in general, prefinished sequence is produced in excess of finished sequence, and this trend is certain to continue and even accelerate over the next few years. Even at a prefinished stage, genomic sequence represents a rich source of important biological information that is of great interest to many investigators. However, analyzing such data is a challenging and daunting task, both because of its sheer volume and because it can change on a day-by-day basis. To facilitate the discovery and characterization of genes and other important elements within prefinished sequence, we have developed an analytical strategy and system that uses readily available software tools in new combinations. Implementation of this strategy for the analysis of prefinished sequence data from human chromosome 7 has demonstrated that this is a convenient, inexpensive, and extensible solution to the problem of analyzing the large amounts of preliminary data being produced by large-scale sequencing efforts. Our approach is accessible to any investigator who wishes to assimilate additional information about particular sequence data en route to developing richer annotations of a finished sequence.
引用
收藏
页码:189 / 194
页数:6
相关论文
共 50 条
  • [21] Cyanobacterial ultrastructure in light of genomic sequence data
    Gonzalez-Esquer, C. R.
    Smarda, J.
    Rippka, R.
    Axen, S. D.
    Guglielmi, G.
    Gugger, M.
    Kerfeld, C. A.
    PHOTOSYNTHESIS RESEARCH, 2016, 129 (02) : 147 - 157
  • [22] Speciation: genomic sequence data and the biogeography of speciation
    Johnson, Norman A.
    NATIONAL SCIENCE REVIEW, 2022, 9 (12)
  • [23] Neural Architecture Search for Genomic Sequence Data
    Scheppach, Amadeu
    Guenduez, Hueseyin Anil
    Dorigatti, Emilio
    Muench, Philipp C.
    McHardy, Alice C.
    Bischl, Bernd
    Rezaei, Mina
    Binder, Martin
    2023 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, CIBCB, 2023, : 181 - 190
  • [24] Cyanobacterial ultrastructure in light of genomic sequence data
    C. R. Gonzalez-Esquer
    J. Smarda
    R. Rippka
    S. D. Axen
    G. Guglielmi
    M. Gugger
    C. A. Kerfeld
    Photosynthesis Research, 2016, 129 : 147 - 157
  • [25] Identification of lectins from genomic sequence data
    Drickamer, K
    Taylor, ME
    RECOGNITION OF CARBOHYDRATES IN BIOLOGICAL SYSTEMS PT A: GENERAL PROCEDURES, 2003, 362 : 560 - 567
  • [26] On the effective depth of viral sequence data
    Illingworth, Christopher J. R.
    Roy, Sunando
    Beale, Mathew A.
    Tutill, Helena
    Williams, Rachel
    Breuer, Judith
    VIRUS EVOLUTION, 2017, 3 (02)
  • [27] A fast genomic selection approach for large genomic data
    Hailan Liu
    Guo-Bo Chen
    Theoretical and Applied Genetics, 2017, 130 : 1277 - 1284
  • [28] A fast genomic selection approach for large genomic data
    Liu, Hailan
    Chen, Guo-Bo
    THEORETICAL AND APPLIED GENETICS, 2017, 130 (06) : 1277 - 1284
  • [29] ULTRA-effective labeling of tandem repeats in genomic sequence
    Olson, Daniel R.
    Wheeler, Travis J.
    BIOINFORMATICS ADVANCES, 2024, 4 (01):
  • [30] An Operator Theoretic Approach for Analyzing Sequence Neural Networks
    Naiman, Ilan
    Azencot, Omri
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9268 - 9276