A binary search approach to whole-genome data analysis

Cited by: 7
Authors
Brodsky, Leonid [1]
Kogan, Simon [1]
BenJacob, Eshel [2]
Nevo, Eviatar [1]
Affiliations
[1] Univ Haifa, Inst Evolut, IL-31905 Haifa, Israel
[2] Tel Aviv Univ, Sch Phys & Astron, IL-69978 Tel Aviv, Israel
Keywords
genome segmentation; tiling array; next-generation sequencing; MODEL-BASED ANALYSIS; TILING MICROARRAY; CHIP-SEQ; MAP;
DOI
10.1073/pnas.1011134107
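Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code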
07 ; 0710 ; 09 ;
Abstract
A sequence analysis-oriented, binary search-like algorithm was transformed into a sensitive and accurate analysis tool for processing whole-genome data. The advantage of the algorithm over previous methods is its ability to detect the margins of both short and long genome fragments, enriched by up-regulated signals, with equal accuracy. The score of an enriched genome fragment reflects the difference between the actual concentration of up-regulated signals in the fragment and the chromosome signal baseline. The "divide-and-conquer"-type algorithm detects a series of nonintersecting fragments of various lengths with locally optimal scores. The procedure is applied to detected fragments in a nested manner by recalculating the lower-than-baseline signals in the chromosome. The algorithm was applied to simulated whole-genome data, and its sensitivity and specificity were compared with those of several alternative algorithms. The algorithm was also tested with four biological tiling array datasets comprising Arabidopsis (i) expression and (ii) histone 3 lysine 27 trimethylation ChIP-on-chip datasets; Saccharomyces cerevisiae (iii) spliced intron data and (iv) chromatin remodeling factor binding sites. The results demonstrate the power of the algorithm in identifying both the short up-regulated fragments (such as exons and transcription factor binding sites) and the long (even moderately up-regulated) zones at their precise genome margins. The algorithm generates an accurate whole-genome landscape that could be used for cross-comparison of signals across the same genome in evolutionary and general genomic studies.
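The abstract gives no implementation details, so the following is only a minimal illustrative sketch of the general idea it describes: score a fragment by how far its signal sits above a baseline, pick the best-scoring fragment, then recurse on the flanking regions to collect nonintersecting fragments. It is not the authors' algorithm. The function names (`best_fragment`, `segment`), the use of a maximum-sum subarray scan for scoring, and the `min_score` cutoff are all assumptions made for illustration, and the nested re-analysis inside detected fragments is not shown.

```python
from typing import List, Sequence, Tuple


def best_fragment(signal: Sequence[float], baseline: float,
                  lo: int, hi: int) -> Tuple[float, int, int]:
    """Return (score, start, end) of the best fragment in signal[lo:hi].

    The score is the sum of (signal - baseline) over the fragment, found
    with a maximum-sum subarray scan. `end` is exclusive.
    This scoring rule is an assumption, not the paper's definition.
    """
    best = (float("-inf"), lo, lo)
    run_sum, run_start = 0.0, lo
    for i in range(lo, hi):
        if run_sum <= 0.0:
            # A non-positive running sum cannot help; restart here.
            run_sum, run_start = 0.0, i
        run_sum += signal[i] - baseline
        if run_sum > best[0]:
            best = (run_sum, run_start, i + 1)
    return best


def segment(signal: Sequence[float], baseline: float, min_score: float,
            lo: int = 0, hi: int = None) -> List[Tuple[int, int, float]]:
    """Divide and conquer: take the best fragment, then recurse on both sides.

    Returns nonintersecting (start, end, score) fragments in genome order.
    `min_score` is a hypothetical cutoff that stops the recursion.
    """
    if hi is None:
        hi = len(signal)
    if lo >= hi:
        return []
    score, start, end = best_fragment(signal, baseline, lo, hi)
    if score < min_score:
        return []
    left = segment(signal, baseline, min_score, lo, start)
    right = segment(signal, baseline, min_score, end, hi)
    return left + [(start, end, score)] + right


# Toy probe intensities with one strong and one moderate enriched zone.
toy_signal = [0, 0, 5, 6, 5, 0, 0, 0, 0, 0, 3, 3, 0, 0]
fragments = segment(toy_signal, baseline=1.0, min_score=3.0)
print(fragments)
```

On the toy data the sketch reports the strong zone and the moderate zone as separate fragments because the gap between them costs more than the second zone adds. Each recursive call works on a strictly shorter region, so the recursion terminates; for chromosome-scale arrays an explicit stack would be a safer choice than Python recursion.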
Pages: 16893-16898
Page count: 6