A binary search approach to whole-genome data analysis

Cited by: 7
Authors
Brodsky, Leonid [1]
Kogan, Simon [1]
BenJacob, Eshel [2]
Nevo, Eviatar [1]
Affiliations
[1] Univ Haifa, Inst Evolut, IL-31905 Haifa, Israel
[2] Tel Aviv Univ, Sch Phys & Astron, IL-69978 Tel Aviv, Israel
Keywords
genome segmentation; tiling array; next-generation sequencing; model-based analysis; tiling microarray; ChIP-seq; MAP
DOI
10.1073/pnas.1011134107
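Chinese Library Classification (CLC)
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification codes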
07 ; 0710 ; 09 ;
Abstract
A sequence analysis-oriented binary search-like algorithm was transformed into a sensitive and accurate analysis tool for processing whole-genome data. The advantage of the algorithm over previous methods is its ability to detect the margins of both short and long genome fragments, enriched by up-regulated signals, with equal accuracy. The score of an enriched genome fragment reflects the difference between the actual concentration of up-regulated signals in the fragment and the chromosome signal baseline. The "divide-and-conquer"-type algorithm detects a series of nonintersecting fragments of various lengths with locally optimal scores. The procedure is applied to detected fragments in a nested manner by recalculating the lower-than-baseline signals in the chromosome. The algorithm was applied to simulated whole-genome data, and its sensitivity and specificity were compared with those of several alternative algorithms. The algorithm was also tested with four biological tiling array datasets comprising Arabidopsis (i) expression and (ii) histone 3 lysine 27 trimethylation ChIP-on-chip datasets; Saccharomyces cerevisiae (iii) spliced intron data and (iv) chromatin remodeling factor binding sites. The results of these analyses demonstrate the power of the algorithm in identifying both the short up-regulated fragments (such as exons and transcription factor binding sites) and the long zones, even moderately up-regulated ones, at their precise genome margins. The algorithm generates an accurate whole-genome landscape that could be used for cross-comparison of signals across the same genome in evolutionary and general genomic studies.
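The abstract does not give the scoring formula or the search procedure, so the following is only an illustrative sketch of the general idea of finding nonintersecting, locally optimal enriched fragments by divide and conquer. It is not the authors' algorithm: it swaps in a plain maximum-sum segment scan (Kadane's algorithm) applied recursively to the regions left and right of each detected fragment. The function names, the `baseline` and `min_score` parameters, and the choice of "signal minus baseline" as the per-probe score are all assumptions made for illustration.

```python
from typing import List, Tuple


def best_segment(scores: List[float], lo: int, hi: int) -> Tuple[float, int, int]:
    """Kadane's scan over scores[lo:hi].

    Returns (sum, start, end) of the highest-scoring contiguous run,
    with `end` exclusive. Assumes lo < hi.
    """
    best_sum, best_start, best_end = float("-inf"), lo, lo
    run_sum, run_start = 0.0, lo
    for i in range(lo, hi):
        if run_sum <= 0:
            # A non-positive prefix cannot help; start a new run here.
            run_sum, run_start = scores[i], i
        else:
            run_sum += scores[i]
        if run_sum > best_sum:
            best_sum, best_start, best_end = run_sum, run_start, i + 1
    return best_sum, best_start, best_end


def enriched_fragments(
    signal: List[float], baseline: float, min_score: float = 0.0
) -> List[Tuple[int, int, float]]:
    """Return nonintersecting (start, end, score) fragments, left to right.

    Hypothetical scoring: each position contributes signal - baseline,
    so a fragment's score is its excess over the baseline.
    """
    scores = [x - baseline for x in signal]
    found: List[Tuple[int, int, float]] = []

    def recurse(lo: int, hi: int) -> None:
        if lo >= hi:
            return
        s, a, b = best_segment(scores, lo, hi)
        if s <= min_score:
            return  # nothing enriched in this region
        recurse(lo, a)           # search to the left of the fragment
        found.append((a, b, s))
        recurse(b, hi)           # search to the right of the fragment

    recurse(0, len(scores))
    return found
```

For example, `enriched_fragments([0, 0, 5, 6, 0, 0, 0, 3, 3, 3, 0], baseline=2)` returns `[(2, 4, 7), (7, 10, 3)]`: a short, strongly enriched fragment and a longer, moderately enriched one. The nested re-analysis of detected fragments mentioned in the abstract is not modeled here.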
Pages: 16893-16898
Page count: 6