An Empirical Bayes approach for the identification of long-range chromosomal interaction from Hi-C data

被引:0
|
作者
Zhang, Qi [1 ]
Xu, Zheng [2 ]
Lai, Yutong [3 ]
机构
[1] Univ New Hampshire, Dept Math & Stat, Durham, NH 03824 USA
[2] Wright State Univ, Dept Math & Stat, Dayton, OH 45435 USA
[3] ClinChoice, Ft Washington, PA 19034 USA
关键词
empirical Bayes; epigenetics; Hi-C; peak identification; RHODOPSIN KINASE GENE; MODEL; NULL; ARCHITECTURE; GENOME; MAP;
D O I
10.1515/sagmb-2020-0026
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Hi-C experiments have become very popular for studying the 3D genome structure in recent years. Identification of long-range chromosomal interaction, i.e., peak detection, is crucial for Hi-C data analysis. But it remains a challenging task due to the inherent high dimensionality, sparsity and the over-dispersion of the Hi-C count data matrix. We propose EBHiC, an empirical Bayes approach for peak detection from Hi-C data. The proposed framework provides flexible over-dispersion modeling by explicitly including the "true" interaction intensities as latent variables. To implement the proposed peak identification method (via the empirical Bayes test), we estimate the overall distributions of the observed counts semiparametrically using a Smoothed Expectation Maximization algorithm, and the empirical null based on the zero assumption. We conducted extensive simulations to validate and evaluate the performance of our proposed approach and applied it to real datasets. Our results suggest that EBHiC can identify better peaks in terms of accuracy, biological interpretability, and the consistency across biological replicates. The source code is available on Github (https:// github.com/QiZhangStat/EBHiC).
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [1] FastHiC: a fast and accurate algorithm to detect long-range chromosomal interactions from Hi-C data
    Xu, Zheng
    Zhang, Guosheng
    Wu, Cong
    Li, Yun
    Hu, Ming
    BIOINFORMATICS, 2016, 32 (17) : 2692 - 2695
  • [2] HiCEnterprise: identifying long range chromosomal contacts in Hi-C data
    Kranas, Hanna
    Tuszynska, Irina
    Wilczynski, Bartek
    PEERJ, 2021, 9
  • [3] A hidden Markov random field-based Bayesian method for the detection of long-range chromosomal interactions in Hi-C data
    Xu, Zheng
    Zhang, Guosheng
    Jin, Fulai
    Chen, Mengjie
    Furey, Terrence S.
    Sullivan, Patrick F.
    Qin, Zhaohui
    Hu, Ming
    Li, Yun
    BIOINFORMATICS, 2016, 32 (05) : 650 - 656
  • [4] Reconstructing A/B compartments as revealed by Hi-C using long-range correlations in epigenetic data
    Jean-Philippe Fortin
    Kasper D. Hansen
    Genome Biology, 16
  • [5] Reconstructing A/B compartments as revealed by Hi-C using long-range correlations in epigenetic data
    Fortin, Jean-Philippe
    Hansen, Kasper D.
    GENOME BIOLOGY, 2015, 16
  • [6] Inferring Radial Organization of Chromosomal Territories from HI-C Data
    Das, Priyojit
    Sanders, Jacob T.
    Shen, Tongye
    McCord, Rachel P.
    BIOPHYSICAL JOURNAL, 2020, 118 (03) : 549A - 549A
  • [7] Statistical Challenges in Analyzing Methylation and Long-Range Chromosomal Interaction Data
    Qin Z.
    Li B.
    Conneely K.N.
    Wu H.
    Hu M.
    Ayyala D.
    Park Y.
    Jin V.X.
    Zhang F.
    Zhang H.
    Li L.
    Lin S.
    Statistics in Biosciences, 2016, 8 (2) : 284 - 309
  • [8] Mapping long-range promoter contacts in human cells with high-resolution capture Hi-C
    Mifsud, Borbala
    Tavares-Cadete, Filipe
    Young, Alice N.
    Sugar, Robert
    Schoenfelder, Stefan
    Ferreira, Lauren
    Wingett, Steven W.
    Andrews, Simon
    Grey, William
    Ewels, Philip A.
    Herman, Bram
    Happe, Scott
    Higgs, Andy
    LeProust, Emily
    Follows, George A.
    Fraser, Peter
    Luscombe, Nicholas M.
    Osborne, Cameron S.
    NATURE GENETICS, 2015, 47 (06) : 598 - 606
  • [9] Mapping long-range promoter contacts in human cells with high-resolution capture Hi-C
    Borbala Mifsud
    Filipe Tavares-Cadete
    Alice N Young
    Robert Sugar
    Stefan Schoenfelder
    Lauren Ferreira
    Steven W Wingett
    Simon Andrews
    William Grey
    Philip A Ewels
    Bram Herman
    Scott Happe
    Andy Higgs
    Emily LeProust
    George A Follows
    Peter Fraser
    Nicholas M Luscombe
    Cameron S Osborne
    Nature Genetics, 2015, 47 : 598 - 606
  • [10] Mapping long-range contacts between risk loci and target genes in human diseases with Capture Hi-C
    Cao, Canhui
    Xu, Qian
    Lin, Shitong
    Zhi, Wenhua
    Lazare, Cordelle
    Meng, Yifan
    Wu, Ping
    Gao, Peipei
    Li, Kezhen
    Wei, Juncheng
    Wu, Peng
    Li, Guoliang
    CLINICAL AND TRANSLATIONAL MEDICINE, 2020, 10 (05):