Inference of 3D genome architecture by modeling overdispersion of Hi-C data

被引:6
|
作者
Varoquaux, Nelle [1 ]
Noble, William S. [2 ,3 ]
Vert, Jean-Philippe [4 ,5 ]
机构
[1] Univ Grenoble Alpes, TIMC, CNRS, F-38000 Grenoble, France
[2] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
[3] Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA 98195 USA
[4] Google Res, Brain Team, F-75009 Paris, France
[5] PSL Univ, Ctr Computat Biol, MINES Paris Tech, F-75006 Paris, France
基金
美国国家卫生研究院;
关键词
DIFFERENTIAL EXPRESSION ANALYSIS; REVEALS; ORGANIZATION; PRINCIPLES; MAP; DISPERSION;
D O I
10.1093/bioinformatics/btac838
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: We address the challenge of inferring a consensus 3D model of genome architecture from Hi-C data. Existing approaches most often rely on a two-step algorithm: first, convert the contact counts into distances, then optimize an objective function akin to multidimensional scaling (MDS) to infer a 3D model. Other approaches use a maximum likelihood approach, modeling the contact counts between two loci as a Poisson random variable whose intensity is a decreasing function of the distance between them. However, a Poisson model of contact counts implies that the variance of the data is equal to the mean, a relationship that is often too restrictive to properly model count data.Results: We first confirm the presence of overdispersion in several real Hi-C datasets, and we show that the overdispersion arises even in simulated datasets. We then propose a new model, called Pastis-NB, where we replace the Poisson model of contact counts by a negative binomial one, which is parametrized by a mean and a separate dispersion parameter. The dispersion parameter allows the variance to be adjusted independently from the mean, thus better modeling overdispersed data. We compare the results of Pastis-NB to those of several previously published algorithms, both MDS-based and statistical methods. We show that the negative binomial inference yields more accurate structures on simulated data, and more robust structures than other models across real Hi-C replicates and across different resolutions.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Mapping 3D genome architecture through in situ DNase Hi-C
    Ramani, Vijay
    Cusanovich, Darren A.
    Hause, Ronald J.
    Ma, Wenxiu
    Qiu, Ruolan
    Deng, Xinxian
    Blau, C. Anthony
    Disteche, Christine M.
    Noble, William S.
    Shendure, Jay
    Duan, Zhijun
    NATURE PROTOCOLS, 2016, 11 (11) : 59 - 76
  • [2] Mapping 3D genome architecture through in situ DNase Hi-C
    Vijay Ramani
    Darren A Cusanovich
    Ronald J Hause
    Wenxiu Ma
    Ruolan Qiu
    Xinxian Deng
    C Anthony Blau
    Christine M Disteche
    William S Noble
    Jay Shendure
    Zhijun Duan
    Nature Protocols, 2016, 11 : 2104 - 2121
  • [3] 3D Genome Reconstruction with ShRec3D+and Hi-C Data
    Li, Jiangeng
    Zhang, Wei
    Li, Xiaodan
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (02) : 460 - 468
  • [4] 3D Genome Reconstruction from Partially Phased Hi-C Data
    Diego Cifuentes
    Jan Draisma
    Oskar Henriksson
    Annachiara Korchmaros
    Kaie Kubjas
    Bulletin of Mathematical Biology, 2024, 86
  • [5] GeneRHi-C: 3D GENomE Reconstruction from Hi-C data
    MacKay, Kimberly
    Carlsson, Mats
    Kusalik, Anthony
    PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SYSTEMS-BIOLOGY AND BIOINFORMATICS (CSBIO 2019), 2019,
  • [6] 3D Genome Reconstruction from Partially Phased Hi-C Data
    Cifuentes, Diego
    Draisma, Jan
    Henriksson, Oskar
    Korchmaros, Annachiara
    Kubjas, Kaie
    BULLETIN OF MATHEMATICAL BIOLOGY, 2024, 86 (04)
  • [7] Deciphering Hi-C: from 3D genome to function
    Siyuan Kong
    Yubo Zhang
    Cell Biology and Toxicology, 2019, 35 : 15 - 32
  • [8] Integrating Hi-C and FISH data for modeling of the 3D organization of chromosomes
    Abbas, Ahmed
    He, Xuan
    Niu, Jing
    Zhou, Bin
    Zhu, Guangxiang
    Ma, Tszshan
    Song, Jiangpeikun
    Gao, Juntao
    Zhang, Michael Q.
    Zeng, Jianyang
    NATURE COMMUNICATIONS, 2019, 10 (1)
  • [9] Integrating Hi-C and FISH data for modeling of the 3D organization of chromosomes
    Ahmed Abbas
    Xuan He
    Jing Niu
    Bin Zhou
    Guangxiang Zhu
    Tszshan Ma
    Jiangpeikun Song
    Juntao Gao
    Michael Q. Zhang
    Jianyang Zeng
    Nature Communications, 10
  • [10] Deciphering Hi-C: from 3D genome to function
    Kong, Siyuan
    Zhang, Yubo
    CELL BIOLOGY AND TOXICOLOGY, 2019, 35 (01) : 15 - 32