HiCDiff: single-cell Hi-C data denoising with diffusion models

被引:0
|
作者
Wang, Yanli [1 ]
Cheng, Jianlin [1 ]
机构
[1] Univ Missouri, NextGen Precis Hlth Inst, Dept Elect Engn & Comp Sci, Columbia, MO 65211 USA
关键词
single-cell Hi-C; diffusion model; deep learning; Hi-C data denoising; GENOMES;
D O I
10.1093/bib/bbae279
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The genome-wide single-cell chromosome conformation capture technique, i.e. single-cell Hi-C (ScHi-C), was recently developed to interrogate the conformation of the genome of individual cells. However, single-cell Hi-C data are much sparser than bulk Hi-C data of a population of cells, and noise in single-cell Hi-C makes it difficult to apply and analyze them in biological research. Here, we developed the first generative diffusion models (HiCDiff) to denoise single-cell Hi-C data in the form of chromosomal contact matrices. HiCDiff uses a deep residual network to remove the noise in the reverse process of diffusion and can be trained in both unsupervised and supervised learning modes. Benchmarked on several single-cell Hi-C test datasets, the diffusion models substantially remove the noise in single-cell Hi-C data. The unsupervised HiCDiff outperforms most supervised non-diffusion deep learning methods and achieves the performance comparable to the state-of-the-art supervised deep learning method in terms of multiple metrics, demonstrating that diffusion models are a useful approach to denoising single-cell Hi-C data. Moreover, its good performance holds on denoising bulk Hi-C data.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Unsupervised embedding of single-cell Hi-C data
    Liu, Jie
    Lin, Dejun
    Yardimci, Galip Gurkan
    Noble, William Stafford
    [J]. BIOINFORMATICS, 2018, 34 (13) : 96 - 104
  • [2] Single-cell Hi-C data analysis: safety in numbers
    Galitsyna, Aleksandra A.
    Gelfand, Mikhail S.
    [J]. BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
  • [3] Massively multiplex single-cell Hi-C
    Ramani, Vijay
    Deng, Xinxian
    Qiu, Ruolan
    Gunderson, Kevin L.
    Steemers, Frank J.
    Disteche, Christine M.
    Noble, William S.
    Duan, Zhijun
    Shendure, Jay
    [J]. NATURE METHODS, 2017, 14 (03) : 263 - +
  • [4] Massively multiplex single-cell Hi-C
    Ramani V.
    Deng X.
    Qiu R.
    Gunderson K.L.
    Steemers F.J.
    Disteche C.M.
    Noble W.S.
    Duan Z.
    Shendure J.
    [J]. Nature Methods, 2017, 14 (3) : 263 - 266
  • [5] Single-cell technologies meet Hi-C
    Hughes, Jim R.
    Davies, James O. J.
    [J]. NATURE GENETICS, 2024, : 1542 - 1543
  • [6] Scool: a new data storage format for single-cell Hi-C data
    Wolff, Joachim
    Abdennur, Nezar
    Backofen, Rolf
    Gruening, Bjorn
    [J]. BIOINFORMATICS, 2021, 37 (14) : 2053 - 2054
  • [7] GiniQC: a measure for quantifying noise in single-cell Hi-C data
    Horton, Connor A.
    Alver, Burak H.
    Park, Peter J.
    [J]. BIOINFORMATICS, 2020, 36 (09) : 2902 - 2904
  • [8] scHiCTools: A computational toolbox for analyzing single-cell Hi-C data
    Li, Xinjun
    Feng, Fan
    Pu, Hongxi
    Leung, Wai Yan
    Liu, Jie
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2021, 17 (05)
  • [9] Advances in methods and applications of single-cell Hi-C data analysis
    Gong, Haiyan
    Ma, Fuqiang
    Zhang, Xiaotong
    [J]. Shengwu Yixue Gongchengxue Zazhi/Journal of Biomedical Engineering, 2023, 40 (05): : 1033 - 1039
  • [10] Deep neural network models for cell type prediction based on single-cell Hi-C data
    Zhou, Bing
    Liu, Quanzhong
    Wang, Meili
    Wu, Hao
    [J]. BMC GENOMICS, 2024, 22 (SUPPL 5):